Guess the date of reddits (large edition)
Guess the date of a reddit based on its text. This is a larger version, with more reddits and subreddits (topics), than the one at https://gonito.net/challenge/guess-reddit-date.
Output label is FLOAT-YEAR, a human-friendly timestamp.
FLOAT-YEAR = 1970 + posix_time / (60*60*24*365.25)
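The formula above can be sketched in Python as follows. This is a minimal illustration, not necessarily how reddit_date.py does it; the function names `posix_to_float_year` and `float_year_to_posix` and the constant `SECONDS_PER_YEAR` are hypothetical and chosen only for this example.

```python
from datetime import datetime, timezone

# Average year length in seconds, as used in the FLOAT-YEAR formula.
SECONDS_PER_YEAR = 60 * 60 * 24 * 365.25


def posix_to_float_year(posix_time: float) -> float:
    """Convert POSIX time (seconds since 1970-01-01 UTC) to FLOAT-YEAR."""
    return 1970 + posix_time / SECONDS_PER_YEAR


def float_year_to_posix(float_year: float) -> float:
    """Invert the formula: FLOAT-YEAR back to POSIX time in seconds."""
    return (float_year - 1970) * SECONDS_PER_YEAR


if __name__ == "__main__":
    # Example: midnight UTC on 1 January 2015.
    ts = datetime(2015, 1, 1, tzinfo=timezone.utc).timestamp()
    print(posix_to_float_year(ts))
```

Because the formula uses an average year of 365.25 days, calendar boundaries do not map to whole numbers exactly: 1 January 2015 (UTC) comes out slightly below 2015.0, at roughly 2014.9993.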
Sources
Data taken from https://archive.org/details/2015_reddit_comments_corpus.