Go to file
2020-04-18 20:39:32 +02:00
linear_regression.py add linear_regression 2020-04-18 20:39:32 +02:00
README.md first commit 2020-04-17 09:08:00 +02:00

Guess the date of reddits (large edition)

Guess a reddit date based on its text. This is larger version with more reddits and subrredits (topics) than in https://gonito.net/challenge/guess-reddit-date.

Output label is FLOAT-YEAR, a human friendly timestamp. FLOAT-YEAR=1970 + posix_time/(60*60*24*365.25)

Sources

Data taken from https://archive.org/details/2015_reddit_comments_corpus.