Go to file
2020-04-19 11:13:11 +02:00
dev-0 Linear regression TfIdf, dim reduced 2020-04-19 11:13:11 +02:00
test-A Linear regression TfIdf, dim reduced 2020-04-19 11:13:11 +02:00
train Init master 2020-04-07 00:18:10 +02:00
.gitignore Init master 2020-04-07 00:18:10 +02:00
config.txt Init master 2020-04-07 00:18:10 +02:00
in-header.tsv Init master 2020-04-07 00:18:10 +02:00
link_to_collab.txt Linear regression TfIdf, dim reduced 2020-04-19 11:13:11 +02:00
out-header.tsv Init master 2020-04-07 00:18:10 +02:00
README.md Init master 2020-04-07 00:18:10 +02:00
reddit_date.py Linear regression TfIdf, dim reduced 2020-04-19 11:13:11 +02:00

Guess the date of reddits (large edition)

Guess a reddit date based on its text. This is larger version with more reddits and subrredits (topics) than in https://gonito.net/challenge/guess-reddit-date.

Output label is FLOAT-YEAR, a human friendly timestamp. FLOAT-YEAR=1970 + posix_time/(60*60*24*365.25)

Sources

Data taken from https://archive.org/details/2015_reddit_comments_corpus.