Init master

siulkilulki 2020-04-06 15:32:46 +02:00
commit ea55197312
10 changed files with 24 additions and 0 deletions

8
.gitignore vendored Normal file

@ -0,0 +1,8 @@
*~
*.swp
*.bak
*.pyc
*.o
.DS_Store
.token

13
README.md Normal file

@ -0,0 +1,13 @@
Guess the date of reddits (large edition)
================================
Guess a reddit date based on its text.
This is a larger version, with more reddits and subreddits (topics), than <https://gonito.net/challenge/guess-reddit-date>.
The output label is `FLOAT-YEAR`, a human-friendly timestamp.
`FLOAT-YEAR` = 1970 + `posix_time` / (60\*60\*24\*365.25)
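
The `FLOAT-YEAR` formula above can be sketched in Python as follows. This is only an illustration of the arithmetic stated in this README. The function names `posix_to_float_year` and `float_year_to_posix` are hypothetical and are not part of the challenge files.

```python
from datetime import datetime, timezone

# Seconds in an average (Julian) year, as used in the README formula.
SECONDS_PER_YEAR = 60 * 60 * 24 * 365.25


def posix_to_float_year(posix_time: float) -> float:
    """Convert POSIX seconds to FLOAT-YEAR (hypothetical helper name)."""
    return 1970 + posix_time / SECONDS_PER_YEAR


def float_year_to_posix(float_year: float) -> float:
    """Invert the formula: FLOAT-YEAR back to POSIX seconds."""
    return (float_year - 1970) * SECONDS_PER_YEAR


# Midnight UTC on 2015-01-01 as POSIX seconds.
ts = datetime(2015, 1, 1, tzinfo=timezone.utc).timestamp()
print(round(posix_to_float_year(ts), 4))  # prints 2014.9993
```

Because the formula uses a fixed year length of 365.25 days, calendar year boundaries do not land exactly on whole numbers, which is why midnight on 2015-01-01 maps to slightly less than 2015.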
Sources
-------
Data taken from <https://archive.org/details/2015_reddit_comments_corpus>.

1
config.txt Normal file

@ -0,0 +1 @@
--metric MSE --precision 4 --in-header in-header.tsv --out-header out-header.tsv
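
For reference, the `MSE` metric named in this configuration can be approximated locally with a minimal Python sketch. It assumes each `expected.tsv.xz` holds one `FLOAT-YEAR` label per line with no header row (the headers live in the separate header files), and that predictions sit in a file of the same shape. The helper names `read_labels` and `mean_squared_error` are hypothetical. Official scoring is done by the evaluation tool that reads `config.txt`, not by this code.

```python
import lzma


def read_labels(path):
    """Read one float label per line from an xz-compressed TSV file.

    Assumption: the label is in the first column and there is no header row.
    """
    with lzma.open(path, "rt", encoding="utf-8") as f:
        return [float(line.split("\t")[0]) for line in f if line.strip()]


def mean_squared_error(expected, predicted):
    """Plain mean squared error over two equal-length sequences."""
    if len(expected) != len(predicted):
        raise ValueError("expected and predicted differ in length")
    if not expected:
        raise ValueError("no labels to score")
    return sum((e - p) ** 2 for e, p in zip(expected, predicted)) / len(expected)


# In-memory example: squared errors are 0.25 and 1.0, so the mean is 0.625.
print(round(mean_squared_error([2012.5, 2014.0], [2012.0, 2015.0]), 4))  # prints 0.625
```

To score a dev-set submission, one would call `read_labels("dev-0/expected.tsv.xz")` and compare the result against predictions loaded the same way from a hypothetical output file.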

BIN
dev-0/expected.tsv.xz Normal file

Binary file not shown.

BIN
dev-0/in.tsv.xz Normal file

Binary file not shown.

1
in-header.tsv Normal file

@ -0,0 +1 @@
PostText

1
out-header.tsv Normal file
View File

@ -0,0 +1 @@
Label

BIN
test-A/in.tsv.xz Normal file

Binary file not shown.

BIN
train/expected.tsv.xz Normal file

Binary file not shown.

BIN
train/in.tsv.xz Normal file

Binary file not shown.