Filip Gralinski 2021-08-23 10:55:15 +02:00
commit b33be77ce1
11 changed files with 442565 additions and 0 deletions

8
.gitignore vendored Normal file

@@ -0,0 +1,8 @@
*~
*.swp
*.bak
*.pyc
*.o
.DS_Store
.token

9
README.md Normal file

@@ -0,0 +1,9 @@
Challenging America word-gap prediction
===================================
Guess a word in a gap.
Evaluation metric
-----------------
LikelihoodHashed is the metric

1
config.txt Normal file

@@ -0,0 +1 @@
--metric PerplexityHashed --precision 2 --in-header in-header.tsv --out-header out-header.tsv

10519
dev-0/expected.tsv Normal file

File diff suppressed because it is too large.

BIN
dev-0/in.tsv.xz Normal file

Binary file not shown.

1
in-header.tsv Normal file

@@ -0,0 +1 @@
FileId Year LeftContext RightContext

1
out-header.tsv Normal file

@@ -0,0 +1 @@
Word
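
The two header files above name the columns of the headerless data files. As a minimal sketch of how they might be used, the snippet below reads an xz-compressed `in.tsv.xz` and labels each row with the names from `in-header.tsv`. It assumes the data files are UTF-8 and tab-separated (suggested by the `.tsv` extension, not confirmed by this commit); `read_rows` is a hypothetical helper, not part of the challenge tooling.

```python
import io
import lzma

# Column names copied from in-header.tsv and out-header.tsv in this commit.
IN_HEADER = ["FileId", "Year", "LeftContext", "RightContext"]
OUT_HEADER = ["Word"]


def read_rows(source, header):
    """Yield one dict per line of an xz-compressed, tab-separated file.

    `source` may be a path or a binary file object. Assumption: fields
    are separated by tabs and the file is UTF-8 encoded.
    """
    with lzma.open(source, "rt", encoding="utf-8", newline="\n") as handle:
        for line in handle:
            fields = line.rstrip("\n").split("\t")
            # zip() pairs each column name with its field; extra fields,
            # if any, are silently dropped.
            yield dict(zip(header, fields))
```

With the repository checked out, `next(read_rows("dev-0/in.tsv.xz", IN_HEADER))` would return the first dev-set row as a dictionary. Splitting on tabs by hand rather than using the `csv` module avoids its quoting rules and field-size limit, which may matter for long context fields.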

BIN
test-A/in.tsv.xz Normal file

Binary file not shown.

432022
train/expected.tsv Normal file

File diff suppressed because it is too large.

BIN
train/in.tsv.xz Normal file

Binary file not shown.

4
train/train.tsv Normal file

@@ -0,0 +1,4 @@
Ala ma psa i kota
Basia ma psa
Nie kupujemy kota w worku
Czesia ma kota