dev-0 | ||
test-A | ||
train | ||
.gitignore | ||
config.txt | ||
postprocess.py | ||
README.md | ||
tau_fairseq.ipynb |
wmt-2020-pl-en
Translate from Polish to English.
This is a challenge created from http://www.statmt.org/wmt20/translation-task.html . Train set is created from europarl wmt pl-en training data. Dev and test set are created from wmt pl-en development data.
Directory structure
README.md
— this fileconfig.txt
— configuration filetrain/
— directory with training datatrain/train.tsv
— sample parallel corpus (Finnish text in the first column, Māori text in the second one)dev-0/
— directory with dev (test) datadev-0/in.tsv
— Finnish input text for the dev setdev-0/expected.tsv
— Māori reference translation for the dev settest-A
— directory with test datatest-A/in.tsv
— Finnish input data for the test settest-A/expected.tsv
— Māori reference translation for the test set