# Dataset
The dataset in this submission is tokenized with the [moses](https://github.com/moses-smt/mosesdecoder/tree/master/scripts/tokenizer) tokenizer scripts: `tokenizer.perl` (with the language set to `pl`) and `deescape-special-chars.perl`.
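
The preprocessing described above can be sketched roughly as follows. This is only an illustration, not the exact command used for this submission: the `tokenizer_dir` checkout path, the input and output file names, the helper names `build_commands` and `tokenize_file`, and the order of the two scripts (tokenize first, then de-escape) are all assumptions. The only tokenizer option used is `-l`, which selects the language.

```python
import subprocess
from pathlib import Path


def build_commands(tokenizer_dir, lang="pl"):
    """Return the argv lists for the two Moses scripts, in assumed pipeline order.

    tokenizer_dir is a hypothetical path to a local copy of
    mosesdecoder/scripts/tokenizer.
    """
    tokenizer_dir = Path(tokenizer_dir)
    return [
        # Tokenize with the language set to `lang` (here: pl).
        ["perl", str(tokenizer_dir / "tokenizer.perl"), "-l", lang],
        # Turn escaped entities (e.g. &amp;) back into plain characters.
        ["perl", str(tokenizer_dir / "deescape-special-chars.perl")],
    ]


def tokenize_file(src_path, dst_path, tokenizer_dir, lang="pl"):
    """Pipe src_path through both scripts and write the result to dst_path."""
    tokenize_cmd, deescape_cmd = build_commands(tokenizer_dir, lang)
    with open(src_path, "rb") as src, open(dst_path, "wb") as dst:
        tokenizer = subprocess.Popen(tokenize_cmd, stdin=src, stdout=subprocess.PIPE)
        deescape = subprocess.Popen(deescape_cmd, stdin=tokenizer.stdout, stdout=dst)
        # Close our copy of the pipe so the tokenizer sees a broken pipe
        # if the second script exits early.
        tokenizer.stdout.close()
        deescape_rc = deescape.wait()
        tokenizer_rc = tokenizer.wait()
    if tokenizer_rc != 0 or deescape_rc != 0:
        raise RuntimeError(
            f"tokenization failed (tokenizer={tokenizer_rc}, deescape={deescape_rc})"
        )
```

A call such as `tokenize_file("in.tsv", "in.tok.tsv", "mosesdecoder/scripts/tokenizer")` would then write the tokenized text to `in.tok.tsv`; both file names are placeholders.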