petite-difference-challenge2/dataset.md
2020-05-01 08:45:59 +02:00

# Dataset
The dataset in this submission is tokenized with [Moses](https://github.com/moses-smt/mosesdecoder/tree/master/scripts/tokenizer). Two scripts were used: `tokenizer.perl` (with the language set to `pl`) and `deescape-special-chars.perl`.
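
The preprocessing described above could be reproduced roughly as follows. This is only a sketch under assumptions not stated in this file: that Moses is checked out locally at the hypothetical path `mosesdecoder/`, that `perl` is on the `PATH`, that the de-escaping script was run on the tokenizer's output (the order is not given here), and that no options other than `-l pl` were passed. The helper names are illustrative.

```python
import subprocess
from pathlib import Path

# Assumption: local checkout of mosesdecoder; adjust to your own path.
MOSES_TOKENIZER_DIR = Path("mosesdecoder/scripts/tokenizer")


def build_commands(tokenizer_dir=MOSES_TOKENIZER_DIR, lang="pl"):
    """Return the two Moses commands as argument lists (hypothetical helper)."""
    tokenize = ["perl", str(tokenizer_dir / "tokenizer.perl"), "-l", lang]
    deescape = ["perl", str(tokenizer_dir / "deescape-special-chars.perl")]
    return tokenize, deescape


def tokenize_text(text, tokenizer_dir=MOSES_TOKENIZER_DIR, lang="pl"):
    """Tokenize text, then undo the tokenizer's escaping of special characters.

    Both scripts read from stdin and write to stdout. The order used here
    (tokenize first, de-escape second) is an assumption.
    """
    tokenize, deescape = build_commands(tokenizer_dir, lang)
    tokenized = subprocess.run(
        tokenize, input=text, capture_output=True, encoding="utf-8", check=True
    ).stdout
    return subprocess.run(
        deescape, input=tokenized, capture_output=True, encoding="utf-8", check=True
    ).stdout
```

Calling `tokenize_text("...")` on a line of raw text should return its tokenized form, provided the Moses scripts exist at the assumed location.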