Updated README.md

This commit is contained in:
Przemysław Lipka 2019-01-17 14:12:44 +01:00
parent bf94068487
commit ea753a8c28

View File

@ -57,19 +57,17 @@ there should be no TAB there).
Evaluation metrics
------------------
Two evaluation metrics are used:
One evaluation metric is used:
* Soft-F1 - a "softer" version of F1 in which overlap is also counted
* BIO-F1, F1 metric on NER tags
Directory structure
-------------------
* `README.md` — this file
* `config.txt` — GEval configuration file
* `train/` — directory with training data (15/16 of the original train set)
* `train/` — directory with training data
* `train/train.tsv.xz` — train set
* `train/meta.tsv` — metadata (the original ID of the document), do **not* use during training,
this is just for reference (e.g. when you need to go back to the original document)
* `dev-0/` — directory with dev (test) data (split preserved from CoNLL-2003)
* `dev-0/in.tsv` — input data for the dev set
* `dev-0/expected.tsv` — expected (reference) data for the dev set