Add files

This commit is contained in:
Ryszard Staruch 2024-05-10 14:40:58 +02:00
commit 2909b8e625
8 changed files with 64658 additions and 0 deletions

8
.gitignore vendored Normal file
View File

@ -0,0 +1,8 @@
*~
*.swp
*.bak
*.pyc
*.o
.DS_Store
.token

23
README.md Normal file
View File

@ -0,0 +1,23 @@
FCE - Grammatical error detection
===========================
Detect errors in english text.
This is a Gonito.net challenge based on data from https://ilexir.co.uk/datasets/index.html
The aim of the challenge is to predict which tokens are incorrect.
MultiLabel-F0.5 is used as the evaluation metric.
## Directory structure
* `README.md` — this file
* `config.txt` — configuration file
* `train/` — directory with training data
* `train/in.tsv` — Original input text for the train set
* `train/expected.tsv` — Incorrect token indexes. Indexes start from 1.
* `dev-0/` — directory with dev data
* `dev-0/in.tsv` — Original input text for the dev set
* `dev-0/expected.tsv` — Incorrect token indexes. Indexes start from 1.
* `test-A` — directory with test data
* `test-A/in.tsv` — Original input text for the test set

1
config.txt Normal file
View File

@ -0,0 +1 @@
--metric Mean/MultiLabel-F0.5:N<Mean>N<F0.5> --metric MultiLabel-F0.5:P<2>N<Total>N<F0.5> --metric MultiLabel-F0.0:P<2>N<Precision> --metric MultiLabel-F99999.0:P<2>N<Recall> --precision 4 -%

2222
dev-0/expected.tsv Normal file

File diff suppressed because it is too large Load Diff

2222
dev-0/in.tsv Normal file

File diff suppressed because it is too large Load Diff

2720
test-A/in.tsv Normal file

File diff suppressed because it is too large Load Diff

28731
train/expected.tsv Normal file

File diff suppressed because it is too large Load Diff

28731
train/in.tsv Normal file

File diff suppressed because it is too large Load Diff