Add files
This commit is contained in:
commit
762845cc6d
8
.gitignore
vendored
Normal file
8
.gitignore
vendored
Normal file
@ -0,0 +1,8 @@
|
||||
|
||||
*~
|
||||
*.swp
|
||||
*.bak
|
||||
*.pyc
|
||||
*.o
|
||||
.DS_Store
|
||||
.token
|
32
README.md
Normal file
32
README.md
Normal file
@ -0,0 +1,32 @@
|
||||
FCE - Grammatical error detection
|
||||
===========================
|
||||
|
||||
Detect errors in English text.
|
||||
|
||||
This is a Gonito.net challenge based on data from https://ilexir.co.uk/datasets/index.html
|
||||
The aim of the challenge is to predict which tokens are incorrect.
|
||||
|
||||
MultiLabel-F0.5 is used as the evaluation metric.
|
||||
|
||||
Dataset reference:
|
||||
1) Compositional Sequence Labeling Models for Error Detection in Learner Writing
|
||||
Marek Rei and Helen Yannakoudakis
|
||||
In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (ACL-2016)
|
||||
|
||||
2) A New Dataset and Method for Automatically Grading ESOL Texts
|
||||
Helen Yannakoudakis, Ted Briscoe and Ben Medlock
|
||||
In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics (ACL-2011)
|
||||
|
||||
## Directory structure
|
||||
|
||||
* `README.md` — this file
|
||||
* `config.txt` — configuration file
|
||||
* `train/` — directory with training data
|
||||
* `train/in.tsv` — Original input text for the train set
|
||||
* `train/expected.tsv` — Incorrect token indexes. Indexes start from 1.
|
||||
* `dev-0/` — directory with dev data
|
||||
* `dev-0/in.tsv` — Original input text for the dev set
|
||||
* `dev-0/expected.tsv` — Incorrect token indexes. Indexes start from 1.
|
||||
* `test-A` — directory with test data
|
||||
* `test-A/in.tsv` — Original input text for the test set
|
||||
|
1
config.txt
Normal file
1
config.txt
Normal file
@ -0,0 +1 @@
|
||||
--metric Mean/MultiLabel-F0.5:N<Mean>N<F0.5> --metric MultiLabel-F0.5:P<2>N<Total>N<F0.5> --metric MultiLabel-F0.0:P<2>N<Precision> --metric MultiLabel-F99999.0:P<2>N<Recall> --precision 4 -%
|
2222
dev-0/expected.tsv
Normal file
2222
dev-0/expected.tsv
Normal file
File diff suppressed because it is too large
Load Diff
2222
dev-0/in.tsv
Normal file
2222
dev-0/in.tsv
Normal file
File diff suppressed because it is too large
Load Diff
2222
dev-0/out.tsv
Normal file
2222
dev-0/out.tsv
Normal file
File diff suppressed because it is too large
Load Diff
2720
test-A/in.tsv
Normal file
2720
test-A/in.tsv
Normal file
File diff suppressed because it is too large
Load Diff
2720
test-A/out.tsv
Normal file
2720
test-A/out.tsv
Normal file
File diff suppressed because it is too large
Load Diff
28731
train/expected.tsv
Normal file
28731
train/expected.tsv
Normal file
File diff suppressed because it is too large
Load Diff
28731
train/in.tsv
Normal file
28731
train/in.tsv
Normal file
File diff suppressed because it is too large
Load Diff
Loading…
Reference in New Issue
Block a user