Ryszard Staruch 2909b8e625 Add files
2024-05-10 14:40:58 +02:00

FCE - Grammatical error detection

Detect errors in English text.

This is a Gonito.net challenge based on data from https://ilexir.co.uk/datasets/index.html. The aim of the challenge is to predict which tokens are incorrect.

MultiLabel-F0.5 is used as the evaluation metric.
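As a rough illustration of what an F0.5 score over predicted token indexes looks like, the sketch below computes a micro-averaged F-beta with beta = 0.5, which weights precision more heavily than recall. It assumes each line is treated as a set of indexes and that counts are pooled over all lines; the official evaluator may aggregate differently, and the function names `f_beta` and `micro_f_beta` are hypothetical.

```python
def f_beta(tp, fp, fn, beta=0.5):
    """F-beta from true positive, false positive and false negative counts."""
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    b2 = beta * beta
    return (1 + b2) * precision * recall / (b2 * precision + recall)


def micro_f_beta(expected, predicted, beta=0.5):
    """Pool counts over all lines, then compute a single F-beta score.

    Assumption: `expected` and `predicted` are parallel lists with one
    collection of token indexes per input line.
    """
    tp = fp = fn = 0
    for exp, pred in zip(expected, predicted):
        exp, pred = set(exp), set(pred)
        tp += len(exp & pred)   # indexes flagged correctly
        fp += len(pred - exp)   # indexes flagged but not incorrect
        fn += len(exp - pred)   # incorrect indexes that were missed
    return f_beta(tp, fp, fn, beta)
```

With beta = 0.5, a system that flags only one of two incorrect tokens but makes no false alarms scores higher than one with the opposite trade-off.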

Directory structure

  • README.md — this file
  • config.txt — configuration file
  • train/ — directory with training data
  • train/in.tsv — Original input text for the train set
  • train/expected.tsv — Incorrect token indexes. Indexes start from 1.
  • dev-0/ — directory with dev data
  • dev-0/in.tsv — Original input text for the dev set
  • dev-0/expected.tsv — Incorrect token indexes. Indexes start from 1.
  • test-A/ — directory with test data
  • test-A/in.tsv — Original input text for the test set
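
To make the 1-based indexing concrete, here is a minimal sketch that maps one line of `expected.tsv` back to the tokens of the matching line of `in.tsv`. It assumes tokens are separated by whitespace and that the indexes on a line are also whitespace-separated; neither detail is stated above, so check the actual files first. The helper name `flagged_tokens` is hypothetical.

```python
def flagged_tokens(text, indexes_field):
    """Return the tokens of `text` that the given 1-based indexes point at.

    Assumptions: tokens are whitespace-separated, and `indexes_field`
    holds whitespace-separated integers (empty if no token is incorrect).
    """
    tokens = text.split()
    indexes = [int(i) for i in indexes_field.split()]
    # Indexes start from 1, so shift by one for Python's 0-based lists.
    return [tokens[i - 1] for i in indexes]


if __name__ == "__main__":
    print(flagged_tokens("I has a cat", "2"))
```

The same helper can be applied line by line to `train/in.tsv` and `train/expected.tsv` read in parallel, for example with `zip` over the two open files.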