Shared Gonito challenge

Artur Nowakowski 2021-06-01 10:33:46 +02:00
commit bbf7c5f350
10 changed files with 102607 additions and 0 deletions

1
.gitignore vendored Normal file

@@ -0,0 +1 @@
*~

5
README.md Normal file

@@ -0,0 +1,5 @@
# Criminal snippets classification challenge
Guess whether a search engine snippet contains possibly criminal content.
The challenge's dataset consists of search engine snippets that have been identified by human annotators as potentially containing criminal content.
Labels were assigned solely on the basis of text content.

1
config.txt Normal file

@@ -0,0 +1 @@
--metric F1.0 --precision 5
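
The `--metric F1.0` setting above asks for an F-score with beta = 1. As a rough illustration of what that number measures, here is a minimal sketch; it assumes labels are encoded as 0/1 integers with 1 as the positive class (the diff does not show the label format), and that `--precision 5` means five fractional digits in the reported score. The function name `f_beta` is hypothetical and this is not the evaluator's own code.

```python
def f_beta(expected, predicted, beta=1.0):
    """F-beta score for the positive class; labels are assumed to be 0/1 ints."""
    pairs = list(zip(expected, predicted))
    tp = sum(1 for e, p in pairs if e == 1 and p == 1)  # true positives
    fp = sum(1 for e, p in pairs if e == 0 and p == 1)  # false positives
    fn = sum(1 for e, p in pairs if e == 1 and p == 0)  # false negatives
    if tp == 0:
        return 0.0  # avoids division by zero when nothing is correctly flagged
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    b2 = beta * beta
    return (1 + b2) * precision * recall / (b2 * precision + recall)


# Toy labels, invented for illustration only.
expected = [1, 0, 1, 1, 0]
predicted = [1, 0, 0, 1, 1]
score = round(f_beta(expected, predicted), 5)  # five fractional digits
print(score)  # 0.66667
```

With two true positives, one false positive and one false negative, precision and recall are both 2/3, so the score is also 2/3.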

10570
dev-0/expected.tsv Normal file

File diff suppressed because it is too large

BIN
dev-0/in.tsv.xz Normal file

Binary file not shown.

1
in-header.tsv Normal file

@@ -0,0 +1 @@
category text

1
out-header.tsv Normal file
View File

@@ -0,0 +1 @@
label
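
The header files above name the columns of the compressed input (`category`, `text`) and of the output (`label`). One possible way to pair those names with the rows of an `in.tsv.xz` file is sketched below, assuming the files are tab-separated, UTF-8 encoded, and carry no header row of their own. The helper `read_rows` and the sample row are hypothetical; the real category values are not visible in this diff.

```python
import csv
import io
import lzma


def read_rows(xz_bytes, header):
    """Decompress xz-compressed TSV bytes and map each row onto the header names."""
    text = lzma.decompress(xz_bytes).decode("utf-8")
    # QUOTE_NONE keeps quote characters inside snippets as literal text.
    reader = csv.reader(io.StringIO(text), delimiter="\t", quoting=csv.QUOTE_NONE)
    return [dict(zip(header, row)) for row in reader]


# Column names as listed in in-header.tsv (assumed to be tab-separated).
header = "category\ttext".split("\t")

# A made-up one-row file, compressed in memory so the sketch runs on its own.
sample = lzma.compress("news\tExample snippet text\n".encode("utf-8"))
rows = read_rows(sample, header)
print(rows[0]["text"])  # Example snippet text
```

For a real file, the bytes would come from something like `open("dev-0/in.tsv.xz", "rb").read()`.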

BIN
test-A/in.tsv.xz Normal file

Binary file not shown.

92028
train/expected.tsv Normal file

File diff suppressed because it is too large

BIN
train/in.tsv.xz Normal file

Binary file not shown.