Classification with RoBERTa (update README.md)

Karol Kaczmarek 2020-05-02 14:32:41 +02:00
parent 6d4bc5ec69
commit b4b30b6d63
2 changed files with 6 additions and 0 deletions

@@ -40,3 +40,7 @@ Each models should be trained in different directories or change `cache_dir` (in
- `train.py` - training script used for train classifier
- `eval.py` - evaluation script (using best model)
- concatenate validation sets: `paste expected.tsv in.tsv > valid.tsv`
# Branch:
- [tokenized-data](https://git.wmi.amu.edu.pl/s402227/petite-difference-challenge2/src/tokenized-data) - contains tokenized data only
- [roberta](https://git.wmi.amu.edu.pl/s402227/petite-difference-challenge2/src/roberta) - contains this submission with RoBERTa models

@@ -2,6 +2,8 @@
"He Said She Said" classification challenge (2nd edition)
=========================================================
**See [description](README-INTERNAL.md) for more details about this submission.**
Guess whether a text in Polish was written by a man or woman.
This challenge is based on the "He Said She Said" corpus for Polish.