879 B
879 B
twitter 140 temporal word gap filling
Dataset from paper "Twitter Sentiment Classification using Distant Supervision"
Dev test contains 100k samples from train set. Test set has deleted neutral samples and added 100k samples from train set.
Directory structure
README.md
— this fileconfig.txt
— configuration filetrain/
— directory with training datatrain/in.tsv
— input data for the train settrain/expected.tsv
— expected (reference) data for the train setdev-0/
— directory with dev (test) datadev-0/in.tsv
— input data for the dev setdev-0/expected.tsv
— expected (reference) data for the dev settest-A
— directory with test datatest-A/in.tsv
— input data for the test settest-A/expected.tsv
— expected (reference) data for the test set