Go to file
Jakub Pokrywka bc4ebeaeab tfidflr 2021-11-06 14:29:51 +01:00
dev-0 tfidflr 2021-11-06 14:29:51 +01:00
mean_from_train mean from train 2021-11-05 15:38:15 +01:00
test-A tfidflr 2021-11-06 14:29:51 +01:00
tfidflr tfidflr 2021-11-06 14:29:51 +01:00
train init 2021-11-01 19:48:16 +01:00
.gitignore init 2021-11-01 19:48:16 +01:00
README.md init 2021-11-01 19:48:16 +01:00
config.txt init 2021-11-01 19:48:16 +01:00

README.md

twitter 140 year prediction

Dataset from paper "Twitter Sentiment Classification using Distant Supervision"

Dev test contains 100k samples from train set. Test set has deleted neutral samples and added 100k samples from train set.

Directory structure

  • README.md — this file
  • config.txt — configuration file
  • train/ — directory with training data
  • train/in.tsv — input data for the train set
  • train/expected.tsv — expected (reference) data for the train set
  • dev-0/ — directory with dev (test) data
  • dev-0/in.tsv — input data for the dev set
  • dev-0/expected.tsv — expected (reference) data for the dev set
  • test-A — directory with test data
  • test-A/in.tsv — input data for the test set
  • test-A/expected.tsv — expected (reference) data for the test set