dev-0 ISI-2 naive bayes, self-made tokenizer 2020-03-23 13:24:57 +01:00
test-A ISI-2 naive bayes, self-made tokenizer 2020-03-23 13:24:57 +01:00
train ISI-2 naive bayes, self-made tokenizer 2020-03-23 13:24:57 +01:00
.gitignore ISI-2 naive bayes, self-made tokenizer 2020-03-23 13:24:57 +01:00
config.txt ISI-1 rule-based, baseline 2020-03-09 12:18:18 +01:00
in-header.tsv Init 2020-02-23 17:39:42 +01:00
modelNB.txt ISI-2 naive bayes, self-made tokenizer 2020-03-23 13:24:57 +01:00
out-header.tsv Init 2020-02-23 17:39:42 +01:00
predictNB.py ISI-2 naive bayes, self-made tokenizer 2020-03-23 13:24:57 +01:00
README.md Init 2020-02-23 17:39:42 +01:00
scores.txt ISI-2 naive bayes, self-made tokenizer 2020-03-23 13:24:57 +01:00
solution.py ISI-2 naive bayes, self-made tokenizer 2020-03-23 13:24:57 +01:00
start.sh ISI-1 rule-based, baseline 2020-03-09 12:18:18 +01:00
startNB.sh ISI-2 naive bayes, self-made tokenizer 2020-03-23 13:24:57 +01:00
tokenize.py ISI-2 naive bayes, self-made tokenizer 2020-03-23 13:24:57 +01:00
trainNB.py ISI-2 naive bayes, self-made tokenizer 2020-03-23 13:24:57 +01:00

Skeptic vs paranormal subreddits

Classify a Reddit text as coming either from the Skeptic subreddit or from one of the "paranormal" subreddits (Paranormal, UFOs, TheTruthIsHere, Ghosts, Glitch-in-the-Matrix, conspiracytheories).

The output label is either S (Skeptic) or P (paranormal).
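
As an illustration of the task, here is a minimal sketch of a multinomial naive Bayes classifier that emits an S or P label for a piece of text. It is not the code in trainNB.py or predictNB.py; the function names (`tokenize`, `train`, `predict`), the regex tokenizer, the add-one smoothing, and the toy training examples are all assumptions made for this example.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Hypothetical tokenizer: lowercase, split on non-alphanumeric runs."""
    return [tok for tok in re.split(r"[^a-z0-9]+", text.lower()) if tok]


def train(examples):
    """Count documents and words per label.

    `examples` is an iterable of (label, text) pairs, label being "S" or "P".
    Both labels are assumed to occur at least once.
    """
    doc_counts = Counter()
    word_counts = {"S": Counter(), "P": Counter()}
    for label, text in examples:
        doc_counts[label] += 1
        word_counts[label].update(tokenize(text))
    vocab = set(word_counts["S"]) | set(word_counts["P"])
    return doc_counts, word_counts, vocab


def predict(model, text):
    """Return the label with the higher log-posterior (add-one smoothing)."""
    doc_counts, word_counts, vocab = model
    total_docs = sum(doc_counts.values())
    scores = {}
    for label in ("S", "P"):
        total_words = sum(word_counts[label].values())
        # Log prior of the label.
        score = math.log(doc_counts[label] / total_docs)
        # Log likelihood of each token, smoothed so unseen words are not zero.
        for tok in tokenize(text):
            score += math.log(
                (word_counts[label][tok] + 1) / (total_words + len(vocab))
            )
        scores[label] = score
    return max(scores, key=scores.get)


# Toy data invented for this sketch; the real data lives in train, dev-0 and test-A.
toy_examples = [
    ("S", "no evidence supports this claim"),
    ("S", "the study was debunked"),
    ("P", "i saw a ghost in my house"),
    ("P", "a ufo hovered over the lake"),
]
model = train(toy_examples)
label_paranormal = predict(model, "a ghost hovered in my house")
label_skeptic = predict(model, "no evidence for the claim")
```

Working in log space avoids numeric underflow on longer texts, and add-one smoothing keeps a single unseen word from zeroing out a label's score.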

Sources

Data taken from https://archive.org/details/2015_reddit_comments_corpus.