dev-0 Using bigrams; clear text 2020-04-27 12:15:20 +02:00
test-A Using bigrams; clear text 2020-04-27 12:15:20 +02:00
train OMerge branch 'master' of git://gonito.net/paranormal-or-skeptic 2020-04-02 15:34:15 +02:00
.gitignore Updated with stopwords 2020-03-29 23:29:19 +02:00
config.txt Switch to probabilities 2020-04-20 14:53:12 +02:00
exp Bigram implemented 2020-03-29 13:39:47 +02:00
in Bigram implemented 2020-03-29 13:39:47 +02:00
in-header.tsv Updated basline 2020-03-22 10:15:36 +01:00
Makefile Using bigrams; clear text; more passes 2020-04-27 12:22:07 +02:00
model Added liniar regression 2020-04-06 10:41:14 +02:00
naive_bigram.pkl Updated with stopwords 2020-03-29 23:29:19 +02:00
out-header.tsv Updated basline 2020-03-22 10:15:36 +01:00
out.md5 fix predict.py 2020-03-22 12:56:42 +01:00
predict_baseline.py Updated with stopwords 2020-03-29 23:29:19 +02:00
predict_bigram.py Begin lin reg 2020-04-04 22:07:48 +02:00
predict.py Added liniar regression 2020-04-06 10:41:14 +02:00
prepare_data.py Using bigrams; clear text 2020-04-27 12:15:20 +02:00
README.md Switch to probabilities 2020-04-20 14:53:12 +02:00
train_baseline.py Bigram implemented 2020-03-29 13:39:47 +02:00
train_bigram.py Begin lin reg 2020-04-04 22:07:48 +02:00
train.py Added liniar regression 2020-04-06 10:41:14 +02:00
train.py_only_bi Updated with stopwords 2020-03-29 23:29:19 +02:00

Skeptic vs paranormal subreddits

Classify a Reddit text as coming from either the Skeptic subreddit or one of the "paranormal" subreddits (Paranormal, UFOs, TheTruthIsHere, Ghosts, Glitch-in-the-Matrix, conspiracytheories).

The output label is the probability that the text comes from a paranormal subreddit.
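As a rough illustration of what a probability output could look like, here is a minimal sketch. It assumes the classifier produces a raw log-odds score per text, where positive means "paranormal", and that predictions are written one value per line. The function names `to_probability` and `format_predictions` are hypothetical and are not taken from the scripts in this repository.

```python
import math


def to_probability(score: float) -> float:
    """Map a raw log-odds score to a probability with the logistic function.

    Assumption: positive scores favour the "paranormal" class.
    The two branches avoid overflow in math.exp for large |score|.
    """
    if score >= 0:
        return 1.0 / (1.0 + math.exp(-score))
    z = math.exp(score)
    return z / (1.0 + z)


def format_predictions(scores):
    """Render one probability per output line (assumed output layout)."""
    return [f"{to_probability(s):.4f}" for s in scores]


if __name__ == "__main__":
    # Hypothetical scores for three texts.
    for line in format_predictions([-2.0, 0.0, 3.5]):
        print(line)
```

A value near 1 then means the text most likely comes from one of the paranormal subreddits, and a value near 0 means it most likely comes from Skeptic.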

Sources

The data is taken from https://archive.org/details/2015_reddit_comments_corpus.