2021-05-09 16:42:53 +02:00
|
|
|
Skeptic vs paranormal subreddits
|
|
|
|
================================
|
|
|
|
|
|
|
|
Classify a reddit as either from Skeptic subreddit or one of the
|
|
|
|
"paranormal" subreddits (Paranormal, UFOs, TheTruthIsHere, Ghosts,
|
|
|
|
,Glitch-in-the-Matrix, conspiracytheories).
|
|
|
|
|
|
|
|
Output label is the probability of a paranormal subreddit.
|
|
|
|
|
2021-05-25 11:17:30 +02:00
|
|
|
# Pytorch logistic regression
|
|
|
|
|
|
|
|
The code can be found in Logistic.py
|
2021-05-25 11:20:37 +02:00
|
|
|
|
2021-05-25 11:17:30 +02:00
|
|
|
Trained models end with .pth extension.
|
2021-05-25 11:20:37 +02:00
|
|
|
|
2021-05-25 11:17:30 +02:00
|
|
|
Geval results:
|
|
|
|
|
|
|
|
```
|
|
|
|
$ ./geval -t dev-0
|
|
|
|
Likelihood 0.0000
|
|
|
|
Accuracy 0.7043
|
|
|
|
F1.0 0.4950
|
|
|
|
Precision 0.6257
|
|
|
|
Recall 0.4094
|
|
|
|
```
|
|
|
|
|
2021-05-25 11:20:37 +02:00
|
|
|
Logs from training have been copy-pasted into `l1_epochs.txt` (for single-layer model) and `l2_epochs.txt (for two-layer model).
|
|
|
|
|
2021-05-25 11:17:30 +02:00
|
|
|
|
2021-05-09 16:42:53 +02:00
|
|
|
Sources
|
|
|
|
-------
|
|
|
|
|
|
|
|
Data taken from <https://archive.org/details/2015_reddit_comments_corpus>.
|