Go to file
siulkilulki c99c218436 Fix mcc score.
Add comments to makefile.
Fix get_utterances condition.
Adjust craweler settings.
Change split-data script
2018-06-18 14:56:41 +02:00
extractor Add redis stats, helper script. 2018-05-24 12:56:02 +02:00
parishwebsites Fix mcc score. 2018-06-18 14:56:41 +02:00
scraper Modifiy error logging in get_parishes_url. Enhance crawl_deon.py 2018-04-06 23:33:18 +02:00
utils Working utterances getting/pickling 2018-05-14 01:51:40 +02:00
webapp Add redis data structures description. Handle banned users. 2018-05-26 19:07:08 +02:00
.gitignore Add basic wsgi app. Rename extractors, change directories. 2018-04-27 22:44:15 +02:00
LICENSE Initial commit 2017-03-10 16:05:59 +01:00
Makefile Fix mcc score. 2018-06-18 14:56:41 +02:00
README.md Update README.md 2018-04-06 23:43:14 +02:00
annotator.py Add test.py for data gathering (data for annotation) 2018-05-11 23:12:21 +02:00
annotator_console.py First version of ml hour classificator. 2018-05-28 15:10:31 +02:00
environment.yml Add test.py for data gathering (data for annotation) 2018-05-11 23:12:21 +02:00
evaluate.py Fix mcc score. 2018-06-18 14:56:41 +02:00
extract_rule_based.py Working utterances getting/pickling 2018-05-14 01:51:40 +02:00
get_utterances.py Fix mcc score. 2018-06-18 14:56:41 +02:00
prepare-environment.sh Switch to pure html download. Enhanced urls filtering. 2018-03-11 18:02:31 +01:00
split-data.sh Fix mcc score. 2018-06-18 14:56:41 +02:00
temat.md Update temat.md 2017-03-14 17:11:33 +01:00
todos.org First version of ml hour classificator. 2018-05-28 15:10:31 +02:00
tsv2fasttext.py First version of ml hour classificator. 2018-05-28 15:10:31 +02:00
wsgi.py Restructure code. Add frontend template. (logic to be done) 2018-05-04 23:25:07 +02:00

README.md

mass-scraper

Polish masses project. beeminder update