siulkilulki 7da40e76ac Fix find_hours regex. Fix app.py (adapt to addition of utterances)
Change header of website in index.html
get_utterances.py: run with a parameter instead of an in-script filename
2018-05-16 20:33:32 +02:00
extractor Fix find_hours regex. Fix app.py (adapt to addition of utterances) 2018-05-16 20:33:32 +02:00
parishwebsites Working annotator. Without abuse handling, but logging actions. 2018-05-15 07:13:09 +02:00
scraper Modify error logging in get_parishes_url. Enhance crawl_deon.py 2018-04-06 23:33:18 +02:00
utils Working utterances getting/pickling 2018-05-14 01:51:40 +02:00
webapp Fix find_hours regex. Fix app.py (adapt to addition of utterances) 2018-05-16 20:33:32 +02:00
.gitignore Add basic wsgi app. Rename extractors, change directories. 2018-04-27 22:44:15 +02:00
annotator.py Add test.py for data gathering (data for annotation) 2018-05-11 23:12:21 +02:00
environment.yml Add test.py for data gathering (data for annotation) 2018-05-11 23:12:21 +02:00
extract_rule_based.py Working utterances getting/pickling 2018-05-14 01:51:40 +02:00
get_utterances.py Fix find_hours regex. Fix app.py (adapt to addition of utterances) 2018-05-16 20:33:32 +02:00
LICENSE Initial commit 2017-03-10 16:05:59 +01:00
Makefile Working utterances getting/pickling 2018-05-14 01:51:40 +02:00
prepare-environment.sh Switch to pure html download. Enhanced urls filtering. 2018-03-11 18:02:31 +01:00
README.md Update README.md 2018-04-06 23:43:14 +02:00
temat.md Update temat.md 2017-03-14 17:11:33 +01:00
wsgi.py Restructure code. Add frontend template. (logic to be done) 2018-05-04 23:25:07 +02:00

mass-scraper

Polish masses project. beeminder update