Commit Graph

7 Commits

Author SHA1 Message Date
7da40e76ac Fix find_hours regex. Fix app.py (adapt to addtion of utterances)
change header of website in index.html
get_utterances.py run with paramter intead of in script filename
2018-05-16 20:33:32 +02:00
95491b20a7 Working annotator. Without abuse handling, but logging actions.
Modify find_hours
Modify get_utterances
Add missing parish2text-commands.sh
workin app.py
add hash.min.js (fingerpirntjs)
modify index.html, make it prettier, add functions and more
2018-05-15 07:13:09 +02:00
1f6b1e6ffe Working utterances getting/pickling
Working converting parishes from html2text.
Add makefile parish2text goal.
Change to non-html(text) parishes in extract_rule_based and get_utterances
Enhance find_hours.py
Wrap render_template in make_response in webapp/app.py
2018-05-14 01:51:40 +02:00
382666c563 Add test.py for data gathering (data for annotation)
Small changes to annotator.py (to be deleted in near future)
Add utils/iterator
Add redis to enviroment.yml
Rename, adapt and move rule based extractor.
Adapt find_hours.
Yapify webapp app (probalby nothing more)
Rename buttons in index.html
2018-05-11 23:12:21 +02:00
6982ac2e59 Add basic wsgi app. Rename extractors, change directories.
Add gunicorn and flask to environment.yml
Update .gitignore
2018-04-27 22:44:15 +02:00
9b76f4e8aa Add robust recrawling of not completed data.
Add annotator.py (highlighing hout within context done)
Enhance parish2text.py (enable more flags, convert button)
2018-04-16 23:54:03 +02:00
Dawid Jurkiewicz
8b72d0b351 Prototype rule based masses extractor.
Added spider.
Started working on testsets.
2018-03-01 14:40:13 +01:00