Go to file
2018-03-15 16:09:59 +01:00
extractor Prototype rule based masses extractor. 2018-03-01 14:40:13 +01:00
parishwebsites Add converter of content field in jsonline from html to text. 2018-03-15 16:09:59 +01:00
scraper Code refactorings. 2018-03-01 18:16:11 +01:00
.gitignore Initial commit 2017-03-10 16:05:59 +01:00
environment.yml Switch to pure html download. Enhanced urls filtering. 2018-03-11 18:02:31 +01:00
LICENSE Initial commit 2017-03-10 16:05:59 +01:00
Makefile Switch to pure html download. Enhanced urls filtering. 2018-03-11 18:02:31 +01:00
plan.org Add prototype basic crawl 2017-11-21 22:51:09 +01:00
prepare-environment.sh Switch to pure html download. Enhanced urls filtering. 2018-03-11 18:02:31 +01:00
README.md Initial commit 2017-03-10 16:05:59 +01:00
temat.md Update temat.md 2017-03-14 17:11:33 +01:00

mass-scraper