mass-scraper/parishwebsites/remove_duplicate_commands.py
siulkilulki 9b76f4e8aa Add robust recrawling of not completed data.
Add annotator.py (highlighing hout within context done)
Enhance parish2text.py (enable more flags, convert button)
2018-04-16 23:54:03 +02:00

14 lines
217 B
Python
Executable File

#!/usr/bin/env python3
import sys
import re
d = {}
for line in sys.stdin:
line = line.rstrip('\n')
id_ = re.search('"(data/.*)" 2>', line).group(1)
d[id_] = line
for line in d.values():
print(line)