magisterka/abstract.tex

10 lines
547 B
TeX
Raw Normal View History

2018-02-01 23:30:49 +01:00
\chapter*{Abstract}
2018-06-22 07:28:04 +02:00
The thesis presents the process of creating a system for extracting information
about opening hours
of holy masses. The methods of collecting data of Polish parishes are being
described, especially the process of creating spiders. Then two methods are shown
for the extraction of opening hours of masses: a rule-based method and
machine learning-based method. More attention is devoted to machine
learning-based method that uses a text classifier.
2018-02-01 23:30:49 +01:00
2018-06-22 07:28:04 +02:00
\textbf{Key words:} information extraction, web spidering, text classification