This commit is contained in:
Jakub Pokrywka 2021-01-04 10:18:57 +01:00
parent d56d9b66ba
commit c7bf3feb47
3 changed files with 44909 additions and 0 deletions

23
TaskG04/description.txt Normal file
View File

@ -0,0 +1,23 @@
Use regular expressions to extract lines containing polish surnames. CASE INSENSITIVE
Download list of polish male and female surnames from here:
* https://dane.gov.pl/pl/dataset/568,nazwiska-wystepujace-w-rejestrze-pesel/resource/22810/table?page=1&per_page=20&q=&sort=
* https://dane.gov.pl/pl/dataset/568,nazwiska-wystepujace-w-rejestrze-pesel/resource/22812/table?page=1&per_page=20&q=&sort=
Extract lines from stdin containing any of the surnames.
Look only for surnames no matter casing (case insensitive).
The surname does not have to be surrounded by space or any other special characters.
Don't search for declined forms of surnames.
Check either NFA (e.g. re python library) and DFA (google re2) and compare them.
Submit solution based on a better method.
NOTE: You could extract the polish surnames list, save it to a file, then commit the file to your repository.
NOTE: You may set max_mem to a higher value than the default in re2 library.
POINTS: 2
DEADLINE: 2021-01-20 15:30:00

File diff suppressed because one or more lines are too long

View File

@ -0,0 +1 @@
../TaskA02/polish_wiki_excerpt.in