djfz-2023-s464986/TaskG03/description.txt

Use regular expressions to extract lines containing Polish surnames (case-insensitive).
Download the lists of Polish male and female surnames from here:
* https://dane.gov.pl/pl/dataset/1681,nazwiska-osob-zyjacych-wystepujace-w-rejestrze-pesel/resource/35279/table?page=1&per_page=20&q=&sort=
* https://dane.gov.pl/pl/dataset/1681,nazwiska-osob-zyjacych-wystepujace-w-rejestrze-pesel/resource/22817/table?page=1&per_page=20&q=&sort=
Extract lines from stdin containing any of the surnames.
Match surnames regardless of letter case (case-insensitive).
The surname does not have to be surrounded by spaces or any other special characters.
Do not search for declined (inflected) forms of surnames.
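
The matching rules above can be sketched with the Python re library as follows. This is only an illustration under assumptions: the function names and the three sample surnames are hypothetical, and the real task would use the downloaded lists.

```python
import re
import sys


def build_pattern(surnames):
    """Compile one case-insensitive alternation over all surnames."""
    # re.escape guards against characters such as '-' or '.' in surnames.
    alternation = "|".join(re.escape(name) for name in surnames if name)
    return re.compile(alternation, re.IGNORECASE)


def matching_lines(lines, pattern):
    """Yield every line that contains at least one surname anywhere."""
    for line in lines:
        # search (not match), because the surname may sit in the middle of a word.
        if pattern.search(line):
            yield line


def main(stream_in=sys.stdin, stream_out=sys.stdout):
    # Hypothetical sample list, for illustration only.
    sample = ["Nowak", "Kowalski", "Wiśniewska"]
    stream_out.writelines(matching_lines(stream_in, build_pattern(sample)))
```

Calling main() at the bottom of a script would filter stdin to stdout. No word boundaries are used, so a line such as "xxnowakxx" is also reported, in line with the rule that the surname need not be delimited.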
Check both the NFA approach (e.g. the Python re library) and the DFA approach (Google RE2) and compare them.
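
One possible way to compare the two engines is to time the same search callable over the same input. The sketch below measures only the Python re library; the helper name time_search and the sample data are hypothetical. A compiled pattern from an RE2 binding could be timed the same way, assuming it exposes a compatible search method.

```python
import re
import time


def time_search(search, lines):
    """Return (seconds elapsed, number of matching lines) for one matcher."""
    start = time.perf_counter()
    hits = sum(1 for line in lines if search(line))
    return time.perf_counter() - start, hits


# Hypothetical sample data, for illustration only.
surnames = ["Nowak", "Kowalski"]
lines = ["Anna Kowalski", "no surname here", "NOWAKOWA"] * 1000

pattern = re.compile("|".join(map(re.escape, surnames)), re.IGNORECASE)
elapsed, hits = time_search(pattern.search, lines)
print(f"re: {hits} matching lines in {elapsed:.4f} s")
```

Both engines should report the same number of matching lines, so the hit count doubles as a sanity check before the timings are compared.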
Submit a solution based on the better method.
NOTE: You could extract the list of Polish surnames, save it to a file, and then commit the file to your repository.
NOTE: You may set max_mem to a higher value than the default in the re2 library.
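
For the first note, a saved surname file could be loaded as sketched below. This assumes a UTF-8 text file with one surname per line; the function name load_surnames and that file layout are assumptions, since the downloaded data may need converting into this form first.

```python
def load_surnames(path):
    """Read one surname per line, dropping blank lines and duplicates."""
    seen = set()
    surnames = []
    with open(path, encoding="utf-8") as handle:
        for raw in handle:
            name = raw.strip()
            # casefold makes the duplicate check case-insensitive.
            key = name.casefold()
            if name and key not in seen:
                seen.add(key)
                surnames.append(name)
    return surnames
```

Removing duplicates matters here because the male and female lists are likely to share many surnames, and a smaller alternation keeps the compiled pattern smaller.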
POINTS: 2
DEADLINE: 2024-01-27 23:59:59