concordia-library/tests/resources/tokenizer/named_entities.txt
2015-06-25 10:12:51 +02:00

4 lines
117 B
Plaintext

[0-9]{1,2}[\.\-/][0-9]{1,2}[\.\-/][0-9]{4} ne_date
[\w\._\d]+@\w+(\.\w+)* ne_email
[0-9]+([\.\,][0-9]+)? ne_number