concordia-library/tests/resources/tokenizer/named_entities.txt

6 lines
207 B
Plaintext
Raw Normal View History

[0-9]{1,2}[\.\-/][0-9]{1,2}[\.\-/][0-9]{4} ne_date
2017-04-27 10:37:29 +02:00
[0-9]{4}[\.\-/][0-9]{1,2}[\.\-/][0-9]{1,2} ne_date
[\w\._\d]+@\w+(\.\w+)* ne_email
2017-04-26 17:02:18 +02:00
[0-9]+[\.\)]([0-9]+\.)+ ne_bullet
\b[0-9]+([\.\,][0-9]+)?\b ne_number