Commit Graph

8 Commits

Author SHA1 Message Date
5e809efcce corrected tokenizer 2017-05-05 12:58:32 +02:00
bd73749388 new tokenizer 2017-04-26 17:02:18 +02:00
bbf3853d2a added lowercasing when tokenizing by space 2015-12-29 21:44:46 +01:00
0a8d2fdd39 tokenize by whitespace option 2015-12-27 20:54:40 +01:00
68fecaddf8 adding all tokenized examples 2015-08-19 20:49:26 +02:00
5a57406875 finished original word positions 2015-06-27 12:40:24 +02:00
9b1735516c working sentence tokenizer 2015-06-25 20:49:22 +02:00
8432dd321f tokenizer in progress 2015-06-25 10:12:51 +02:00