Commit Graph

23 Commits

Author SHA1 Message Date
0a8d2fdd39 tokenize by whitespace option 2015-12-27 20:54:40 +01:00
1adabf4833 add index path as required argument to concordia constructor 2015-10-16 22:14:11 +02:00
68fecaddf8 adding all tokenized examples 2015-08-19 20:49:26 +02:00
5a57406875 finished original word positions 2015-06-27 12:40:24 +02:00
dba70b4e24 done word positions 2015-06-26 22:50:53 +02:00
724bf0d080 new responsibilities of tokenized sentence 2015-06-26 15:38:24 +02:00
8432dd321f tokenizer in progress 2015-06-25 10:12:51 +02:00
0baf3e4ef2 character intervals in progress 2015-06-22 13:52:56 +02:00
07d5d4438b clear index, examples 2015-05-04 20:40:44 +02:00
0d4bdf12de removed using namespace std
Former-commit-id: dbb5129e1f94d83eca887ada0f89d6bb45250f1e
2015-04-15 14:14:10 +02:00
3a03b01f42 std vectors
Former-commit-id: 5816e87c856f7edc242cc707851a0e2ad05aeb38
2015-04-15 10:55:26 +02:00
2533fd5b44 extended markers - length, bitwise operators
Former-commit-id: 948a7fc68bf0b2284ce631d877fc13fa3eaa4882
2015-04-09 22:17:19 +02:00
f83aaef4ed trimming anonymized sentence
Former-commit-id: 316b76717e4075e466828c628e064076d39481c5
2014-08-15 13:22:04 +02:00
e99eb77b28 anonymizing sentences
Former-commit-id: 5d8bd7e16258fda7c02a7cc0e1da589d73418f0d
2014-04-29 14:46:04 +02:00
13c97f572d sentence anonymizer stub, regex replacement
Former-commit-id: edb1247f7b29fd62913114be84d3391507a0890e
2014-04-13 12:21:30 +02:00
4b921decae limits control
Former-commit-id: 83d90cb63b3f1447938d16010e66f4345dfe0617
2014-03-14 11:30:17 +01:00
b318770752 redesigned project
Former-commit-id: d35841126fda627a04a1a16a26b91943401b6fcf
2013-12-14 15:23:17 +01:00
47405834a3 concordia-console, new approach to suffix array - 4 sauchars per one saidx 2013-12-06 22:29:25 +01:00
d3cccff654 concordia index 2013-11-20 17:43:29 +01:00
656e9dbae9 concordia index stub 2013-11-14 20:36:34 +01:00
b238995a16 working hash generator 2013-11-14 15:44:50 +01:00
3aa4091e4d word map 2013-11-12 22:08:37 +01:00
7f062c49e4 hash generator stub 2013-11-12 16:58:31 +01:00