Commit Graph

27 Commits

SHA1 Message Date
5a7cbbe9e9 full search stub - tests needed 2019-01-09 15:30:56 +01:00
61631c52a3 lexicon search 2017-10-10 15:39:47 +02:00
970dda5dc2 option of white space tokenization while searching 2017-04-22 23:45:51 +02:00
31e4f091ad multiple results 2017-04-21 14:51:58 +02:00
b3d7c993aa tokenize only option - no word map 2016-01-01 20:45:07 +01:00
0a8d2fdd39 tokenize by whitespace option 2015-12-27 20:54:40 +01:00
873d7c300c added parameterless constructor for concordia 2015-10-19 15:38:10 +02:00
1adabf4833 add index path as required argument to concordia constructor 2015-10-16 22:14:11 +02:00
fa3138df29 count occurrences feature 2015-10-01 13:36:54 +02:00
68fecaddf8 adding all tokenized examples 2015-08-19 20:49:26 +02:00
a765443a01 simple search returns matched pattern fragments 2015-08-07 12:54:57 +02:00
28704c2f43 separated tokenization and adding to index 2015-08-01 17:03:39 +02:00
5a57406875 finished original word positions 2015-06-27 12:40:24 +02:00
dba70b4e24 done word positions 2015-06-26 22:50:53 +02:00
07d5d4438b clear index, examples 2015-05-04 20:40:44 +02:00
024fbf72aa concordia search (Former-commit-id: 609c3a54e930ebae45a2e9a07f63991ec4abc9a6) 2015-04-17 14:17:59 +02:00
4e02afc897 anubis search v1 - very slow for some patterns (Former-commit-id: ae327d7d24f4bc959d3749745a8c395093a17a50) 2015-04-16 11:39:39 +02:00
0d4bdf12de removed using namespace std (Former-commit-id: dbb5129e1f94d83eca887ada0f89d6bb45250f1e) 2015-04-15 14:14:10 +02:00
3a03b01f42 std vectors (Former-commit-id: 5816e87c856f7edc242cc707851a0e2ad05aeb38) 2015-04-15 10:55:26 +02:00
f03b4ad954 fixed lcp search (Former-commit-id: 18192126d134323569bc43205ccc60788d9e6cb6) 2015-04-12 12:06:41 +02:00
13c97f572d sentence anonymizer stub, regex replacement (Former-commit-id: edb1247f7b29fd62913114be84d3391507a0890e) 2014-04-13 12:21:30 +02:00
a1312b7668 jrc test (Former-commit-id: 8ddc453cc0350f8ef96becf2fbff7898045e1f77) 2014-02-20 12:26:36 +01:00
fb65cc9c66 suffix markers (Former-commit-id: 7426cce771f548dcd4eb7478aafa912fb73784bf) 2014-02-20 10:49:17 +01:00
b318770752 redesigned project (Former-commit-id: d35841126fda627a04a1a16a26b91943401b6fcf) 2013-12-14 15:23:17 +01:00
47405834a3 concordia-console, new approach to suffix array - 4 sauchars per one saidx 2013-12-06 22:29:25 +01:00
0d8a057278 suffix array simple search 2013-11-28 16:47:57 +01:00
3c208270b9 init 2013-10-24 17:08:58 +02:00