|
0a8d2fdd39
|
tokenize by whitespace option
|
2015-12-27 20:54:40 +01:00 |
|
|
873d7c300c
|
added parameterless constructor for concordia
|
2015-10-19 15:38:10 +02:00 |
|
|
1adabf4833
|
add index path as required argument to concordia constructor
|
2015-10-16 22:14:11 +02:00 |
|
|
fa3138df29
|
count occurences feature
|
2015-10-01 13:36:54 +02:00 |
|
|
68fecaddf8
|
adding all tokenized examples
|
2015-08-19 20:49:26 +02:00 |
|
|
a765443a01
|
simple search returns matched pattern fragments
|
2015-08-07 12:54:57 +02:00 |
|
|
28704c2f43
|
separated tokenization and adding to index
|
2015-08-01 17:03:39 +02:00 |
|
|
5a57406875
|
finished original word positions
|
2015-06-27 12:40:24 +02:00 |
|
|
dba70b4e24
|
done word positions
|
2015-06-26 22:50:53 +02:00 |
|
|
07d5d4438b
|
clear index, examples
|
2015-05-04 20:40:44 +02:00 |
|
|
024fbf72aa
|
concordia search
Former-commit-id: 609c3a54e930ebae45a2e9a07f63991ec4abc9a6
|
2015-04-17 14:17:59 +02:00 |
|
|
4e02afc897
|
anubis search v1 - very slow for some patterns
Former-commit-id: ae327d7d24f4bc959d3749745a8c395093a17a50
|
2015-04-16 11:39:39 +02:00 |
|
|
0d4bdf12de
|
removed using namespace std
Former-commit-id: dbb5129e1f94d83eca887ada0f89d6bb45250f1e
|
2015-04-15 14:14:10 +02:00 |
|
|
3a03b01f42
|
std vectors
Former-commit-id: 5816e87c856f7edc242cc707851a0e2ad05aeb38
|
2015-04-15 10:55:26 +02:00 |
|
|
f03b4ad954
|
fixed lcp search
Former-commit-id: 18192126d134323569bc43205ccc60788d9e6cb6
|
2015-04-12 12:06:41 +02:00 |
|
|
13c97f572d
|
sentence anonymizer stub, regex replacement
Former-commit-id: edb1247f7b29fd62913114be84d3391507a0890e
|
2014-04-13 12:21:30 +02:00 |
|
|
a1312b7668
|
jrc test
Former-commit-id: 8ddc453cc0350f8ef96becf2fbff7898045e1f77
|
2014-02-20 12:26:36 +01:00 |
|
|
fb65cc9c66
|
suffix markers
Former-commit-id: 7426cce771f548dcd4eb7478aafa912fb73784bf
|
2014-02-20 10:49:17 +01:00 |
|
|
b318770752
|
redesigned project
Former-commit-id: d35841126fda627a04a1a16a26b91943401b6fcf
|
2013-12-14 15:23:17 +01:00 |
|
|
47405834a3
|
concordia-console, new approach to suffix array - 4 sauchars per one saidx
|
2013-12-06 22:29:25 +01:00 |
|
|
0d8a057278
|
suffix array simple search
|
2013-11-28 16:47:57 +01:00 |
|
|
3c208270b9
|
init
|
2013-10-24 17:08:58 +02:00 |
|