| bd73749388 | new tokenizer | 2017-04-26 17:02:18 +02:00 |
| 1adabf4833 | add index path as required argument to concordia constructor | 2015-10-16 22:14:11 +02:00 |
| a8c5fa0c75 | original word positions | 2015-06-27 10:09:49 +02:00 |
| 8432dd321f | tokenizer in progress | 2015-06-25 10:12:51 +02:00 |
| 87a26bfa3b | cleaned configuration, doc | 2015-04-30 21:15:18 +02:00 |
| f64449311d | removed stop words - works slower; Former-commit-id: 97ce33b0a6ea3c89aaa5a4c69cad248c7b2c8203 | 2015-04-21 21:33:08 +02:00 |
| 4e02afc897 | anubis search v1 - very slow for some patterns; Former-commit-id: ae327d7d24f4bc959d3749745a8c395093a17a50 | 2015-04-16 11:39:39 +02:00 |
| e99eb77b28 | anonymizing sentences; Former-commit-id: 5d8bd7e16258fda7c02a7cc0e1da589d73418f0d | 2014-04-29 14:46:04 +02:00 |
| 9358863f8d | text utils stub; Former-commit-id: d4459220f5696839d98848e9c30a61c084763a91 | 2014-04-24 08:36:48 +02:00 |
| 13c97f572d | sentence anonymizer stub, regex replacement; Former-commit-id: edb1247f7b29fd62913114be84d3391507a0890e | 2014-04-13 12:21:30 +02:00 |
| fb65cc9c66 | suffix markers; Former-commit-id: 7426cce771f548dcd4eb7478aafa912fb73784bf | 2014-02-20 10:49:17 +01:00 |
| 7c1ed7fb6e | suachar_t changed to int | 2013-12-01 23:34:46 +01:00 |
| 4a572784fa | clean | 2013-11-28 16:55:52 +01:00 |
| 47c70d2509 | empty temp | 2013-11-28 16:52:43 +01:00 |
| 0d8a057278 | suffix array simple search | 2013-11-28 16:47:57 +01:00 |
| d3cccff654 | concordia index | 2013-11-20 17:43:29 +01:00 |
| 656e9dbae9 | concordia index stub | 2013-11-14 20:36:34 +01:00 |
| 12ac566533 | init | 2013-10-24 17:06:17 +02:00 |
| b484958412 | init | 2013-10-24 17:06:00 +02:00 |