Commit Graph

23 Commits

Author SHA1 Message Date
4faae4e91a slight change 2017-04-27 13:52:03 +02:00
dceb0d9f47 date recognition 2017-04-27 10:37:29 +02:00
bd73749388 new tokenizer 2017-04-26 17:02:18 +02:00
1adabf4833 add index path as required argument to concordia constructor 2015-10-16 22:14:11 +02:00
a8c5fa0c75 original word positions 2015-06-27 10:09:49 +02:00
8432dd321f tokenizer in progress 2015-06-25 10:12:51 +02:00
87a26bfa3b cleaned configuration, doc 2015-04-30 21:15:18 +02:00
f64449311d removed stop words - works slower (Former-commit-id: 97ce33b0a6ea3c89aaa5a4c69cad248c7b2c8203) 2015-04-21 21:33:08 +02:00
4e02afc897 anubis search v1 - very slow for some patterns (Former-commit-id: ae327d7d24f4bc959d3749745a8c395093a17a50) 2015-04-16 11:39:39 +02:00
0d4bdf12de removed using namespace std (Former-commit-id: dbb5129e1f94d83eca887ada0f89d6bb45250f1e) 2015-04-15 14:14:10 +02:00
e99eb77b28 anonymizing sentences (Former-commit-id: 5d8bd7e16258fda7c02a7cc0e1da589d73418f0d) 2014-04-29 14:46:04 +02:00
Rafał Jaworski 6ddba32f48 utf8 (Former-commit-id: fa7407621e839f87613476596c6589aeceb9d796) 2014-04-24 11:51:04 +02:00
9358863f8d text utils stub (Former-commit-id: d4459220f5696839d98848e9c30a61c084763a91) 2014-04-24 08:36:48 +02:00
13c97f572d sentence anonymizer stub, regex replacement (Former-commit-id: edb1247f7b29fd62913114be84d3391507a0890e) 2014-04-13 12:21:30 +02:00
fb65cc9c66 suffix markers (Former-commit-id: 7426cce771f548dcd4eb7478aafa912fb73784bf) 2014-02-20 10:49:17 +01:00
7c1ed7fb6e suachar_t changed to int 2013-12-01 23:34:46 +01:00
4a572784fa clean 2013-11-28 16:55:52 +01:00
47c70d2509 empty temp 2013-11-28 16:52:43 +01:00
0d8a057278 suffix array simple search 2013-11-28 16:47:57 +01:00
d3cccff654 concordia index 2013-11-20 17:43:29 +01:00
656e9dbae9 concordia index stub 2013-11-14 20:36:34 +01:00
12ac566533 init 2013-10-24 17:06:17 +02:00
b484958412 init 2013-10-24 17:06:00 +02:00