Commit Graph

99 Commits

SHA1 Message Date
873d7c300c added parameterless constructor for concordia 2015-10-19 15:38:10 +02:00
1adabf4833 add index path as required argument to concordia constructor 2015-10-16 22:14:11 +02:00
f585ff9e01 corpus figures creator 2015-10-06 13:34:03 +02:00
96c74c47ac corpus analyzer 2015-10-04 16:24:58 +02:00
2601dc83bf test corpus for corpus analyzer 2015-10-03 16:19:10 +02:00
4e17e28f7f working corpus analyzer 2015-10-03 16:18:49 +02:00
fa3138df29 count occurrences feature 2015-10-01 13:36:54 +02:00
fd32ff7e12 todo 2015-09-07 08:15:46 +02:00
cdeb57ccfa todo 2015-08-26 20:14:43 +02:00
bd62420cd5 updated tutorial 2015-08-24 14:30:20 +02:00
0a3fd8a04e added an extremely important improvement to the concordia search algorithm - gapped overlays cut-off 2015-08-24 13:10:06 +02:00
209e374226 repaired concordia test 2015-08-19 20:53:40 +02:00
68fecaddf8 adding all tokenized examples 2015-08-19 20:49:26 +02:00
a765443a01 simple search returns matched pattern fragments 2015-08-07 12:54:57 +02:00
28704c2f43 separated tokenization and adding to index 2015-08-01 17:03:39 +02:00
5a57406875 finished original word positions 2015-06-27 12:40:24 +02:00
a8c5fa0c75 original word positions 2015-06-27 10:09:49 +02:00
dba70b4e24 done word positions 2015-06-26 22:50:53 +02:00
724bf0d080 new responsibilities of tokenized sentence 2015-06-26 15:38:24 +02:00
9b1735516c working sentence tokenizer 2015-06-25 20:49:22 +02:00
8432dd321f tokenizer in progress 2015-06-25 10:12:51 +02:00
0baf3e4ef2 character intervals in progress 2015-06-22 13:52:56 +02:00
4c0f2fd08d modified todo 2015-06-12 12:25:02 +02:00
dff52abff7 modified todo 2015-06-11 11:17:45 +02:00
680eb54ae5 modified todo, removed concordia-server 2015-06-09 13:09:10 +02:00
07d5d4438b clear index, examples 2015-05-04 20:40:44 +02:00
abbd5b1ae8 finished documentation 2015-05-01 14:52:53 +02:00
9e550ca1cf more doc 2015-04-30 22:22:54 +02:00
87a26bfa3b cleaned configuration, doc 2015-04-30 21:15:18 +02:00
b790c6898f stable release 2015-04-30 09:29:10 +02:00
db63cf776e doc 2015-04-28 21:34:07 +02:00
952b94971f only configure prod resources if available 2015-04-27 16:20:54 +02:00
bb7608d05e anubis searcher -> concordia searcher (Former-commit-id: 8afe194adf3163ee62caa30732d9c9dd095df66b) 2015-04-24 11:48:32 +02:00
23aa113747 order in scripts (Former-commit-id: 945423dec6f4007f780b27fd590fb09578117b54) 2015-04-24 11:10:17 +02:00
04df67c6f0 100% test in concordia-console. All passed! (Former-commit-id: 6e6186a148d637ba5a0d324d6d68c78708f0942d) 2015-04-22 16:50:12 +02:00
d9112e209a updated TODO, concordia is not slower after all (Former-commit-id: 3621c98c7e30f4a446dcc4b64671e336f1b27f44) 2015-04-21 21:54:28 +02:00
f64449311d removed stop words - works slower (Former-commit-id: 97ce33b0a6ea3c89aaa5a4c69cad248c7b2c8203) 2015-04-21 21:33:08 +02:00
5c2ae86097 output concordia score (Former-commit-id: fa7db09fe9319fa844d294ca4e7deb22d1328151) 2015-04-21 20:44:49 +02:00
7549703414 best overlay computation (Former-commit-id: 986f3d6b611fd276a7b26073daa0094caf078d1e) 2015-04-21 15:14:48 +02:00
9b97ff2fa9 optimal match notes (Former-commit-id: de82da8922bae9fe913fd85e80397c95a69198ca) 2015-04-19 12:46:34 +02:00
e3d477dc3a notes (Former-commit-id: a0b292acca6154f4c27e29ce21b8702d178ef583) 2015-04-17 14:19:45 +02:00
024fbf72aa concordia search (Former-commit-id: 609c3a54e930ebae45a2e9a07f63991ec4abc9a6) 2015-04-17 14:17:59 +02:00
0927e2ed1f added profiling, which is very important and private notes, which are even importanter :) (Former-commit-id: 1f1746c2de27b52aab4615e64d6b11b0c1e17624) 2015-04-16 17:18:17 +02:00
4e02afc897 anubis search v1 - very slow for some patterns (Former-commit-id: ae327d7d24f4bc959d3749745a8c395093a17a50) 2015-04-16 11:39:39 +02:00
fc41bb251a todo (Former-commit-id: a73e0c0d0887afabdd4ff25b6cc3b11b5a85cb14) 2015-04-15 14:14:38 +02:00
0d4bdf12de removed using namespace std (Former-commit-id: dbb5129e1f94d83eca887ada0f89d6bb45250f1e) 2015-04-15 14:14:10 +02:00
a09999c130 repaired tm matches (Former-commit-id: ee2e73ab1e37db051b8be36b97bc503241c798c0) 2015-04-15 11:50:59 +02:00
3a03b01f42 std vectors (Former-commit-id: 5816e87c856f7edc242cc707851a0e2ad05aeb38) 2015-04-15 10:55:26 +02:00
e02bbaa0fa getTmMatches (Former-commit-id: 94aa3db2db88195c61c6ac70006c0e1d743dc854) 2015-04-14 20:14:30 +02:00
f03b4ad954 fixed lcp search (Former-commit-id: 18192126d134323569bc43205ccc60788d9e6cb6) 2015-04-12 12:06:41 +02:00