From fd32ff7e121aadf2e16e74aeef9263aaa3a89e84 Mon Sep 17 00:00:00 2001
From: rjawor
Date: Mon, 7 Sep 2015 08:15:46 +0200
Subject: [PATCH] todo

---
 TODO.txt | 2 ++
 1 file changed, 2 insertions(+)

diff --git a/TODO.txt b/TODO.txt
index 1ee91a9..f7d2638 100644
--- a/TODO.txt
+++ b/TODO.txt
@@ -1,4 +1,6 @@
 ---------------------------- Developer's private notes (language may vary, bo tak czasem wygodniej) -----------------------------
+- document analysis - concordia score should be calculated for a document. Idea - for each fragment: (fragment length/document length) * log penalty at sentence level.
+- moving/extending concordia matches on demand - powered by concordia-server
 - rethink passing variables such as TokenizedSentence by smart pointers. Consider using references in getters.
 - multiple indexes based on different hashes. One can be word-net base forms, other - pos-tags and so on. Develop a method of combining results.
 IN PROGRESS - document the code (classes, cfg files) and update tutorial