diff --git a/TODO.txt b/TODO.txt
index 1ee91a9..f7d2638 100644
--- a/TODO.txt
+++ b/TODO.txt
@@ -1,4 +1,6 @@
 ---------------------------- Developer's private notes (language may vary, because sometimes that is more convenient) -----------------------------
+- document analysis - concordia score should be calculated for a document. Idea - for each fragment: (fragment length/document length) * log penalty at sentence level.
+- moving/extending concordia matches on demand - powered by concordia-server
 - rethink passing variables such as TokenizedSentence by smart pointers. Consider using references in getters.
 - multiple indexes based on different hashes. One can be word-net base forms, other - pos-tags and so on. Develop a method of combining results.
 IN PROGRESS - document the code (classes, cfg files) and update tutorial