Commit Graph

304 Commits

Author SHA1 Message Date
welp
32290d3715 pure percentage format feature commit 2020-07-13 16:20:36 +02:00
e9ca0cc351 Fix Mean/ with fuzzy matching 2020-07-02 18:25:51 +02:00
6c295a3325 Add hardening 2020-07-02 18:22:29 +02:00
236712c52b Add smart mode 2020-07-02 18:14:56 +02:00
00a2fc7d19 First version of fuzzy matching working 2020-07-02 08:16:05 +02:00
bbeb3ce397 continue work on fuzzy matching 2020-07-01 21:08:33 +02:00
4e3ff20e2c A dead-end when working on fuzzy matching 2020-07-01 18:24:45 +02:00
a00120b1c2 Filtering works on multiple values 2020-05-20 10:50:01 +02:00
0ed06729ba Training a simple ensemble 2020-05-20 09:20:49 +02:00
634dd21b12 Finish filtering 2020-05-13 15:34:16 +02:00
78d2cd6501 Add filtering operation (still not handled) 2020-05-13 13:36:51 +02:00
252493a776 Infrastructure ready 2020-05-13 13:01:35 +02:00
da5304014e Refactor sources 2020-05-13 11:34:51 +02:00
015053ec87 Wrap filter type more 2020-05-13 08:19:52 +02:00
6f6c96969d Introduce Filter type 2020-05-13 08:13:31 +02:00
c1b65531b9 Introduce line source specification 2020-05-13 08:03:16 +02:00
991ce3f09b Refactor code 2020-05-12 21:08:10 +02:00
2f348dd616 Use DataSource in LineByLine 2020-05-12 17:30:33 +02:00
454f60812c Introduce DataSource 2020-05-12 17:13:52 +02:00
8f3550493c Get rid of hpqtypes (mintercalate) 2020-04-14 21:03:03 +02:00
8d429b01cb Show column names in extracted features 2020-02-22 12:13:45 +01:00
6d586c7238 Handle headers 2020-02-22 11:18:34 +01:00
e627106dd6 Fix bug in cross-tabs
It was due to a huge bug in ordered-containers.
2020-02-11 09:37:19 +01:00
593871f19b Fix tests 2020-02-07 11:34:42 +01:00
169931be8f Define FLC-F-score 2020-02-07 11:29:37 +01:00
a66fa35a42 Mean/Multilabel-F works with Bootstrap 2020-02-01 11:54:54 +01:00
5d895ea1c6 Fix order in line-by-line mode 2020-01-31 22:38:31 +01:00
2b4a416a4f Bump up version 2020-01-31 09:50:26 +01:00
8761965e8e Add option to mark worst features 2020-01-31 09:44:21 +01:00
15946b89db Add helper function for Gonito 2020-01-28 23:06:38 +01:00
f21d752634 Fix validating challenges with bootstrap 2020-01-28 22:36:37 +01:00
550358f0cd Automatically set precision when bootstrap resampling is used 2020-01-28 20:32:09 +01:00
59e26b683b Change Bootstrap option name 2020-01-27 22:52:15 +01:00
36a3d8c949 Gracefully degrade when bootstrap is not (yet) handled 2020-01-27 22:23:43 +01:00
659b122625 Make Bootstrap work on MultiLabel-F1 2020-01-27 22:14:13 +01:00
1cea36ac93 Helper functions for confidence bounds 2020-01-27 21:54:34 +01:00
ae2769b7b9 Implement bootstrap in GEval 2020-01-25 23:46:33 +01:00
deb14c6702 Add Bootstrap facilities 2020-01-25 22:05:11 +01:00
bfcd5aa631 Most evaluation metrics are handled with dependency types 2020-01-25 19:26:57 +01:00
608b1f9d73 Merge branch 'master' into bootstrap 2020-01-18 18:09:19 +01:00
e170c37864 Add substitution operation 2020-01-11 17:02:49 +01:00
5171cf0ac6 Results are presented as cross tables (if possible) 2020-01-04 20:48:36 +01:00
4ba61b6e6e Prepare helper functions for cross-tabs 2020-01-04 18:17:14 +01:00
01486d23aa Change the meaning of WER 2019-12-21 16:03:52 +01:00
ad30bb9384 Fix bug with preprocessing ops not handled in --line-by-line mode 2019-12-16 12:47:35 +01:00
9a3a28a813 Add --oracle-item-based option 2019-12-16 11:18:49 +01:00
d95e2878a6 Refactor line-by-line mode 2019-12-14 21:10:40 +01:00
2234efa107 Multiple metrics can be packed via "Cartesian" strings 2019-12-14 20:59:00 +01:00
5f532c71c7 Add setting priorities, names can be set multiple times
If more than one is given for a metric, they are concatenated
(with spaces).
2019-12-14 19:58:02 +01:00
0826d457b2 Complete move to the new style of train files 2019-12-13 20:31:40 +01:00