Commit Graph

122 Commits

Author SHA1 Message Date
59e26b683b Change Bootstrap option name 2020-01-27 22:52:15 +01:00
1cea36ac93 Helper functions for confidence bounds 2020-01-27 21:54:34 +01:00
ae2769b7b9 Implement bootstrap in GEval 2020-01-25 23:46:33 +01:00
deb14c6702 Add Bootstrap facilities 2020-01-25 22:05:11 +01:00
608b1f9d73 Merge branch 'master' into bootstrap 2020-01-18 18:09:19 +01:00
e170c37864 Add substitution operation 2020-01-11 17:02:49 +01:00
4ba61b6e6e Prepare helper functions for cross-tabs 2020-01-04 18:17:14 +01:00
01486d23aa Change the meaning of WER 2019-12-21 16:03:52 +01:00
9a3a28a813 Add --oracle-item-based option 2019-12-16 11:18:49 +01:00
2234efa107 Multiple metrics can be packed via "Cartesian" strings 2019-12-14 20:59:00 +01:00
cb4efe1d6b Introduce :S flag (sorting words within a line) 2019-11-25 21:31:17 +01:00
839ad5ce47 Merge branch 'master' into bootstrap 2019-11-23 13:17:19 +01:00
03aacdef98 Add SegmentAccuracy metric 2019-11-18 18:35:01 +01:00
Filip Graliński
41fe0d2283 Make room for storing the results of bootstrap resampling 2019-11-02 16:44:13 +01:00
b540cba7da Implement Probabilistic-MultiLabel-F1 2019-09-07 14:16:06 +02:00
029e3880f7 Soft2D-F... Metric is inclusive now. 2019-09-03 17:19:05 +02:00
e4a6ed347d Change the meaning of Soft2D-F1 metric.
Now it is averaged per line.
2019-08-22 17:07:32 +02:00
6b63740c4a Add Soft2D-F metric 2019-08-22 13:20:29 +02:00
dab2646798 Start working on --list-metrics options 2019-08-21 23:44:18 +02:00
4d069e8102 Handle preprocessing operations for metrics 2019-08-12 17:50:48 +02:00
9b79b8761d Check whether the maximum values is obtained during the validation 2019-08-10 15:55:51 +02:00
6fa502ccc2 Test challenge creation & validation 2019-08-10 13:00:29 +02:00
b4ad774623 Trying to get references 2019-05-23 16:16:05 +02:00
eb10a4c3b4 Add plotting graphs for selected metrics 2019-03-19 07:31:17 +01:00
ae27029f61 Implement Probabilistic-Soft-F1 2019-03-12 22:35:19 +01:00
8393bec3ae Implement auxiliary calibration function 2019-03-12 08:58:21 +01:00
1a9fe36a9e Handle JSONl (for MultiLabel-F) 2019-02-14 19:01:53 +01:00
709eeec4ef Merge branch 'master' into jsonl 2019-02-14 16:35:41 +01:00
Filip Gralinski
26e9735d31 Handle jsonl files 2019-02-14 10:54:25 +01:00
Filip Gralinski
872724722a Fix SMAPE on zero values 2019-02-12 08:36:52 +01:00
138b77688b Add test for SMAPE 2019-02-01 12:05:22 +01:00
f7bd1b2ccd Add missing file, when generating a challenge 2019-02-01 12:04:52 +01:00
abcce9bf68 Add numerical features 2019-02-01 10:58:29 +01:00
ea5de5c719 Introduce existential features 2019-01-26 17:18:41 +01:00
Filip Gralinski
dbf5c961af Start numerical factors 2019-01-23 13:00:37 +01:00
39bc3964b3 Speed up cartesian features 2019-01-10 22:53:43 +01:00
212457077f Consider word shapes in black-box debugging 2019-01-10 09:58:04 +01:00
5d19fc7585 Add character-by-character tokenization. 2018-12-17 07:54:12 +01:00
60a8c96aa8 Fix tests 2018-12-07 09:22:55 +01:00
Filip Gralinski
9322307813 Handle more than one possibility in TokenAccuracy 2018-10-24 08:02:34 +02:00
Filip Gralinski
2e816c4e38 Add TokenAccuracy metric 2018-10-23 17:01:33 +02:00
Filip Gralinski
30c37c2b40 Merge branch 'master' of git.applica.pl:piotr.halama/geval 2018-10-23 08:50:04 +02:00
Filip Gralinski
f814fc2c79 Merge branch 'warnings' of https://gitlab.com/fintara/geval 2018-10-23 08:46:49 +02:00
Piotr Halama
dc1618a0ec Use correct temporary directory 2018-10-22 13:32:36 +02:00
8735610745 Implement soft f-score 2018-10-17 22:41:46 +02:00
Filip Gralinski
5dc6e13191 Add Pearson and Spearman correlation measures 2018-09-27 21:52:02 +02:00
782c556f8c Add Macro-F1 metric 2018-09-27 18:21:56 +02:00
Filip Gralinski
eb395d9be0 Add WER metric 2018-09-25 08:13:57 +02:00
Filip Gralinski
9b77f08876 more tests for GLEU 2018-09-12 20:37:44 +02:00
Filip Gralinski
5cff29cf06 Add GLEU 2018-09-11 08:03:07 +02:00