Commit Graph

351 Commits

Author SHA1 Message Date
66cfa2397e fix README 2018-09-01 17:16:25 +02:00
c6d48c57f6 improve documentation on geval --submit 2018-09-01 16:39:00 +02:00
081b2507f3 update README 2018-09-01 14:43:35 +02:00
e2c3102cc4 check whether the remote tracking branch exists 2018-09-01 14:39:34 +02:00
eaa791cf2f improvement for "submit" special command 2018-08-28 18:58:51 +02:00
Piotr Halama
bd7c789bae Implement --submit command 2018-08-27 17:57:07 +02:00
Filip Gralinski
d1e9839bee bump up version number 2018-08-17 18:16:11 +02:00
Filip Gralinski
421d2e9797 add minimalistic tokenizer 2018-08-17 18:13:27 +02:00
Filip Gralinski
c79c4b356e fix some warnings 2018-08-17 17:52:41 +02:00
Filip Gralinski
8b7a18b4c7 v14 tokenizer added 2018-08-17 17:45:01 +02:00
Filip Gralinski
5e5a58210e use tokenization when looking for worst features 2018-08-17 17:27:25 +02:00
Filip Gralinski
0871b57bbc add --just-tokenize option 2018-08-17 16:57:47 +02:00
3a68324a6e bump up version number 2018-08-13 10:10:18 +02:00
83550688ce first tokenizer 2018-08-13 10:09:55 +02:00
d3da3a0ca5 WIP 2018-08-13 07:39:06 +02:00
8388ab4d27 towards tokenization 2018-08-11 22:59:43 +02:00
de52a12b03 export some functions from OptionsParser 2018-08-10 16:09:41 +02:00
5098225bc1 improvements in challenge creation 2018-08-10 13:05:42 +02:00
e10f92cf9c create challenge with MultiLabelLikelihood/LogLoss 2018-08-09 16:35:31 +02:00
efcceae26a implement MultiLabel-LogLoss and MultiLabel-Likelihood 2018-08-09 16:00:19 +02:00
bd2bfde287 MultiLabel-F1 works on labels given with probs now 2018-08-09 14:08:54 +02:00
82bdf70031 add missing metric to help 2018-08-09 12:47:52 +02:00
da2114e6d2 reverse sides when diffing 2018-08-07 16:21:37 +02:00
e55b8539f1 option -r can be used with -m 2018-08-07 15:55:04 +02:00
c385710719 showing most worsening features 2018-08-06 22:22:33 +02:00
3f3d1fd287 refactor worst features 2018-08-06 21:34:38 +02:00
6a862c0e82 bump up version number 2018-08-06 12:11:06 +02:00
7503644bbe sort in --worst-features 2018-08-06 12:09:31 +02:00
bc1de4c3e6 worst features show average score now 2018-08-06 11:59:04 +02:00
51abed6fa4 count the number of lines correctly 2018-08-03 11:16:28 +02:00
a3f5f25f69 Merge branch 'worst-features' of ssh://gonito.net/geval into worst-features 2018-08-03 08:24:20 +02:00
6376063a0c more ranking tests 2018-08-03 08:23:55 +02:00
8dac79fab2 clean up listing worst features 2018-08-02 22:09:25 +02:00
020b93ccf8 p-value for features counted 2018-08-02 12:50:13 +02:00
f8418894fb Merge branch 'worst-features' of ssh://gonito.net/geval into worst-features 2018-08-02 08:31:08 +02:00
cd30d88998 fix some warnings 2018-08-02 08:29:52 +02:00
2b1cf80601 implement ranking conduit 2018-08-01 22:39:34 +02:00
4b3a4fa665 implement MultiLabel-F metric 2018-07-26 13:01:10 +02:00
c0fd359590 refactor for Gonito 2018-07-14 09:48:45 +02:00
0c6032d166 print params 2018-07-10 16:22:28 +02:00
9f5882719b param can take an empty value 2018-07-10 12:10:02 +02:00
ab635f2594 add helper function for parsing params in file paths 2018-07-10 11:18:52 +02:00
830de547b0 switch off parallel garbage collector (was slowing down execution on the multi-core system)
see https://github.com/commercialhaskell/stack/issues/680
2018-07-09 09:31:15 +02:00
0708b746a9 fix handling compressed files 2018-06-29 16:59:00 +02:00
010f0f46ab export function needed by Gonito 2018-06-28 17:00:18 +02:00
1278081a48 results are sorted in the natural manner when multiple outputs are evaluated 2018-06-28 16:32:46 +02:00
338ddb7fbf fully handle multiple outputs 2018-06-28 16:22:22 +02:00
ba26cdb9e0 multiple outs are recognised but not handled 2018-06-28 15:36:47 +02:00
656a194f42 start refactoring to enable evaluating multiple outputs 2018-06-28 14:49:44 +02:00
Tsvetan Ovedenski
9c462bdf44
Remove warnings in Spec 2018-06-20 11:57:11 +02:00