geval/test
Filip Gralinski 9b79b8761d Check whether the maximum values is obtained during the validation 2019-08-10 15:55:51 +02:00
..
_submit-tests Implement --submit command 2018-08-27 17:57:07 +02:00
accuracy-probs accuracy can work on probs now 2018-04-07 21:13:37 +02:00
accuracy-simple handle Accuracy 2018-02-20 21:28:12 +01:00
bio-f1-complex implement BIO-F1 2018-05-16 10:51:50 +02:00
bio-f1-complex-labels add BIO-F1-Labels metric 2018-05-29 22:04:19 +02:00
bio-f1-error better diagnostic messages for BIO 2018-05-25 14:44:19 +02:00
bio-f1-perfect implement BIO-F1 2018-05-16 10:51:50 +02:00
bio-f1-simple implement BIO-F1 2018-05-16 10:51:50 +02:00
bio-f1-simple-underscores underscores can be used in the BIO format 2018-05-29 20:59:00 +02:00
bleu-complex BLEU done 2018-02-20 21:28:12 +01:00
bleu-empty fix BLEU for empty output 2018-02-20 21:28:13 +01:00
bleu-perfect BLEU done 2018-02-20 21:28:12 +01:00
bleu-trivial add test case data 2018-02-20 21:28:11 +01:00
bleu-with-tokenization add --just-tokenize option 2018-08-17 16:57:47 +02:00
charmatch-complex make it possible to cover metrics operating on the input, add CharMatch metric 2018-02-20 21:28:13 +01:00
charmatch-complex-compressed automatic decompression 2018-05-17 08:26:57 +02:00
charmatch-no-input make it possible to cover metrics operating on the input, add CharMatch metric 2018-02-20 21:28:13 +01:00
charmatch-perfect make it possible to cover metrics operating on the input, add CharMatch metric 2018-02-20 21:28:13 +01:00
charmatch-simple make it possible to cover metrics operating on the input, add CharMatch metric 2018-02-20 21:28:13 +01:00
clippeu-simple ClippEU passes tests 2018-02-20 21:28:12 +01:00
empty-output check emptiness 2018-02-20 21:28:12 +01:00
error-too-few-lines check the number of lines 2018-02-20 21:28:12 +01:00
error-too-many-lines check the number of lines 2018-02-20 21:28:12 +01:00
f-measure-all-false add F-measure 2018-02-20 21:28:13 +01:00
f-measure-perfect add F-measure 2018-02-20 21:28:13 +01:00
f-measure-simple add F-measure 2018-02-20 21:28:13 +01:00
f-measure-stupid add F-measure 2018-02-20 21:28:13 +01:00
f2-simple add F-measure 2018-02-20 21:28:13 +01:00
files WIP 2018-05-14 10:37:58 +02:00
gleu-empty more tests for GLEU 2018-09-12 20:37:44 +02:00
gleu-perfect more tests for GLEU 2018-09-12 20:37:44 +02:00
gleu-simple Add GLEU 2018-09-11 08:03:07 +02:00
jsonl-simple Handle jsonl files 2019-02-14 10:54:25 +01:00
likelihood-hashed-not-normalized add likelihood as evaluation metrics 2018-05-17 15:21:03 +02:00
likelihood-simple add test for the line-by-line mode 2018-05-26 21:10:22 +02:00
log-loss-hashed-normalization log probs 2018-05-16 20:59:40 +02:00
log-loss-hashed-not-normalized implement softmax in LogLossHashed 2018-02-20 21:28:13 +01:00
log-loss-hashed-probs probs can be given for LogLossHashed 2018-05-15 08:07:47 +02:00
log-loss-hashed-probs-normalized probs can be given for LogLossHashed 2018-05-15 08:07:47 +02:00
log-loss-hashed-simple salt LogLossHashed with line numbers 2018-02-20 21:28:13 +01:00
logloss-perfect add LogLoss 2018-04-07 08:29:58 +02:00
logloss-simple add LogLoss 2018-04-07 08:29:58 +02:00
macro-f-measure-perfect Add Macro-F1 metric 2018-09-27 18:21:56 +02:00
macro-f1-simple Add Macro-F1 metric 2018-09-27 18:21:56 +02:00
mae-simple implement mean absolute error 2018-06-13 12:30:11 +02:00
map-simple add MAP metric 2018-02-20 21:28:13 +01:00
mse-simple add a function for running with args, reading config file 2018-02-20 21:28:11 +01:00
multilabel-f1-simple implement MultiLabel-F metric 2018-07-26 13:01:10 +02:00
multilabel-f1-with-probs MultiLabel-F1 works on labels given with probs now 2018-08-09 14:08:54 +02:00
multilabel-f1-with-probs-and-numbers MultiLabel-F1 works on labels given with probs now 2018-08-09 14:08:54 +02:00
multilabel-f2-simple implement MultiLabel-F metric 2018-07-26 13:01:10 +02:00
multilabel-likelihood-simple implement MultiLabel-LogLoss and MultiLabel-Likelihood 2018-08-09 16:00:19 +02:00
nmi-complex NMI implemented as geval metric 2018-02-20 21:28:13 +01:00
probabilistic-soft-f1-calibrated Implement Probabilistic-Soft-F1 2019-03-12 22:35:19 +01:00
probabilistic-soft-f1-simple Implement Probabilistic-Soft-F1 2019-03-12 22:35:19 +01:00
rmse-simple add a function for running with args, reading config file 2018-02-20 21:28:11 +01:00
smape-simple Fix SMAPE on zero values 2019-02-12 08:36:52 +01:00
soft-f1-perfect Implement soft f-score 2018-10-17 22:41:46 +02:00
soft-f1-simple Implement soft f-score 2018-10-17 22:41:46 +02:00
spearman-simple Add Pearson and Spearman correlation measures 2018-09-27 21:52:02 +02:00
token-accuracy-simple Handle more than one possibility in TokenAccuracy 2018-10-24 08:02:34 +02:00
unexpected-data check whether data is OK 2018-02-20 21:28:12 +01:00
unwanted-data handle numbers combined with text 2018-02-20 21:28:12 +01:00
wer-simple Add WER metric 2018-09-25 08:13:57 +02:00
Spec.hs Check whether the maximum values is obtained during the validation 2019-08-10 15:55:51 +02:00