Filip Gralinski
|
adadf14888
|
Fix simple mistake when plotting a Loess graph
|
2019-03-19 07:46:10 +01:00 |
|
|
eb10a4c3b4
|
Add plotting graphs for selected metrics
|
2019-03-19 07:31:17 +01:00 |
|
|
816c83f183
|
Handle Probabilistic-Soft-F1 when creating a challenge
|
2019-03-12 22:39:32 +01:00 |
|
|
ae27029f61
|
Implement Probabilistic-Soft-F1
|
2019-03-12 22:35:19 +01:00 |
|
|
8393bec3ae
|
Implement auxiliary calibration function
|
2019-03-12 08:58:21 +01:00 |
|
|
19642db43f
|
Add auxilliary functions
|
2019-02-22 11:22:12 +01:00 |
|
|
fcb16d43f1
|
Export extensionsHandled
|
2019-02-14 22:29:44 +01:00 |
|
|
1a9fe36a9e
|
Handle JSONl (for MultiLabel-F)
|
2019-02-14 19:01:53 +01:00 |
|
|
2ea53f92c7
|
Refactor gevalCore
|
2019-02-14 16:48:55 +01:00 |
|
|
709eeec4ef
|
Merge branch 'master' into jsonl
|
2019-02-14 16:35:41 +01:00 |
|
|
4b85c4c1bb
|
Merge branch 'master' of ssh://gonito.net/geval
|
2019-02-14 16:26:19 +01:00 |
|
|
b2e3293a12
|
Refactor line-by-line mode
|
2019-02-14 16:25:28 +01:00 |
|
Filip Gralinski
|
26e9735d31
|
Handle jsonl files
|
2019-02-14 10:54:25 +01:00 |
|
Filip Gralinski
|
872724722a
|
Fix SMAPE on zero values
|
2019-02-12 08:36:52 +01:00 |
|
|
af21031172
|
Do not preprocess outputs for some metrics
|
2019-02-01 13:10:45 +01:00 |
|
|
ad5d614f48
|
Filter out NaN values so that sorting is not poisoned
|
2019-02-01 12:32:56 +01:00 |
|
|
f7bd1b2ccd
|
Add missing file, when generating a challenge
|
2019-02-01 12:04:52 +01:00 |
|
|
abcce9bf68
|
Add numerical features
|
2019-02-01 10:58:29 +01:00 |
|
|
d5a8908599
|
Refactor Features into Factors
|
2019-01-26 19:26:45 +01:00 |
|
|
1c3908b273
|
Refactor CartesianFeature type
|
2019-01-26 18:00:36 +01:00 |
|
|
ea5de5c719
|
Introduce existential features
|
2019-01-26 17:18:41 +01:00 |
|
Filip Gralinski
|
dbf5c961af
|
Start numerical factors
|
2019-01-23 13:00:37 +01:00 |
|
|
1aee476434
|
Add --filtre option
|
2019-01-14 23:23:50 +01:00 |
|
|
4003715726
|
Fix issue with sorting
|
2019-01-13 12:09:15 +01:00 |
|
|
b0c75cac3a
|
Change features into "factors"
(Just the terminology was changed)
|
2019-01-11 16:08:56 +01:00 |
|
|
de901d4c64
|
Add min-cartesian-feature (as optional value)
|
2019-01-11 10:16:39 +01:00 |
|
|
dbe1613052
|
Filtre out unwanted Cartesian features
|
2019-01-11 08:47:11 +01:00 |
|
|
39bc3964b3
|
Speed up cartesian features
|
2019-01-10 22:53:43 +01:00 |
|
|
23aad86e72
|
Add cartesian features to black-box debugging
But it's very slow now, needs to be sped up
|
2019-01-10 14:01:29 +01:00 |
|
|
99e3a10791
|
Add bigram features in black-box debugging
|
2019-01-10 10:41:55 +01:00 |
|
|
13f9629cbc
|
Minor refactor
|
2019-01-10 10:00:51 +01:00 |
|
|
212457077f
|
Consider word shapes in black-box debugging
|
2019-01-10 09:58:04 +01:00 |
|
|
e0cfb9c4b0
|
Add --min-frequency for black box debugging
|
2019-01-10 08:15:34 +01:00 |
|
|
1832a23b75
|
Refactor features
|
2019-01-09 17:45:06 +01:00 |
|
|
5d19fc7585
|
Add character-by-character tokenization.
|
2018-12-17 07:54:12 +01:00 |
|
|
d671989a09
|
Merge branch 'fix-haddock' of git.applica.pl:piotr.halama/geval
|
2018-12-14 07:28:27 +01:00 |
|
Piotr Halama
|
95fd1787ac
|
Reformat type expressions to fix Haddock parse errors
|
2018-12-13 17:14:21 +01:00 |
|
|
310ece7f2d
|
Get rid of f**king Windows end-of-line characters
I hate it so much. Die, die, stupid \r
|
2018-12-11 13:25:19 +01:00 |
|
|
6f1344e73e
|
Git-annex remote can be specified with --submit option
|
2018-12-07 09:21:02 +01:00 |
|
|
57ec495bc6
|
Fix mistakes in the implementation of 13a and v14 tokenizers.
|
2018-11-17 16:56:16 +01:00 |
|
Filip Gralinski
|
9322307813
|
Handle more than one possibility in TokenAccuracy
|
2018-10-24 08:02:34 +02:00 |
|
Filip Gralinski
|
2e816c4e38
|
Add TokenAccuracy metric
|
2018-10-23 17:01:33 +02:00 |
|
Filip Gralinski
|
f814fc2c79
|
Merge branch 'warnings' of https://gitlab.com/fintara/geval
|
2018-10-23 08:46:49 +02:00 |
|
|
8735610745
|
Implement soft f-score
|
2018-10-17 22:41:46 +02:00 |
|
Filip Gralinski
|
5dc6e13191
|
Add Pearson and Spearman correlation measures
|
2018-09-27 21:52:02 +02:00 |
|
Filip Gralinski
|
b3800bc1d9
|
Add Macro-F-measure to help
|
2018-09-27 18:23:26 +02:00 |
|
|
782c556f8c
|
Add Macro-F1 metric
|
2018-09-27 18:21:56 +02:00 |
|
Filip Gralinski
|
3a852ed081
|
Speed up GLEU (cntd.)
|
2018-09-26 22:27:59 +02:00 |
|
Filip Gralinski
|
eb395d9be0
|
Add WER metric
|
2018-09-25 08:13:57 +02:00 |
|
Filip Gralinski
|
4f09a1802f
|
Speed up GLEU
|
2018-09-25 07:10:17 +02:00 |
|
Filip Gralinski
|
b419aa7b08
|
Handle dot decimal separator in parameters
|
2018-09-12 20:52:56 +02:00 |
|
|
83b6b39fca
|
Fix error message
|
2018-09-12 13:48:31 +02:00 |
|
|
b46caaa702
|
Fix README.md generated for LikelihoodHashed
|
2018-09-12 12:44:36 +02:00 |
|
Filip Gralinski
|
5cff29cf06
|
Add GLEU
|
2018-09-11 08:03:07 +02:00 |
|
|
c6d48c57f6
|
improve documentation on geval --submit
|
2018-09-01 16:39:00 +02:00 |
|
|
e2c3102cc4
|
check whether the remote tracking branch exists
|
2018-09-01 14:39:34 +02:00 |
|
|
eaa791cf2f
|
improvement for "submit" special command
|
2018-08-28 18:58:51 +02:00 |
|
Piotr Halama
|
bd7c789bae
|
Implement --submit command
|
2018-08-27 17:57:07 +02:00 |
|
Filip Gralinski
|
421d2e9797
|
add minimalistic tokenizer
|
2018-08-17 18:13:27 +02:00 |
|
Filip Gralinski
|
c79c4b356e
|
fix some warnings
|
2018-08-17 17:52:41 +02:00 |
|
Filip Gralinski
|
8b7a18b4c7
|
v14 tokenizer added
|
2018-08-17 17:45:01 +02:00 |
|
Filip Gralinski
|
5e5a58210e
|
use tokenization when looking for worst features
|
2018-08-17 17:27:25 +02:00 |
|
Filip Gralinski
|
0871b57bbc
|
add --just-tokenize option
|
2018-08-17 16:57:47 +02:00 |
|
|
83550688ce
|
first tokenizer
|
2018-08-13 10:09:55 +02:00 |
|
|
d3da3a0ca5
|
WIP
|
2018-08-13 07:39:06 +02:00 |
|
|
8388ab4d27
|
towards tokenization
|
2018-08-11 22:59:43 +02:00 |
|
|
de52a12b03
|
export some functions from OptionsParser
|
2018-08-10 16:09:41 +02:00 |
|
|
5098225bc1
|
improvements in challenge creation
|
2018-08-10 13:05:42 +02:00 |
|
|
e10f92cf9c
|
create challenge with MultiLabelLikelihood/LogLoss
|
2018-08-09 16:35:31 +02:00 |
|
|
efcceae26a
|
implement MultiLabel-LogLoss and MultiLabel-Likelihood
|
2018-08-09 16:00:19 +02:00 |
|
|
bd2bfde287
|
MultiLabel-F1 works on labels given with probs now
|
2018-08-09 14:08:54 +02:00 |
|
|
82bdf70031
|
add missing metric to help
|
2018-08-09 12:47:52 +02:00 |
|
|
da2114e6d2
|
reverse sides when diffing
|
2018-08-07 16:21:37 +02:00 |
|
|
e55b8539f1
|
option -r can be used with -m
|
2018-08-07 15:55:04 +02:00 |
|
|
c385710719
|
showing most worsening features
|
2018-08-06 22:22:33 +02:00 |
|
|
3f3d1fd287
|
refactor worst features
|
2018-08-06 21:34:38 +02:00 |
|
|
7503644bbe
|
sort in --worst-features
|
2018-08-06 12:09:31 +02:00 |
|
|
bc1de4c3e6
|
worst features show average score now
|
2018-08-06 11:59:04 +02:00 |
|
|
51abed6fa4
|
count the number of lines correctly
|
2018-08-03 11:16:28 +02:00 |
|
|
8dac79fab2
|
clean up listing worst features
|
2018-08-02 22:09:25 +02:00 |
|
|
020b93ccf8
|
p-value for features counted
|
2018-08-02 12:50:13 +02:00 |
|
|
f8418894fb
|
Merge branch 'worst-features' of ssh://gonito.net/geval into worst-features
|
2018-08-02 08:31:08 +02:00 |
|
|
cd30d88998
|
fix some warnings
|
2018-08-02 08:29:52 +02:00 |
|
|
2b1cf80601
|
implement ranking conduit
|
2018-08-01 22:39:34 +02:00 |
|
|
4b3a4fa665
|
implement MultiLabel-F metric
|
2018-07-26 13:01:10 +02:00 |
|
|
c0fd359590
|
refactor for Gonito
|
2018-07-14 09:48:45 +02:00 |
|
|
0c6032d166
|
print params
|
2018-07-10 16:22:28 +02:00 |
|
|
9f5882719b
|
param can take an empty value
|
2018-07-10 12:10:02 +02:00 |
|
|
ab635f2594
|
add helper function for parsing params in file paths
|
2018-07-10 11:18:52 +02:00 |
|
|
0708b746a9
|
fix handling compressed files
|
2018-06-29 16:59:00 +02:00 |
|
|
010f0f46ab
|
export function needed by Gonito
|
2018-06-28 17:00:18 +02:00 |
|
|
1278081a48
|
results are sorted in the natural manner when multiple outputs are evaluated
|
2018-06-28 16:32:46 +02:00 |
|
|
338ddb7fbf
|
fully handle multiple outputs
|
2018-06-28 16:22:22 +02:00 |
|
|
ba26cdb9e0
|
multiple outs are recognised but not handled
|
2018-06-28 15:36:47 +02:00 |
|
|
656a194f42
|
start refactoring to enable evaluating multiple outputs
|
2018-06-28 14:49:44 +02:00 |
|
Tsvetan Ovedenski
|
f6ad2f0a85
|
Remove warnings in Core
|
2018-06-20 11:48:03 +02:00 |
|
Filip Gralinski
|
0a2e1fcc32
|
docs on PrecisionAndRecall
|
2018-06-13 15:36:23 +02:00 |
|
|
e0e06196f0
|
Merge branch 'handle-version-option' into 'master'
Added version flag handling, added changelog
Closes #7
See merge request filipg/geval!1
|
2018-06-13 10:46:34 +00:00 |
|
Filip Gralinski
|
012578f32a
|
implement mean absolute error
|
2018-06-13 12:30:11 +02:00 |
|
Tomasz Weissbek
|
964957b1db
|
Added version flag handling, added changelog
|
2018-06-13 12:19:06 +02:00 |
|
|
1073407760
|
improve documentation
|
2018-06-12 21:52:18 +02:00 |
|
|
86d50b92b7
|
multiple metrics can be specified
|
2018-06-08 12:38:45 +02:00 |
|
|
ffb24509d7
|
handle http(s)://
|
2018-06-02 23:27:49 +02:00 |
|
|
57ee8a1296
|
switch to smart sources
|
2018-06-02 20:24:34 +02:00 |
|
|
18ed47322e
|
Merge branch 'master' into smart-conduit
|
2018-06-02 16:31:36 +02:00 |
|
|
f9dfbc1466
|
accuracy can work on probablity distributions now
|
2018-06-02 12:24:14 +02:00 |
|
|
d370e375a0
|
add --alt-metric option
|
2018-06-02 11:29:54 +02:00 |
|
|
4768931221
|
add BIO-F1-Labels metric
|
2018-05-29 22:04:19 +02:00 |
|
|
65e8d2562e
|
underscores can be used in the BIO format
|
2018-05-29 20:59:00 +02:00 |
|
|
3f7384f467
|
add --sort and --reverse-sort options
|
2018-05-28 10:04:27 +02:00 |
|
|
ab1056301e
|
add sorting for --line-by-line internally
|
2018-05-28 09:45:08 +02:00 |
|
|
f68223409e
|
add test for the line-by-line mode
|
2018-05-26 21:10:22 +02:00 |
|
|
cb655cd2ae
|
refactor LineByLine
|
2018-05-26 14:40:26 +02:00 |
|
|
c71c7a019d
|
remove warning in LineByLine.hs
|
2018-05-26 13:09:06 +02:00 |
|
|
881a77e239
|
better diagnostic messages for BIO
|
2018-05-25 14:44:19 +02:00 |
|
|
3e201d11ef
|
update for Stack LTS 11.9
|
2018-05-19 13:49:53 +02:00 |
|
|
192d531969
|
add likelihood as evaluation metrics
|
2018-05-17 15:21:03 +02:00 |
|
|
438f013914
|
automatic decompression
|
2018-05-17 08:26:57 +02:00 |
|
|
01b93dd243
|
improve help for geval --init
|
2018-05-16 21:00:45 +02:00 |
|
|
b01f9439b7
|
log probs
|
2018-05-16 20:59:40 +02:00 |
|
|
82e794ae3c
|
implement BIO-F1
|
2018-05-16 10:51:50 +02:00 |
|
|
9fc4beaba1
|
improve sample challenge for LogLossHashed
|
2018-05-15 08:14:52 +02:00 |
|
|
06fd093349
|
probs can be given for LogLossHashed
|
2018-05-15 08:07:47 +02:00 |
|
|
bdcd26cddc
|
WIP
|
2018-05-14 10:37:58 +02:00 |
|
|
cea084c789
|
accuracy can work on probs now
|
2018-04-07 21:13:37 +02:00 |
|
|
ff8ec8880e
|
add LogLoss
|
2018-04-07 08:29:58 +02:00 |
|
|
9d4aab5f2c
|
diff
|
2018-02-20 21:28:14 +01:00 |
|
|
88f69156e7
|
refactor code
|
2018-02-20 21:28:14 +01:00 |
|
|
5ae8036efc
|
add short options, improve help
|
2018-02-20 21:28:14 +01:00 |
|
|
b51944b930
|
--init considers --precision now
|
2018-02-20 21:28:14 +01:00 |
|
|
6cfefed0c1
|
precision is part of specification now
|
2018-02-20 21:28:14 +01:00 |
|
|
f32564a42a
|
write wrong line number correctly in line-by-line mode
|
2018-02-20 21:28:14 +01:00 |
|
|
5c00ab6d26
|
show line number when something wrong
|
2018-02-20 21:28:14 +01:00 |
|
|
b323e6148c
|
refactor parse errors (use Either instead of throwing an error)
|
2018-02-20 21:28:14 +01:00 |
|
|
c70d49c418
|
add line-by-line mode
|
2018-02-20 21:28:13 +01:00 |
|
|
a7d2ed8c21
|
refactor
|
2018-02-20 21:28:13 +01:00 |
|
|
11b43b3a2a
|
introduce special command
|
2018-02-20 21:28:13 +01:00 |
|
|
a2814f2d12
|
add function for evaluating single lines
|
2018-02-20 21:28:13 +01:00 |
|
|
8d87ee4c4b
|
refactor Core so that any conduit source could be accepted, not just file names
|
2018-02-20 21:28:13 +01:00 |
|
|
9643719193
|
add MAP metric
|
2018-02-20 21:28:13 +01:00 |
|
|
c10f3579c6
|
fix BLEU for empty output
|
2018-02-20 21:28:13 +01:00 |
|
|
f289cafc03
|
upgrade to Stack LTS 9.5
|
2018-02-20 21:28:13 +01:00 |
|
|
54c899ddfc
|
generating sample CharMatch challenge, CharMatch is F0.5 now
|
2018-02-20 21:28:13 +01:00 |
|
|
72dbf33b8d
|
make it possible to cover metrics operating on the input, add CharMatch metric
|
2018-02-20 21:28:13 +01:00 |
|
|
6144ae6bdf
|
add sample toy challenge for LogLossHashed
|
2018-02-20 21:28:13 +01:00 |
|
|
59f19cbe18
|
change default size of hash
|
2018-02-20 21:28:13 +01:00 |
|
|
b058cd0095
|
implement softmax in LogLossHashed
|
2018-02-20 21:28:13 +01:00 |
|
|
e84a14d069
|
salt LogLossHashed with line numbers
|
2018-02-20 21:28:13 +01:00 |
|
|
0e9c44a5b5
|
start working on LogLossHashed
|
2018-02-20 21:28:13 +01:00 |
|
|
073d92a4e7
|
add sample challenge for NMI
|
2018-02-20 21:28:13 +01:00 |
|