Commit Graph

90 Commits

Author SHA1 Message Date
5f4af21006 Switch to more general announcements
Compatibility break: switch NEW_BEST_RESULT_SLACK_HOOK env
to ANNOUNCEMENT_HOOK
2021-08-21 10:26:46 +02:00
3dc9ae5dd8 Refactor towards generalization of announcements 2021-08-21 10:08:41 +02:00
2a6515d872 Refactor towards general announcements 2021-08-21 09:45:37 +02:00
b432fdd0e5 view-process-log is self-contained 2021-08-09 22:07:44 +02:00
0c5bbd63aa Add API for viewing progress logs 2021-07-30 12:19:27 +02:00
51e98bee68 Handle properly cases where there is no test 2021-07-29 17:36:18 +02:00
30c7727d6a Clean up Swagger documentation, enhance properties
Breaking change: "version" property for leaderboard entry is a list of
integers now
2021-05-29 18:40:13 +02:00
dababb05ed Challenge slug can start with a digit 2021-05-12 07:03:38 +02:00
a8c1761898 Fix issue with web sockets not working 2021-04-24 17:43:53 +02:00
d02d3a824a Whether using web socket for showing progress is configurable 2021-02-27 22:51:40 +01:00
0bc680c7ae Add experimental handling of outputs via Web Sockets 2021-02-27 18:38:38 +01:00
e185f9251b Switch to an incompatible DB scheme
Evaluation is uniquely identified by test, output checksum
and version checksum.
2021-02-27 11:48:30 +01:00
f7d61aa256 Add challenge healing 2021-02-17 09:31:23 +01:00
9155f52315 Add to challenge submission to API 2021-01-17 20:37:25 +01:00
e158459bf8 Handle showing as percentage 2020-09-05 16:45:09 +02:00
dce08f5ce6 Repository is cloned if not found on a local disk 2020-09-05 14:22:12 +02:00
7311a2a012 Minor refactor in cloning code 2020-09-05 11:52:33 +02:00
Filip Gralinski
55d06e3347 Update to the newest GEval 2020-08-08 21:52:44 +02:00
389a018254 Fix issue with multiple evaluation results 2020-03-04 08:36:37 +01:00
d2a59e59a5 Add bootstrap resampling 2020-01-28 23:14:46 +01:00
fc062cbd1f Showing worst lines 2019-12-14 18:21:47 +01:00
c5e43fcbea Implement re-evaluation 2019-12-14 14:10:50 +01:00
770c167753 Refactor towards changing evaluation records 2019-12-14 11:58:52 +01:00
12fd6a1c58 Handle switching to the right commit
(Though not robust)
2019-12-14 11:17:12 +01:00
120c87e44a Prepare application structure for re-evaluation 2019-12-14 10:56:07 +01:00
80020ee404 Show current challenge version 2019-12-13 22:29:41 +01:00
219349e95a Fix opening 2019-12-07 22:48:58 +01:00
247e55ba18 Add a helper function 2019-12-07 21:26:50 +01:00
7b4b8b101d API for scores handles alternative metrics 2019-11-30 11:56:07 +01:00
9a53a258ff Show metric names parsed from evaluation schemes 2019-09-11 21:30:37 +02:00
Filip Graliński
0f4150a4a5 Fix issue with unwanted messages 2019-09-10 08:59:30 +02:00
32c77b3c74 Challenges can be updated 2019-08-29 08:56:22 +02:00
Filip Graliński
3b6846c683 Update for GEval preprocessing operations 2019-08-12 18:19:02 +02:00
Filip Graliński
f58121b407 Local IDs can be longer 2019-03-20 16:32:52 +01:00
ec4947254a Fix slack hook 2018-11-14 20:59:40 +01:00
Filip Graliński
3447b29a82 Handle Slack hooks 2018-11-14 17:41:01 +01:00
Filip Gralinski
876e98f862 Merge branch 'master' of ssh://gonito.net/gonito 2018-10-15 21:15:29 +02:00
Filip Gralinski
687716f6fe Handle tags and description from gonito.yaml 2018-10-15 21:15:03 +02:00
Filip Graliński
32ae5d7e1a Use --single-branch 2018-10-15 14:19:20 +02:00
Filip Graliński
a2ae700158 Add first version of indicator graphs 2018-09-22 19:22:00 +02:00
Filip Gralinski
20fc29159f Show all main metrics in the leaderboard 2018-09-08 19:21:06 +02:00
Filip Gralinski
858f494047 Scores are sorted correctly with N/A and Infinity values correctly handled 2018-09-08 10:20:36 +02:00
Filip Gralinski
1aafff1808 Do not use scientific formatting of scores 2018-09-08 09:03:22 +02:00
c23e28250a numbers in graphs are shown with the right precision 2018-09-01 13:56:18 +02:00
6c6c63114f improve creating challenges 2018-09-01 12:01:35 +02:00
fac5d451d6 use space instead of slash as separator between test-name and metric (in tables)
so that the hear could be wrapped in a more narrow column if needed
(e.g. if the number of test sets/metric is large)
2018-09-01 10:46:39 +02:00
d9a3d56d3d improve showing query results, fix issue with triggering evaluation for the first time 2018-08-30 21:58:27 +02:00
9d54deb68f show param graphs for all non-dev tests 2018-07-28 21:22:52 +02:00
c60e786025 refactor getMainTest 2018-07-28 19:59:29 +02:00
0a51543957 show graphs for params 2018-07-28 17:04:27 +02:00