Wiki Historian En
Guess the masked date in a Wikipedia article.
Only dates from 1835 to 1935 are considered.
Predict only the one masked date marked as [DATEPREDICT] in each article; other masked dates are marked as [DATEMASK]. The predicted date should be in fractional year format.
Fractional year format:
Year is normalized as follows:
```
days_in_year = 366 if is_leap else 365
normalized = d.year + ((day_of_year - 1) / days_in_year)
```
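The normalization can be sketched as a small, self-contained Python function. The function name `to_fractional_year` is an assumption made for illustration, as is the use of `calendar.isleap` and `tm_yday` to obtain `is_leap` and `day_of_year`; the challenge itself only gives the formula above.

```python
import calendar
from datetime import date


def to_fractional_year(d: date) -> float:
    """Convert a calendar date to the fractional year format.

    Hypothetical helper: the name and signature are not part of the challenge.
    """
    # Leap years have 366 days, all other years 365.
    days_in_year = 366 if calendar.isleap(d.year) else 365
    # tm_yday is 1-based, so 1 January is day 1 and maps to a fraction of 0.
    day_of_year = d.timetuple().tm_yday
    return d.year + (day_of_year - 1) / days_in_year


print(to_fractional_year(date(1900, 1, 1)))  # 1900.0
print(to_fractional_year(date(1904, 7, 2)))  # 1904.5 (day 184 of a leap year)
```

Subtracting 1 from the day of the year means 1 January maps to the bare year, and the last day of the year stays strictly below the next year.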
Directory structure
- `README.md` - this file
- `config.txt` - configuration file
- `train/` - directory with training data
- `train/in.tsv` - input data for the train set
- `train/expected.tsv` - expected (reference) data for the train set
- `dev-0/` - directory with dev (test) data
- `dev-0/in.tsv` - input data for the dev set
- `dev-0/expected.tsv` - expected (reference) data for the dev set
- `test-A/` - directory with test data
- `test-A/in.tsv` - input data for the test set
- `test-A/expected.tsv` - expected (reference) data for the test set
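
Given this layout, one split can be loaded roughly as follows. This is a minimal sketch, not part of the challenge: the helper name `load_split` is hypothetical, and it assumes that `expected.tsv` holds one fractional year per line and that its lines are aligned one-to-one with the lines of `in.tsv`. The column layout of `in.tsv` is not described here, so each input line is kept as a raw string.

```python
from pathlib import Path


def load_split(split_dir):
    """Pair each raw input line with its expected fractional year.

    Hypothetical helper. Assumes in.tsv and expected.tsv are line-aligned
    and that expected.tsv contains one number per line.
    """
    split = Path(split_dir)
    with open(split / "in.tsv", encoding="utf-8") as f:
        inputs = [line.rstrip("\n") for line in f]
    with open(split / "expected.tsv", encoding="utf-8") as f:
        targets = [float(line) for line in f if line.strip()]
    if len(inputs) != len(targets):
        raise ValueError(
            f"{split}: {len(inputs)} input lines but {len(targets)} expected values"
        )
    return list(zip(inputs, targets))
```

For example, `load_split("dev-0")` would return a list of `(input_line, fractional_year)` pairs for the dev set, provided the assumptions above hold.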