Go to file
s444415 ee23d7002e Wikisource trained donut 2023-03-23 11:46:01 +00:00
dev-0 Wikisource trained donut 2023-03-23 11:46:01 +00:00
images Init 2022-07-07 17:11:35 +02:00
test-A Wikisource trained donut 2023-03-23 11:46:01 +00:00
train Init 2022-07-07 17:11:35 +02:00
.gitignore Init 2022-07-07 17:11:35 +02:00
README.md Init 2022-07-07 17:11:35 +02:00
config.txt Init 2022-07-07 17:11:35 +02:00
get-annexed-files.sh Init 2022-07-07 17:11:35 +02:00
in-header.tsv Init 2022-07-07 17:11:35 +02:00
out-header.tsv Init 2022-07-07 17:11:35 +02:00

README.md

Diachronic OCR challenge

Do OCR of a Polish historical text (or post-correction of Tesseract OCR)

Do not do modernization.

Metadata

Tags: pol, ocr, diachronic