2022-12-23 16:48:46 +01:00
|
|
|
|
2022-12-23 16:51:31 +01:00
|
|
|
Book layout
|
|
|
|
==============================
|
|
|
|
|
|
|
|
Detect structure of given page.
|
|
|
|
|
|
|
|
Format
|
|
|
|
------
|
|
|
|
|
2023-04-02 14:53:14 +02:00
|
|
|
Each clipping found in a corresponding PDF/DjVu file. Each clipping should be given as L:X0,Y0,X1,Y1,
|
2022-12-23 16:51:31 +01:00
|
|
|
where:
|
|
|
|
|
2023-04-02 14:53:14 +02:00
|
|
|
L — label
|
2022-12-23 16:51:31 +01:00
|
|
|
X0, Y0, X1, Y1 — clipping coordinates (in pixels)
|
|
|
|
|
|
|
|
|
|
|
|
Directory structure
|
|
|
|
-------------------
|
|
|
|
|
|
|
|
* `README.md` — this file
|
|
|
|
* `config.txt` — configuration file
|
|
|
|
* `train/` — directory with training data
|
|
|
|
* `train/in.tsv` — input data for the train set
|
|
|
|
* `train/expected.tsv` — expected (reference) data for the train set
|
|
|
|
* `dev-0/` — directory with dev (test) data
|
|
|
|
* `dev-0/in.tsv` — input data for the dev set
|
|
|
|
* `dev-0/expected.tsv` — expected (reference) data for the dev set
|
|
|
|
* `test-A` — directory with test data
|
|
|
|
* `test-A/in.tsv` — input data for the test set
|
|
|
|
* `test-A/expected.tsv` — expected (reference) data for the test set
|