mirror of
https://github.com/docling-project/docling-eval.git
synced 2026-05-17 13:10:47 +00:00
c458dc5de1
* Update PubTabNet_benchmarks.md With the default `--split test`, the create-gt method throws an exception `Error creating dataset builder: Unknown split "test". Should be one of ['train', 'val'].`. Indeed, https://huggingface.co/datasets/ds4sd/PubTabNet_OTSL only has train and val splits. Given this, I believe using the `val` split is more suitable for this dataset. Signed-off-by: laurachiticariu <chiti@us.ibm.com> * Small typo Signed-off-by: laurachiticariu <chiti@us.ibm.com> * Update PubTabNet_benchmarks.md Signed-off-by: laurachiticariu <chiti@us.ibm.com> --------- Signed-off-by: laurachiticariu <chiti@us.ibm.com>