shannons's picture
Upload README.md with huggingface_hub
4ad9b34 verified
metadata
license: apache-2.0
tags:
  - document-layout-analysis
  - yolo
  - object-detection
base_model: juliozhao/DocLayout-YOLO-DocStructBench
arxiv: '2410.12628'

DocLayout-YOLO-DocStructBench (Fork)

This is a pinned fork of juliozhao/DocLayout-YOLO-DocStructBench for use by paper-cli. We pin to a specific snapshot to avoid upstream changes breaking our pipeline.

Original Work

All credit for this model goes to the original authors:

DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception

Zhiyuan Zhao, Hengrui Kang, Bin Wang, Conghui He

arXiv:2410.12628 | GitHub

@article{zhao2024doclayout,
  title={DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception},
  author={Zhao, Zhiyuan and Kang, Hengrui and Wang, Bin and He, Conghui},
  journal={arXiv preprint arXiv:2410.12628},
  year={2024}
}

License

Apache 2.0 (same as the original).