Skip to main content

Coarse-to-Fine Document Image Registration for Dewarping

Publication ,  Chapter
Zhang, W; Wang, Q; Huang, K; Gu, X; Guo, F
January 1, 2024

Document dewarping has made great progress in recent years, however it usually requires huge document pairs with pixel-level annotation to learn a mapping function. Although photographed document images are easy to obtain, the pixel-level annotation between warped and flat images is time-consuming and almost impossible for large-scale datasets. To overcome this issue, we propose to register photographed documents with corresponding flat counterparts, obtaining the auto-annotation of pixel-level mapping labels. Due to the severe deformation in the real photographed documents, we introduce a coarse-to-fine registration pipeline to learn global-scale transformation and local details alignment respectively. In addition, the lack of registration labels motivates us to tailor a teacher-student dual branch under semi-supervised training, where the model is initialized on synthetic documents with labels. Furthermore, we contribute a large-scale dataset containing 12,500 triplets of synthetic-real-flat documents. Extensive experiments demonstrate the effectiveness of our proposed registration method. Specifically, trained by our registered pixel-level documents, the dewarping model can obtain comparable performance with SOTAs trained by almost 100× scale of samples, showing the high quality of our registration results. Our dataset and code are available at https://github.com/hanquansanren/DIRD.

Duke Scholars

DOI

Publication Date

January 1, 2024

Volume

14807 LNCS

Start / End Page

343 / 358

Related Subject Headings

  • Artificial Intelligence & Image Processing
  • 46 Information and computing sciences
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Zhang, W., Wang, Q., Huang, K., Gu, X., & Guo, F. (2024). Coarse-to-Fine Document Image Registration for Dewarping (Vol. 14807 LNCS, pp. 343–358). https://doi.org/10.1007/978-3-031-70546-5_20
Zhang, W., Q. Wang, K. Huang, X. Gu, and F. Guo. “Coarse-to-Fine Document Image Registration for Dewarping,” 14807 LNCS:343–58, 2024. https://doi.org/10.1007/978-3-031-70546-5_20.
Zhang W, Wang Q, Huang K, Gu X, Guo F. Coarse-to-Fine Document Image Registration for Dewarping. In 2024. p. 343–58.
Zhang, W., et al. Coarse-to-Fine Document Image Registration for Dewarping. Vol. 14807 LNCS, 2024, pp. 343–58. Scopus, doi:10.1007/978-3-031-70546-5_20.
Zhang W, Wang Q, Huang K, Gu X, Guo F. Coarse-to-Fine Document Image Registration for Dewarping. 2024. p. 343–358.

DOI

Publication Date

January 1, 2024

Volume

14807 LNCS

Start / End Page

343 / 358

Related Subject Headings

  • Artificial Intelligence & Image Processing
  • 46 Information and computing sciences