Scholars@Duke publication: Class Incremental Learning for Character String Recognition

Class Incremental Learning for Character String Recognition

Publication, Chapter

Hu, Y; Zhang, YM; Huang, K; Wang, QF

January 1, 2024

Character string recognition (CSR) has drawn much attention for document intelligence, but its performance is limited by a predefined character set, which leaves the model unable to recognize new characters. To overcome this issue, class incremental learning (CIL) can be adopted, in which the model learns from new data instances incrementally over time. However, it is challenging to apply existing CIL methods directly to CSR because CSR is a typical sequence recognition problem: without accurate alignment, recognition errors on new characters affect the recognition of other characters in the same sequence. Moreover, new characters are usually far fewer than old ones, resulting in a data imbalance issue when learning new classes. To tackle the misalignment issue, we decouple the learning of feature alignment and classifiers during the incremental process in CSR. To handle the data imbalance issue, we propose a Prototype Incremental Learning framework for CSR, namely PIL-CSR. Within the PIL-CSR framework, we propose a prototype-centered loss (PCL) to help the model achieve better class separation, and we further propose a prototype separation and feature alignment (PSFA) strategy that allows the model to adapt to and learn new classes seamlessly. Finally, we collect a CSR dataset to evaluate CIL performance (github.com/tambourine666/Doc-CIL). Experimental results demonstrate the effectiveness of our proposed sequence CIL method, which obtains a significant improvement in both line-level and character-level accuracy.
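The abstract does not give the formulation of PCL or PSFA, so the following is only a generic sketch of the prototype idea it refers to, not the authors' method. It assumes that a prototype is the mean feature vector of a class, that classification picks the nearest prototype, and that a center-style loss measures how tightly features cluster around their own prototype. All function names, labels, and feature values are hypothetical and chosen for illustration.

```python
# Illustrative sketch only: a generic prototype classifier with a center-style loss.
# This is NOT the paper's PCL/PSFA; names and data below are hypothetical.

def class_prototypes(features, labels):
    """Return {label: mean feature vector} for the given samples."""
    sums, counts = {}, {}
    for vec, lab in zip(features, labels):
        acc = sums.setdefault(lab, [0.0] * len(vec))
        for i, v in enumerate(vec):
            acc[i] += v
        counts[lab] = counts.get(lab, 0) + 1
    return {lab: [s / counts[lab] for s in acc] for lab, acc in sums.items()}

def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def nearest_prototype(vec, prototypes):
    """Predict the label whose prototype is closest to vec."""
    return min(prototypes, key=lambda lab: sq_dist(vec, prototypes[lab]))

def center_style_loss(features, labels, prototypes):
    """Mean squared distance of each feature to its own class prototype."""
    total = sum(sq_dist(v, prototypes[l]) for v, l in zip(features, labels))
    return total / len(features)

# Base step: two "old" character classes with two samples each.
old_feats = [[0.0, 0.0], [0.2, 0.0], [1.0, 1.0], [1.0, 1.2]]
old_labels = ["a", "b"][0:1] * 2 + ["b"] * 2
protos = class_prototypes(old_feats, old_labels)

# Incremental step: one new class with a single sample (the imbalanced case).
# Only a new prototype is added; the old prototypes are left untouched.
new_feats = [[3.0, 0.0]]
new_labels = ["c"]
protos.update(class_prototypes(new_feats, new_labels))

pred = nearest_prototype([2.8, 0.1], protos)
loss = center_style_loss(old_feats, old_labels, protos)
```

In this toy setup a new class is registered by adding one prototype, which is one reason prototype-based schemes are often considered when new-class data is scarce. How PIL-CSR actually builds, separates, and aligns its prototypes is described in the chapter itself.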

Duke Scholars

Author Kaizhu Huang DKU Faculty

DOI

10.1007/978-3-031-70549-6_24

Publication Date

January 1, 2024

Volume

14808 LNCS

Start / End Page

405 / 420

Related Subject Headings

Artificial Intelligence & Image Processing
46 Information and computing sciences

Citation

APA

Hu, Y., Zhang, Y. M., Huang, K., & Wang, Q. F. (2024). Class Incremental Learning for Character String Recognition (Vol. 14808 LNCS, pp. 405–420). https://doi.org/10.1007/978-3-031-70549-6_24

Chicago

Hu, Y., Y. M. Zhang, K. Huang, and Q. F. Wang. “Class Incremental Learning for Character String Recognition,” 14808 LNCS:405–20, 2024. https://doi.org/10.1007/978-3-031-70549-6_24.

ICMJE

Hu Y, Zhang YM, Huang K, Wang QF. Class Incremental Learning for Character String Recognition. In 2024. p. 405–20.

MLA

Hu, Y., et al. Class Incremental Learning for Character String Recognition. Vol. 14808 LNCS, 2024, pp. 405–20. Scopus, doi:10.1007/978-3-031-70549-6_24.

NLM

Hu Y, Zhang YM, Huang K, Wang QF. Class Incremental Learning for Character String Recognition. 2024. p. 405–420.
