Skip to main content

Mix-Up Augmentation for Oracle Character Recognition with Imbalanced Data Distribution

Publication ,  Conference
Li, J; Wang, QF; Zhang, R; Huang, K
Published in: Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics
January 1, 2021

Oracle bone characters are probably the oldest hieroglyphs in China. It is of significant impact to recognize such characters since they can provide important clues for Chinese archaeology and philology. Automatic oracle bone character recognition however remains to be a challenging problem. In particular, due to the inherited nature, oracle characters are typically very limited and also seriously imbalanced in most available oracle datasets, which greatly hinders the research in automatic oracle bone character recognition. To alleviate this problem, we propose to design the mix-up strategy that leverages information from both majority and minority classes to augment samples of minority classes such that their boundaries can be pushed away towards majority classes. As a result, the training bias resulted from majority classes can be largely reduced. In addition, we consolidate our new framework with both the softmax loss and triplet loss on the augmented samples which proves able to improve the classification accuracy further. We conduct extensive evaluations w.r.t. both total class accuracy and average class accuracy on three benchmark datasets (i.e., Oracle-20K, Oracle-AYNU and OBC306). Experimental results show that the proposed method can result in superior performance to the comparison approaches, attaining a new state of the art in oracle bone character recognition.

Duke Scholars

Published In

Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics

DOI

EISSN

1611-3349

ISSN

0302-9743

Publication Date

January 1, 2021

Volume

12821 LNCS

Start / End Page

237 / 251

Related Subject Headings

  • Artificial Intelligence & Image Processing
  • 46 Information and computing sciences
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Li, J., Wang, Q. F., Zhang, R., & Huang, K. (2021). Mix-Up Augmentation for Oracle Character Recognition with Imbalanced Data Distribution. In Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics (Vol. 12821 LNCS, pp. 237–251). https://doi.org/10.1007/978-3-030-86549-8_16
Li, J., Q. F. Wang, R. Zhang, and K. Huang. “Mix-Up Augmentation for Oracle Character Recognition with Imbalanced Data Distribution.” In Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, 12821 LNCS:237–51, 2021. https://doi.org/10.1007/978-3-030-86549-8_16.
Li J, Wang QF, Zhang R, Huang K. Mix-Up Augmentation for Oracle Character Recognition with Imbalanced Data Distribution. In: Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics. 2021. p. 237–51.
Li, J., et al. “Mix-Up Augmentation for Oracle Character Recognition with Imbalanced Data Distribution.” Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, vol. 12821 LNCS, 2021, pp. 237–51. Scopus, doi:10.1007/978-3-030-86549-8_16.
Li J, Wang QF, Zhang R, Huang K. Mix-Up Augmentation for Oracle Character Recognition with Imbalanced Data Distribution. Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics. 2021. p. 237–251.

Published In

Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics

DOI

EISSN

1611-3349

ISSN

0302-9743

Publication Date

January 1, 2021

Volume

12821 LNCS

Start / End Page

237 / 251

Related Subject Headings

  • Artificial Intelligence & Image Processing
  • 46 Information and computing sciences