Scholars@Duke publication: CalibreNet: Calibration Networks for Multilingual Sequence Labeling

CalibreNet: Calibration Networks for Multilingual Sequence Labeling

Publication , Conference

Liang, S; Shou, L; Pei, J; Gong, M; Zuo, W; Jiang, D

Published in: Wsdm 2021 Proceedings of the 14th ACM International Conference on Web Search and Data Mining

August 3, 2021

Lack of training data in low-resource languages presents huge challenges to sequence labeling tasks such as named entity recognition (NER) and machine reading comprehension (MRC). One major obstacle is the errors on the boundary of predicted answers. To tackle this problem, we propose CalibreNet, which predicts answers in two steps. In the first step, any existing sequence labeling method can be adopted as a base model to generate an initial answer. In the second step, CalibreNet refines the boundary of the initial answer. To tackle the challenge of lack of training data in low-resource languages, we dedicatedly develop a novel unsupervised phrase boundary recovery pre-training task to enhance the multilingual boundary detection capability of CalibreNet. Experiments on two cross-lingual benchmark datasets show that the proposed approach achieves SOTA results on zero-shot cross-lingual NER and MRC tasks.

Duke Scholars

Author Jian Pei Computer Science

Published In

Wsdm 2021 Proceedings of the 14th ACM International Conference on Web Search and Data Mining

DOI

10.1145/3437963.3441728

Publication Date

August 3, 2021

Start / End Page

842 / 850

Citation

APA

Chicago

ICMJE

MLA

NLM

Liang, S., Shou, L., Pei, J., Gong, M., Zuo, W., & Jiang, D. (2021). CalibreNet: Calibration Networks for Multilingual Sequence Labeling. In Wsdm 2021 Proceedings of the 14th ACM International Conference on Web Search and Data Mining (pp. 842–850). https://doi.org/10.1145/3437963.3441728

Liang, S., L. Shou, J. Pei, M. Gong, W. Zuo, and D. Jiang. “CalibreNet: Calibration Networks for Multilingual Sequence Labeling.” In Wsdm 2021 Proceedings of the 14th ACM International Conference on Web Search and Data Mining, 842–50, 2021. https://doi.org/10.1145/3437963.3441728.

Liang S, Shou L, Pei J, Gong M, Zuo W, Jiang D. CalibreNet: Calibration Networks for Multilingual Sequence Labeling. In: Wsdm 2021 Proceedings of the 14th ACM International Conference on Web Search and Data Mining. 2021. p. 842–50.

Liang, S., et al. “CalibreNet: Calibration Networks for Multilingual Sequence Labeling.” Wsdm 2021 Proceedings of the 14th ACM International Conference on Web Search and Data Mining, 2021, pp. 842–50. Scopus, doi:10.1145/3437963.3441728.

Liang S, Shou L, Pei J, Gong M, Zuo W, Jiang D. CalibreNet: Calibration Networks for Multilingual Sequence Labeling. Wsdm 2021 Proceedings of the 14th ACM International Conference on Web Search and Data Mining. 2021. p. 842–850.

Published In

Wsdm 2021 Proceedings of the 14th ACM International Conference on Web Search and Data Mining

DOI

10.1145/3437963.3441728

Publication Date

August 3, 2021

Start / End Page

842 / 850