Skip to main content

CalibreNet: Calibration Networks for Multilingual Sequence Labeling

Publication ,  Conference
Liang, S; Shou, L; Pei, J; Gong, M; Zuo, W; Jiang, D
Published in: WSDM 2021 - Proceedings of the 14th ACM International Conference on Web Search and Data Mining
August 3, 2021

Lack of training data in low-resource languages presents huge challenges to sequence labeling tasks such as named entity recognition (NER) and machine reading comprehension (MRC). One major obstacle is the errors on the boundary of predicted answers. To tackle this problem, we propose CalibreNet, which predicts answers in two steps. In the first step, any existing sequence labeling method can be adopted as a base model to generate an initial answer. In the second step, CalibreNet refines the boundary of the initial answer. To tackle the challenge of lack of training data in low-resource languages, we dedicatedly develop a novel unsupervised phrase boundary recovery pre-training task to enhance the multilingual boundary detection capability of CalibreNet. Experiments on two cross-lingual benchmark datasets show that the proposed approach achieves SOTA results on zero-shot cross-lingual NER and MRC tasks.

Duke Scholars

Published In

WSDM 2021 - Proceedings of the 14th ACM International Conference on Web Search and Data Mining

DOI

Publication Date

August 3, 2021

Start / End Page

842 / 850
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Liang, S., Shou, L., Pei, J., Gong, M., Zuo, W., & Jiang, D. (2021). CalibreNet: Calibration Networks for Multilingual Sequence Labeling. In WSDM 2021 - Proceedings of the 14th ACM International Conference on Web Search and Data Mining (pp. 842–850). https://doi.org/10.1145/3437963.3441728
Liang, S., L. Shou, J. Pei, M. Gong, W. Zuo, and D. Jiang. “CalibreNet: Calibration Networks for Multilingual Sequence Labeling.” In WSDM 2021 - Proceedings of the 14th ACM International Conference on Web Search and Data Mining, 842–50, 2021. https://doi.org/10.1145/3437963.3441728.
Liang S, Shou L, Pei J, Gong M, Zuo W, Jiang D. CalibreNet: Calibration Networks for Multilingual Sequence Labeling. In: WSDM 2021 - Proceedings of the 14th ACM International Conference on Web Search and Data Mining. 2021. p. 842–50.
Liang, S., et al. “CalibreNet: Calibration Networks for Multilingual Sequence Labeling.” WSDM 2021 - Proceedings of the 14th ACM International Conference on Web Search and Data Mining, 2021, pp. 842–50. Scopus, doi:10.1145/3437963.3441728.
Liang S, Shou L, Pei J, Gong M, Zuo W, Jiang D. CalibreNet: Calibration Networks for Multilingual Sequence Labeling. WSDM 2021 - Proceedings of the 14th ACM International Conference on Web Search and Data Mining. 2021. p. 842–850.

Published In

WSDM 2021 - Proceedings of the 14th ACM International Conference on Web Search and Data Mining

DOI

Publication Date

August 3, 2021

Start / End Page

842 / 850