Entity disambiguation by knowledge and text jointly embedding
For most entity disambiguation systems, the secret recipe lies in the feature representations for mentions and entities, most of which are based on Bag-of-Words (BoW) representations. However, BoW has several drawbacks: (1) it ignores the intrinsic meaning of words and entities; (2) it often yields high-dimensional vector spaces and expensive computation; (3) handcrafted representations must be redesigned for each application, with no general guideline. In this paper, we propose a different approach named EDKate. We first learn low-dimensional continuous vector representations for entities and words by jointly embedding the knowledge base and text in the same vector space. We then use these embeddings to design simple but effective features and build a two-layer disambiguation model. Extensive experiments on real-world data sets show that (1) the embedding-based features are very effective: even a single embedding-based feature can beat a combination of several BoW-based features; (2) the advantage is even more pronounced on a difficult subset where the mention-entity prior does not work well; (3) the proposed embedding method clearly outperforms straightforward applications of off-the-shelf embedding algorithms; and (4) EDKate compares favorably with existing methods and systems.
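To make the idea of an embedding-based feature concrete, the sketch below shows one plausible instance: scoring each candidate entity by the cosine similarity between its vector and the averaged context-word vector in the shared space. This is a minimal illustration under assumed trained embeddings, not the paper's actual feature design; the names `word_emb`, `entity_emb`, and `context_entity_feature` are hypothetical.

```python
import numpy as np

def cosine(u, v):
    """Cosine similarity between two vectors."""
    return float(u @ v / (np.linalg.norm(u) * np.linalg.norm(v) + 1e-12))

def context_entity_feature(context_words, candidate, word_emb, entity_emb):
    """Illustrative embedding-based feature: similarity between the averaged
    context-word vector and a candidate entity's vector, assuming both were
    trained jointly in the same vector space."""
    vecs = [word_emb[w] for w in context_words if w in word_emb]
    if not vecs:
        return 0.0
    ctx = np.mean(vecs, axis=0)
    return cosine(ctx, entity_emb[candidate])

# Toy usage: random vectors stand in for trained joint embeddings.
rng = np.random.default_rng(0)
word_emb = {w: rng.normal(size=50) for w in ["fruit", "pie", "laptop"]}
entity_emb = {e: rng.normal(size=50) for e in ["Apple_Inc.", "Apple_(fruit)"]}
scores = {e: context_entity_feature(["fruit", "pie"], e, word_emb, entity_emb)
          for e in entity_emb}
print(max(scores, key=scores.get))  # candidate with the highest feature value
```

Because the words and entities live in one space, a single scalar like this can carry the semantic signal that several BoW-based features would otherwise have to approximate.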