Skip to main content

Entity disambiguation by knowledge and text jointly embedding

Publication ,  Conference
Fang, W; Zhang, J; Wang, D; Chen, Z; Li, M
Published in: CoNLL 2016 - 20th SIGNLL Conference on Computational Natural Language Learning, Proceedings
January 1, 2016

For most entity disambiguation systems, the secret recipes are feature representations for mentions and entities, most of which are based on Bag-of-Words (BoW) representations. Commonly, BoW has several drawbacks: (1) It ignores the intrinsic meaning of words/entities; (2) It often results in high-dimension vector spaces and expensive computation; (3) For different applications, methods of designing handcrafted representations may be quite different, lacking of a general guideline. In this paper, we propose a different approach named EDKate. We first learn low-dimensional continuous vector representations for entities and words by jointly embedding knowledge base and text in the same vector space. Then we utilize these embeddings to design simple but effective features and build a two-layer disambiguation model. Extensive experiments on real-world data sets show that (1) The embedding-based features are very effective. Even a single one embedding-based feature can beat the combination of several BoW-based features. (2) The superiority is even more promising in a difficult set where the mention-entity prior cannot work well. (3) The proposed embedding method is much better than trivial implementations of some off-the-shelf embedding algorithms. (4) We compared our EDKate with existing methods/systems and the results are also positive.

Duke Scholars

Published In

CoNLL 2016 - 20th SIGNLL Conference on Computational Natural Language Learning, Proceedings

DOI

Publication Date

January 1, 2016

Start / End Page

260 / 269
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Fang, W., Zhang, J., Wang, D., Chen, Z., & Li, M. (2016). Entity disambiguation by knowledge and text jointly embedding. In CoNLL 2016 - 20th SIGNLL Conference on Computational Natural Language Learning, Proceedings (pp. 260–269). https://doi.org/10.18653/v1/k16-1026
Fang, W., J. Zhang, D. Wang, Z. Chen, and M. Li. “Entity disambiguation by knowledge and text jointly embedding.” In CoNLL 2016 - 20th SIGNLL Conference on Computational Natural Language Learning, Proceedings, 260–69, 2016. https://doi.org/10.18653/v1/k16-1026.
Fang W, Zhang J, Wang D, Chen Z, Li M. Entity disambiguation by knowledge and text jointly embedding. In: CoNLL 2016 - 20th SIGNLL Conference on Computational Natural Language Learning, Proceedings. 2016. p. 260–9.
Fang, W., et al. “Entity disambiguation by knowledge and text jointly embedding.” CoNLL 2016 - 20th SIGNLL Conference on Computational Natural Language Learning, Proceedings, 2016, pp. 260–69. Scopus, doi:10.18653/v1/k16-1026.
Fang W, Zhang J, Wang D, Chen Z, Li M. Entity disambiguation by knowledge and text jointly embedding. CoNLL 2016 - 20th SIGNLL Conference on Computational Natural Language Learning, Proceedings. 2016. p. 260–269.

Published In

CoNLL 2016 - 20th SIGNLL Conference on Computational Natural Language Learning, Proceedings

DOI

Publication Date

January 1, 2016

Start / End Page

260 / 269