Skip to main content

Improving grouped-entity resolution using Quasi-Cliques

Publication ,  Conference
On, BW; Elmacioglu, E; Lee, D; Kangt, J; Pei, J
Published in: Proceedings - IEEE International Conference on Data Mining, ICDM
January 1, 2006

The entity resolution (ER) problem, which identifies duplicate entities that refer to the same real world entity, is essential in many applications. In this paper, in particular, we focus on resolving entities that contain a group of related elements in them (e.g., an author entity with a list of citations, a singer entity with song list, or an intermediate result by GROUP BY SQL query). Such entities, named as grouped-entities, frequently occur in many applications. The previous approaches toward grouped-entity resolution often rely on textual similarity, and produce a large number of false positives. As a complementing technique, in this paper, we present our experience of applying a recently proposed graph mining technique, Quasi-Clique, atop conventional ER solutions. Our approach exploits contextual information mined from the group of elements per entity in addition to syntactic similarity. Extensive experiments verify that our proposal improves precision and recall up to 83% when used together with a variety of existing ER solutions, but never worsens them. © 2006 IEEE.

Duke Scholars

Published In

Proceedings - IEEE International Conference on Data Mining, ICDM

DOI

ISSN

1550-4786

Publication Date

January 1, 2006

Start / End Page

1008 / 1015
 

Citation

APA
Chicago
ICMJE
MLA
NLM
On, B. W., Elmacioglu, E., Lee, D., Kangt, J., & Pei, J. (2006). Improving grouped-entity resolution using Quasi-Cliques. In Proceedings - IEEE International Conference on Data Mining, ICDM (pp. 1008–1015). https://doi.org/10.1109/ICDM.2006.85
On, B. W., E. Elmacioglu, D. Lee, J. Kangt, and J. Pei. “Improving grouped-entity resolution using Quasi-Cliques.” In Proceedings - IEEE International Conference on Data Mining, ICDM, 1008–15, 2006. https://doi.org/10.1109/ICDM.2006.85.
On BW, Elmacioglu E, Lee D, Kangt J, Pei J. Improving grouped-entity resolution using Quasi-Cliques. In: Proceedings - IEEE International Conference on Data Mining, ICDM. 2006. p. 1008–15.
On, B. W., et al. “Improving grouped-entity resolution using Quasi-Cliques.” Proceedings - IEEE International Conference on Data Mining, ICDM, 2006, pp. 1008–15. Scopus, doi:10.1109/ICDM.2006.85.
On BW, Elmacioglu E, Lee D, Kangt J, Pei J. Improving grouped-entity resolution using Quasi-Cliques. Proceedings - IEEE International Conference on Data Mining, ICDM. 2006. p. 1008–1015.

Published In

Proceedings - IEEE International Conference on Data Mining, ICDM

DOI

ISSN

1550-4786

Publication Date

January 1, 2006

Start / End Page

1008 / 1015