Estimating the Performance of Entity Resolution Algorithms: Lessons Learned Through PatentsView.org
Publication
, Journal Article
Binette, O; York, SA; Hickerson, E; Baek, Y; Madhavan, S; Jones, C
Published in: American Statistician
January 1, 2023
This article introduces a novel evaluation methodology for entity resolution algorithms. It is motivated by PatentsView.org, a public-use patent data exploration platform that disambiguates patent inventors using an entity resolution algorithm. We provide a data collection methodology and tailored performance estimators that account for sampling biases. Our approach is simple, practical, and principled—key characteristics that allow us to paint the first representative picture of PatentsView’s disambiguation performance. The results are used to inform PatentsView’s users of the reliability of the data and to allow the comparison of competing disambiguation algorithms.
Duke Scholars
Altmetric Attention Stats
Dimensions Citation Stats
Published In
American Statistician
DOI
EISSN
1537-2731
ISSN
0003-1305
Publication Date
January 1, 2023
Volume
77
Issue
4
Start / End Page
370 / 380
Related Subject Headings
- Statistics & Probability
- 4905 Statistics
- 0104 Statistics
Citation
APA
Chicago
ICMJE
MLA
NLM
Binette, O., York, S. A., Hickerson, E., Baek, Y., Madhavan, S., & Jones, C. (2023). Estimating the Performance of Entity Resolution Algorithms: Lessons Learned Through PatentsView.org. American Statistician, 77(4), 370–380. https://doi.org/10.1080/00031305.2023.2191664
Binette, O., S. A. York, E. Hickerson, Y. Baek, S. Madhavan, and C. Jones. “Estimating the Performance of Entity Resolution Algorithms: Lessons Learned Through PatentsView.org.” American Statistician 77, no. 4 (January 1, 2023): 370–80. https://doi.org/10.1080/00031305.2023.2191664.
Binette O, York SA, Hickerson E, Baek Y, Madhavan S, Jones C. Estimating the Performance of Entity Resolution Algorithms: Lessons Learned Through PatentsView.org. American Statistician. 2023 Jan 1;77(4):370–80.
Binette, O., et al. “Estimating the Performance of Entity Resolution Algorithms: Lessons Learned Through PatentsView.org.” American Statistician, vol. 77, no. 4, Jan. 2023, pp. 370–80. Scopus, doi:10.1080/00031305.2023.2191664.
Binette O, York SA, Hickerson E, Baek Y, Madhavan S, Jones C. Estimating the Performance of Entity Resolution Algorithms: Lessons Learned Through PatentsView.org. American Statistician. 2023 Jan 1;77(4):370–380.
Published In
American Statistician
DOI
EISSN
1537-2731
ISSN
0003-1305
Publication Date
January 1, 2023
Volume
77
Issue
4
Start / End Page
370 / 380
Related Subject Headings
- Statistics & Probability
- 4905 Statistics
- 0104 Statistics