Skip to main content
Journal cover image

Approximating the crowd

Publication ,  Journal Article
Ertekin, S; Rudin, C; Hirsh, H
Published in: Data Mining and Knowledge Discovery
January 1, 2014

The problem of "approximating the crowd" is that of estimating the crowd's majority opinion by querying only a subset of it. Algorithms that approximate the crowd can intelligently stretch a limited budget for a crowdsourcing task. We present an algorithm, "CrowdSense," that works in an online fashion where items come one at a time. CrowdSense dynamically samples subsets of the crowd based on an exploration/exploitation criterion. The algorithm produces a weighted combination of the subset's votes that approximates the crowd's opinion. We then introduce two variations of CrowdSense that make various distributional approximations to handle distinct crowd characteristics. In particular, the first algorithm makes a statistical independence approximation of the labelers for large crowds, whereas the second algorithm finds a lower bound on how often the current subcrowd agrees with the crowd's majority vote. Our experiments on CrowdSense and several baselines demonstrate that we can reliably approximate the entire crowd's vote by collecting opinions from a representative subset of the crowd. © 2014 The Author(s).

Duke Scholars

Altmetric Attention Stats
Dimensions Citation Stats

Published In

Data Mining and Knowledge Discovery

DOI

ISSN

1384-5810

Publication Date

January 1, 2014

Volume

28

Issue

5-6

Start / End Page

1189 / 1221

Related Subject Headings

  • Artificial Intelligence & Image Processing
  • 46 Information and computing sciences
  • 0806 Information Systems
  • 0804 Data Format
  • 0801 Artificial Intelligence and Image Processing
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Ertekin, S., Rudin, C., & Hirsh, H. (2014). Approximating the crowd. Data Mining and Knowledge Discovery, 28(5–6), 1189–1221. https://doi.org/10.1007/s10618-014-0354-1
Ertekin, S., C. Rudin, and H. Hirsh. “Approximating the crowd.” Data Mining and Knowledge Discovery 28, no. 5–6 (January 1, 2014): 1189–1221. https://doi.org/10.1007/s10618-014-0354-1.
Ertekin S, Rudin C, Hirsh H. Approximating the crowd. Data Mining and Knowledge Discovery. 2014 Jan 1;28(5–6):1189–221.
Ertekin, S., et al. “Approximating the crowd.” Data Mining and Knowledge Discovery, vol. 28, no. 5–6, Jan. 2014, pp. 1189–221. Scopus, doi:10.1007/s10618-014-0354-1.
Ertekin S, Rudin C, Hirsh H. Approximating the crowd. Data Mining and Knowledge Discovery. 2014 Jan 1;28(5–6):1189–1221.
Journal cover image

Published In

Data Mining and Knowledge Discovery

DOI

ISSN

1384-5810

Publication Date

January 1, 2014

Volume

28

Issue

5-6

Start / End Page

1189 / 1221

Related Subject Headings

  • Artificial Intelligence & Image Processing
  • 46 Information and computing sciences
  • 0806 Information Systems
  • 0804 Data Format
  • 0801 Artificial Intelligence and Image Processing