Crowdsourcing is a computational paradigm whose distinctive feature is the involvement of human workers in key steps of the computation. It is used successfully to address problems that would be hard or impossible for machines to solve. As we highlight in this work, the exclusive use of nonexpert individuals may prove ineffective in some cases, especially when the task at hand or the need for accurate solutions demands some degree of specialization to avoid excessive uncertainty and inconsistency in the answers. We address this limitation by proposing an approach that combines the wisdom of the crowd with the educated opinion of experts. We present a computational model for crowdsourcing that envisions two classes of workers with different expertise levels. One of its distinctive features is the adoption of the threshold error model, whose roots are in psychometrics and which we extend from previous theoretical work. Our computational model allows us to evaluate the performance of crowdsourcing algorithms with respect to accuracy and cost. We use our model to develop and analyze an algorithm for approximating the best, in a broad sense, of a set of elements. The algorithm uses naïve and expert workers to find an element that is a constant-factor approximation to the best. We prove upper and lower bounds on the number of comparisons needed to solve this problem, showing that our algorithm uses expert and naïve workers optimally up to a constant factor. Finally, we evaluate our algorithm on real and synthetic datasets using the CrowdFlower crowdsourcing platform, showing that our approach is also effective in practice.
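The threshold error model mentioned in the abstract can be illustrated with a small simulation. The sketch below is not the paper's algorithm. It assumes one common form of the model (a worker always answers correctly when two values differ by more than a threshold `delta`, and answers arbitrarily otherwise) and pairs it with a plain round-robin tournament, a simple stand-in technique. The names `threshold_compare` and `round_robin_max`, and the thresholds 10.0 and 1.0 used for naïve and expert workers, are hypothetical.

```python
import random
from itertools import combinations


def threshold_compare(a, b, delta, rng):
    """Simulate one worker comparison; return the value judged larger.

    Assumed model: a gap larger than `delta` is always judged correctly,
    a gap of at most `delta` yields an arbitrary (here random) answer.
    """
    if abs(a - b) > delta:
        return max(a, b)
    return rng.choice((a, b))


def round_robin_max(values, delta, rng):
    """Compare every pair once and return the value with the most wins."""
    wins = [0] * len(values)
    for i, j in combinations(range(len(values)), 2):
        winner = threshold_compare(values[i], values[j], delta, rng)
        wins[i if winner == values[i] else j] += 1
    return values[max(range(len(values)), key=wins.__getitem__)]


rng = random.Random(0)
values = [rng.uniform(0, 100) for _ in range(50)]

# Hypothetical thresholds: naïve workers are less discerning than experts.
naive_pick = round_robin_max(values, delta=10.0, rng=rng)
expert_pick = round_robin_max(values, delta=1.0, rng=rng)
```

Under this assumed model, the round-robin winner is always within `2 * delta` of the true maximum, so a smaller threshold gives a tighter guarantee. The tournament uses a quadratic number of comparisons, which is the kind of cost a scheme mixing naïve and expert workers would aim to reduce.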

The importance of being expert: Efficient max-finding in crowdsourcing / Anagnostopoulos, Aristidis; Becchetti, Luca; Fazzone, Adriano; Mele, Ida; Riondato, Matteo. - (2015), pp. 983-998. (Paper presented at the ACM SIGMOD International Conference on Management of Data, SIGMOD 2015, held in Melbourne, Australia) [10.1145/2723372.2723722].

The importance of being expert: Efficient max-finding in crowdsourcing

Anagnostopoulos, Aristidis; Becchetti, Luca; Fazzone, Adriano; Mele, Ida; Riondato, Matteo
2015

ACM SIGMOD International Conference on Management of Data, SIGMOD 2015
Crowdsourcing; human computation; max algorithm; worker models
04 Conference proceedings publication::04b Conference paper in volume
Files attached to this record

Anagnostopoulos_The-Importance_2015.pdf
Access: repository administrators only
Type: Publisher's version (published version with the publisher's layout)
License: All rights reserved
Size: 1.62 MB
Format: Adobe PDF

Anagnostopoulos_postprint_The-Importance_2015.pdf
Access: open access
Note: https://dl.acm.org/doi/10.1145/2723372.2723722
Type: Post-print (version after peer review, accepted for publication)
License: All rights reserved
Size: 1.14 MB
Format: Adobe PDF


Use this identifier to cite or link to this document: https://hdl.handle.net/11573/842138