Preferences on a Budget: Prioritizing Document Pairs when Crowdsourcing Relevance Judgments

Roitero, Kevin; Checco, Alessandro; Mizzaro, Stefano; Demartini, Gianluca

doi:10.1145/3485447.3511960

In Information Retrieval (IR) evaluation, preference judgments are collected by presenting to the assessors a pair of documents and asking them to select which of the two, if any, is the most relevant. This is an alternative to the classic relevance judgment approach, in which human assessors judge the relevance of a single document on a scale; such an alternative allows to make relative rather than absolute judgments of relevance. While preference judgments are easier for human assessors to perform, the number of possible document pairs to be judged is usually so high that it makes it unfeasible to judge them all. Thus, following a similar idea to pooling strategies for single document relevance judgments where the goal is to sample the most useful documents to be judged, in this work we focus on analyzing alternative ways to sample document pairs to judge, in order to maximize the value of a fixed number of preference judgments that can feasibly be collected. Such value is defined as how well we can evaluate IR systems given a budget, that is, a fixed number of human preference judgments that may be collected. By relying on several datasets featuring relevance judgments gathered by means of experts and crowdsourcing, we experimentally compare alternative strategies to select document pairs and show how different strategies lead to different IR evaluation result quality levels. Our results show that, by using the appropriate procedure, it is possible to achieve good IR evaluation results with a limited number of preference judgments, thus confirming the feasibility of using preference judgments to create IR evaluation collections.

Preferences on a Budget: Prioritizing Document Pairs when Crowdsourcing Relevance Judgments / Roitero, Kevin; Checco, Alessandro; Mizzaro, Stefano; Demartini, Gianluca. - (2022), pp. 319-327. (Intervento presentato al convegno 31st ACM World Wide Web Conference, WWW 2022 tenutosi a Lyon; France) [10.1145/3485447.3511960].

Preferences on a Budget: Prioritizing Document Pairs when Crowdsourcing Relevance Judgments

Kevin Roitero;Alessandro Checco;Stefano Mizzaro;Gianluca Demartini

2022

Abstract

In Information Retrieval (IR) evaluation, preference judgments are collected by presenting to the assessors a pair of documents and asking them to select which of the two, if any, is the most relevant. This is an alternative to the classic relevance judgment approach, in which human assessors judge the relevance of a single document on a scale; such an alternative allows to make relative rather than absolute judgments of relevance. While preference judgments are easier for human assessors to perform, the number of possible document pairs to be judged is usually so high that it makes it unfeasible to judge them all. Thus, following a similar idea to pooling strategies for single document relevance judgments where the goal is to sample the most useful documents to be judged, in this work we focus on analyzing alternative ways to sample document pairs to judge, in order to maximize the value of a fixed number of preference judgments that can feasibly be collected. Such value is defined as how well we can evaluate IR systems given a budget, that is, a fixed number of human preference judgments that may be collected. By relying on several datasets featuring relevance judgments gathered by means of experts and crowdsourcing, we experimentally compare alternative strategies to select document pairs and show how different strategies lead to different IR evaluation result quality levels. Our results show that, by using the appropriate procedure, it is possible to achieve good IR evaluation results with a limited number of preference judgments, thus confirming the feasibility of using preference judgments to create IR evaluation collections.

Scheda breve

Scheda completa

	Anno di pubblicazione
	
				2022
			
	Nome convegno
	
				31st ACM World Wide Web Conference, WWW 2022
			
	Parole chiave
	
				crowdsourcing; relevance judgment; information retrieval
			
	Tipologia
	
				04 Pubblicazione in atti di convegno::04b Atto di convegno in volume
			
	Citazione
	
				Preferences on a Budget: Prioritizing Document Pairs when Crowdsourcing Relevance Judgments / Roitero, Kevin; Checco, Alessandro; Mizzaro, Stefano; Demartini, Gianluca. - (2022), pp. 319-327. (Intervento presentato al  convegno 31st ACM World Wide Web Conference, WWW 2022 tenutosi a Lyon; France) [10.1145/3485447.3511960].

File allegati a questo prodotto

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1696296

Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni

ND

6

ND

Catalogo dei prodotti della ricerca