Load-sensitive selective pruning for distributed search / Broccolo, D.; Macdonald, C.; Orlando, S.; Ounis, I.; Perego, R.; Silvestri, F.; Tonellotto, N. (2013), pp. 379-388. Paper presented at the 22nd ACM International Conference on Information and Knowledge Management (CIKM 2013), held in San Francisco, CA, USA. DOI: 10.1145/2505515.2505699.
Load-sensitive selective pruning for distributed search
Silvestri, Fabrizio; Tonellotto, N.
2013
Abstract
A search engine infrastructure must be able to provide the same quality of service to all queries received during a day. During normal operating conditions, the demand for resources is considerably lower than under peak conditions, yet an oversized infrastructure would result in an unnecessary waste of computing power. A possible solution in this situation is to define a maximum threshold processing time for each query and to drop queries for which this threshold elapses, leading to disappointed users. In this paper, we propose and evaluate a different approach: given a set of query processing strategies with differing efficiency, a framework sets a maximum processing time for each query and selects the processing strategy that is best for that query, such that the processing time of all queries is kept below the threshold. The processing time estimates used by the scheduler are learned from past queries. We experimentally validate our approach on 10,000 queries from a standard TREC dataset with over 50 million documents, and we compare it with several baselines. These experiments encompass testing the system under different query loads and different maximum tolerated query response times. Our results show that, at the cost of a marginal loss in response quality, our search system is able to answer 90% of queries within half a second during times of high query volume. Copyright 2013 ACM.
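The abstract describes a scheduler that, given processing time estimates learned from past queries, picks the most effective query processing strategy whose predicted cost still fits the per-query time budget. The following is a minimal illustrative sketch of that idea, not the authors' implementation; all names (`Strategy`, `predict_time`, `select_strategy`, the toy predictors) are hypothetical assumptions.

```python
# Hypothetical sketch of load-sensitive strategy selection: choose the most
# effective strategy predicted to finish within the time budget, assuming a
# learned per-strategy time predictor. Not the paper's actual implementation.

from dataclasses import dataclass
from typing import Callable, Sequence

@dataclass
class Strategy:
    name: str
    quality_rank: int                      # higher = more effective, typically slower
    predict_time: Callable[[str], float]   # learned predictor: query -> estimated seconds

def select_strategy(query: str, strategies: Sequence[Strategy],
                    time_budget: float) -> Strategy:
    """Return the highest-quality strategy predicted to finish within the budget."""
    # Consider strategies from most to least effective.
    ranked = sorted(strategies, key=lambda s: s.quality_rank, reverse=True)
    for s in ranked:
        if s.predict_time(query) <= time_budget:
            return s
    # If none is predicted to fit, fall back to the cheapest strategy
    # rather than dropping the query outright.
    return ranked[-1]

# Toy usage with made-up predictors (purely illustrative):
strategies = [
    Strategy("exhaustive", 3, lambda q: 0.08 * len(q.split())),
    Strategy("moderate-pruning", 2, lambda q: 0.03 * len(q.split())),
    Strategy("aggressive-pruning", 1, lambda q: 0.01 * len(q.split())),
]
chosen = select_strategy("load sensitive selective pruning", strategies, time_budget=0.5)
print(chosen.name)
```

Under a tight budget (e.g., during a query load spike) the selector degrades gracefully to a cheaper pruning strategy instead of exceeding the deadline, which mirrors the trade-off the abstract describes between marginal quality loss and bounded response time.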
| File | Type | License | Size | Format | Access |
|---|---|---|---|---|---|
| VE_2013_11573-1476535.pdf | Publisher's version (published with the publisher's layout) | All rights reserved | 1.08 MB | Adobe PDF | Restricted to archive administrators; contact the author |
Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.