Catalogo dei prodotti della ricerca

We design algorithms that, given a collection of documents and a distribution over user queries, return a small subset of the document collection in such a way that we can efficiently provide high-quality answers to user queries using only the selected subset. This approach has applications when space is a constraint or when the query-processing time increases significantly with the size of the collection. We study our algorithms through the lens of stochastic analysis and prove that even though they use only a small fraction of the entire collection, they can provide answers to most user queries, achieving a performance close to the optimal. To complement our theoretical findings, we experimentally show the versatility of our approach by considering two important cases in the context of Web search. In the first case, we favor the retrieval of documents that are relevant to the query, whereas in the second case we aim for document diversification. Both the theoretical and the experimental analysis provide strong evidence of the potential value of query covering in diverse application scenarios.

Stochastic Query Covering for Fast Approximate Document Retrieval / Anagnostopoulos, A., Becchetti, L., Ilaria, B., Leonardi, S., Ida, M., Piotr, S.. - In: ACM TRANSACTIONS ON INFORMATION SYSTEMS. - ISSN 1046-8188. - STAMPA. - 33:3(2015), pp. 1-35. [10.1145/2699671]

Stochastic Query Covering for Fast Approximate Document Retrieval

ANAGNOSTOPOULOS, ARISTIDIS;BECCHETTI, Luca;Ilaria Bordino;LEONARDI, Stefano;Ida Mele;Piotr Sankowski

2015

Abstract

We design algorithms that, given a collection of documents and a distribution over user queries, return a small subset of the document collection in such a way that we can efficiently provide high-quality answers to user queries using only the selected subset. This approach has applications when space is a constraint or when the query-processing time increases significantly with the size of the collection. We study our algorithms through the lens of stochastic analysis and prove that even though they use only a small fraction of the entire collection, they can provide answers to most user queries, achieving a performance close to the optimal. To complement our theoretical findings, we experimentally show the versatility of our approach by considering two important cases in the context of Web search. In the first case, we favor the retrieval of documents that are relevant to the query, whereas in the second case we aim for document diversification. Both the theoretical and the experimental analysis provide strong evidence of the potential value of query covering in diverse application scenarios.

Scheda breve

Scheda completa

	Anno di pubblicazione
	
				2015
			
	Parole chiave
	
				Algorithms; query covering; stochastic analysis; caching
			
	Tipologia
	
				01 Pubblicazione su rivista::01a Articolo in rivista
			
	Citazione
	
				Stochastic Query Covering for Fast Approximate Document Retrieval / Anagnostopoulos, A., Becchetti, L., Ilaria, B., Leonardi, S., Ida, M., Piotr, S.. - In: ACM TRANSACTIONS ON INFORMATION SYSTEMS. - ISSN 1046-8188. - STAMPA. - 33:3(2015), pp. 1-35. [10.1145/2699671]
			
	Appartiene alla tipologia:
	
				01a Articolo in rivista

File allegati a questo prodotto

File	Dimensione	Formato
Anagnostopoulos_Stochastic-Query-Covering_Postprint_2015.pdf accesso aperto Note: DOI: http://dx.doi.org/10.1145/2699671 Tipologia: Documento in Post-print (versione successiva alla peer review e accettata per la pubblicazione) Licenza: Tutti i diritti riservati (All rights reserved) Dimensione 363.41 kB Formato Adobe PDF	363.41 kB	Adobe PDF
Anagnostopoulos_Stochastic-Query-Covering_2015.pdf solo gestori archivio Tipologia: Versione editoriale (versione pubblicata con il layout dell'editore) Licenza: Tutti i diritti riservati (All rights reserved) Dimensione 711.77 kB Formato Adobe PDF Contatta l'autore	711.77 kB	Adobe PDF	Contatta l'autore

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/783743

Citazioni

ND

17

15

social impact