It is widely known that spectral techniques are very effective for document retrieval. Recently, researchers have devoted considerable effort to providing a formal mathematical explanation for this effectiveness [3]. Latent Semantic Indexing (LSI), in particular, is a text retrieval algorithm based on the spectral analysis of the occurrences of terms in text documents. Despite its value in improving the quality of a text search, LSI has the drawback of a high response time, which makes it unsuitable for on-line search in large collections of documents (e.g., web search engines). In this paper we present two approaches that aim to combine the effectiveness of latent semantic analysis with the efficiency of text-matching retrieval, through the technique of query expansion. We show that both approaches have relatively small computational cost, and we provide experimental evidence of their ability to improve document retrieval.

Fast LSI-based techniques for query expansion in text retrieval systems / Laura, L; Nanni, Umberto; Sarracco, F. - PRINT. - (2005), pp. 15-28. (Paper presented at the 28th German Conf. on Art. Intell. - 2nd Int. Workshop on Text-based Information Retrieval (TIR-05), held in Koblenz, Germany, September 11-14, 2005).

Fast LSI-based techniques for query expansion in text retrieval systems.

NANNI, Umberto;
2005

28th German Conf. on Art. Intell. - 2nd Int. Workshop on Text-based Information Retrieval (TIR-05)
Computational costs; Document Retrieval; Experimental evidence
04 Publication in conference proceedings::04b Conference paper in volume

Use this identifier to cite or link to this document: https://hdl.handle.net/11573/204969