Given a notion of pairwise similarity between objects, locality sensitive hashing (LSH) aims to construct a hash function family over the universe of objects such that the probability two objects hash to the same value is their similarity. LSH is a powerful algorithmic tool for large scale applications and much work has been done to understand LSHable similarities, i.e., similarities that admit an LSH. In this paper we focus on similarities that are provably non-LSHable and propose a notion of distortion to capture the approximation of such a similarity by an LSHable similarity. We consider several well-known non-LSHable similarities and show tight upper and lower bounds on their distortion.

On the distortion of locality sensitive hashing / Chierichetti, Flavio; Kumar, Ravi; Panconesi, Alessandro; Terolli, Erisa. - In: SIAM JOURNAL ON COMPUTING. - ISSN 0097-5397. - 48:2(2019), pp. 350-372. [10.1137/17M1127752]

On the distortion of locality sensitive hashing

Chierichetti, Flavio;Panconesi, Alessandro;Terolli, Erisa
2019

Abstract

Given a notion of pairwise similarity between objects, locality sensitive hashing (LSH) aims to construct a hash function family over the universe of objects such that the probability two objects hash to the same value is their similarity. LSH is a powerful algorithmic tool for large scale applications and much work has been done to understand LSHable similarities, i.e., similarities that admit an LSH. In this paper we focus on similarities that are provably non-LSHable and propose a notion of distortion to capture the approximation of such a similarity by an LSHable similarity. We consider several well-known non-LSHable similarities and show tight upper and lower bounds on their distortion.
2019
locality sensitive hashing; distortion; similarity
01 Pubblicazione su rivista::01a Articolo in rivista
On the distortion of locality sensitive hashing / Chierichetti, Flavio; Kumar, Ravi; Panconesi, Alessandro; Terolli, Erisa. - In: SIAM JOURNAL ON COMPUTING. - ISSN 0097-5397. - 48:2(2019), pp. 350-372. [10.1137/17M1127752]
File allegati a questo prodotto
File Dimensione Formato  
Chierichetti_distortion_2019.pdf

accesso aperto

Tipologia: Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza: Tutti i diritti riservati (All rights reserved)
Dimensione 532.3 kB
Formato Adobe PDF
532.3 kB Adobe PDF

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1269940
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 3
  • ???jsp.display-item.citation.isi??? 2
social impact