Encoding lists of integers efficiently is important for many applications in different fields. Adjacency lists of large graphs are usually encoded to save space and to improve decoding speed. Inverted indexes of Information Retrieval systems keep the lists of postings compressed in order to exploit the memory hierarchy. Secondary indexes of DBMSs are stored similarly to inverted indexes in IR systems. In this paper we propose Vector of Splits Encoding (VSEncoding), a novel class of encoders that work by optimally partitioning a list of integers into blocks which are efficiently compressed by using simple encoders. In previous works heuristics were applied during the partitioning step. Instead, we find the optimal solution by using a dynamic programming approach. Experiments show that our class of encoders outperform all the existing methods in literature by more than 10% (with the exception of Binary Interpolative Coding with which they, roughly, tie) still retaining a very fast decompression algorithm. © 2010 ACM.

VSEncoding: Efficient coding and fast decoding of integer lists via dynamic programming / Silvestri, F.; Venturini, R.. - (2010), pp. 1219-1228. (Intervento presentato al convegno 19th International Conference on Information and Knowledge Management and Co-located Workshops, CIKM'10 tenutosi a Toronto, ON, can) [10.1145/1871437.1871592].

VSEncoding: Efficient coding and fast decoding of integer lists via dynamic programming

Silvestri F.;
2010

Abstract

Encoding lists of integers efficiently is important for many applications in different fields. Adjacency lists of large graphs are usually encoded to save space and to improve decoding speed. Inverted indexes of Information Retrieval systems keep the lists of postings compressed in order to exploit the memory hierarchy. Secondary indexes of DBMSs are stored similarly to inverted indexes in IR systems. In this paper we propose Vector of Splits Encoding (VSEncoding), a novel class of encoders that work by optimally partitioning a list of integers into blocks which are efficiently compressed by using simple encoders. In previous works heuristics were applied during the partitioning step. Instead, we find the optimal solution by using a dynamic programming approach. Experiments show that our class of encoders outperform all the existing methods in literature by more than 10% (with the exception of Binary Interpolative Coding with which they, roughly, tie) still retaining a very fast decompression algorithm. © 2010 ACM.
2010
19th International Conference on Information and Knowledge Management and Co-located Workshops, CIKM'10
Adaptive encoding; D-gap encoding; Index compression; Inverted index encoding
04 Pubblicazione in atti di convegno::04b Atto di convegno in volume
VSEncoding: Efficient coding and fast decoding of integer lists via dynamic programming / Silvestri, F.; Venturini, R.. - (2010), pp. 1219-1228. (Intervento presentato al convegno 19th International Conference on Information and Knowledge Management and Co-located Workshops, CIKM'10 tenutosi a Toronto, ON, can) [10.1145/1871437.1871592].
File allegati a questo prodotto
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1572863
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 55
  • ???jsp.display-item.citation.isi??? ND
social impact