Real-time and distributed applications for dictionary-based data compression

DE AGOSTINO, Sergio
2015

Abstract

The greedy approach to dictionary-based static text compression can be executed by a finite state machine. When it is applied in parallel to different blocks of data independently, there is no lack of robustness even on standard large scale distributed systems with input files of arbitrary size. Beyond standard large scale, a negative effect on the compression effectiveness is caused by the very small size of the data blocks. A robust approach for extreme distributed systems is presented in this paper, where this problem is fixed by overlapping adjacent blocks and preprocessing the neighborhoods of the boundaries. Moreover, we introduce the notion of pseudo-prefix dictionary, which allows optimal compression by means of a real-time semi-greedy procedure and a slight improvement on the compression ratio obtained by the distributed implementations.
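As a rough illustration of the block-wise greedy parsing the abstract describes, the following minimal Python sketch parses a string against a small static dictionary, first as a whole and then as independent blocks. It is not the paper's implementation: the function names, the toy dictionary, and the block size are assumptions made for this example, and the sketch also assumes every single character is encodable on its own. It does not implement the overlapping-block preprocessing or the pseudo-prefix dictionary.

```python
def greedy_parse(text, dictionary):
    """Greedy parsing: at each position, emit the longest dictionary match.

    Assumption for this sketch: any single character may be emitted as a
    phrase, so the parse never gets stuck. The dictionary must be non-empty.
    """
    max_len = max(map(len, dictionary))
    phrases = []
    i = 0
    while i < len(text):
        # Try the longest candidate first, then shorter ones.
        for length in range(min(max_len, len(text) - i), 0, -1):
            candidate = text[i:i + length]
            if length == 1 or candidate in dictionary:
                phrases.append(candidate)
                i += length
                break
    return phrases


def parse_in_blocks(text, dictionary, block_size):
    """Split the input into fixed-size blocks and parse each independently.

    Each block could be handled by a separate processor, since no block
    depends on the result of another.
    """
    blocks = [text[k:k + block_size] for k in range(0, len(text), block_size)]
    return [greedy_parse(block, dictionary) for block in blocks]


# Hypothetical toy dictionary, chosen only to make the effect visible.
DICTIONARY = {"a", "b", "ab", "ba", "abab"}

if __name__ == "__main__":
    text = "abababab"
    print(greedy_parse(text, DICTIONARY))
    print(parse_in_blocks(text, DICTIONARY, 3))
```

With this toy input, parsing the whole string yields two phrases, while parsing blocks of three characters yields five, because long matches are cut at block boundaries. This is the loss of compression effectiveness on very small blocks that the abstract refers to.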
data compression, decoding, real-time application, distributed system, scalability, robustness
01 Journal publication::01a Journal article
Real-time and distributed applications for dictionary-based data compression / DE AGOSTINO, Sergio. - In: INTERNATIONAL JOURNAL ON ADVANCES IN SOFTWARE. - ISSN 1942-2628. - ELECTRONIC. - 8:1 & 2(2015), pp. 75-84.
Files attached to this item
File: DeAgostino_Real_2015.pdf
Access: open access
Type: Publisher's version (published version with the publisher's layout)
License: All rights reserved
Size: 175.7 kB
Format: Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11573/787086