Motivated by structural properties of the Web graph that support efficient data structures for in memory adjacency queries, we study the extent to which a large network can be compressed. Boldi and Vigna (WWW 2004), showed that Web graphs can be compressed down to three bits of storage per edge; we study the compressibility of social networks where again adjacency queries are a fundamental primitive. To this end, we propose simple combinatorial formulations that encapsulate efficient compressibility of graphs. We show that some of the problems are NP-hard yet admit effective heuristics, some of which can exploit properties of social networks such as link reciprocity. Our extensive experiments show that social networks and the Web graph exhibit vastly different compressibility characteristics. Copyright 2009 ACM.
On Compressing Social Networks / CHIERICHETTI, FLAVIO; RAVI, KUMAR; SILVIO, LATTANZI; MICHAEL, MITZENMACHER; PANCONESI, Alessandro; PRABHAKAR, RAGHAVAN. - (2009), pp. 219-227. (Intervento presentato al convegno ACM SIGKDD Conference On Knowledge Discovery and Data Mining (KDD 09) tenutosi a Paris; France nel 28 Giugno - 1 Luglio 2009) [10.1145/1557019.1557049].
On Compressing Social Networks
CHIERICHETTI, FLAVIO;PANCONESI, Alessandro;
2009
Abstract
Motivated by structural properties of the Web graph that support efficient data structures for in memory adjacency queries, we study the extent to which a large network can be compressed. Boldi and Vigna (WWW 2004), showed that Web graphs can be compressed down to three bits of storage per edge; we study the compressibility of social networks where again adjacency queries are a fundamental primitive. To this end, we propose simple combinatorial formulations that encapsulate efficient compressibility of graphs. We show that some of the problems are NP-hard yet admit effective heuristics, some of which can exploit properties of social networks such as link reciprocity. Our extensive experiments show that social networks and the Web graph exhibit vastly different compressibility characteristics. Copyright 2009 ACM.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.