Clustering is a fundamental step in many information-retrieval and data-mining applications. Detecting clusters in graphs is also a key tool for finding the community structure in social and behavioral networks. In many of these applications, the input graph evolves over time in a continual and decentralized manner, and, to maintain a good clustering, the clustering algorithm needs to repeatedly probe the graph. Furthermore, there are often limitations on the frequency of such probes, either imposed explicitly by the online platform (e.g., in the case of crawling proprietary social networks like twitter) or implicitly because of resource limitations (e.g., in the case of crawling the web). In this paper, we study a model of clustering on evolving graphs that captures this aspect of the problem. Our model is based on the classical stochastic block model, which has been used to assess rigorously the quality of various static clustering methods. In our model, the algorithm is supposed to reconstruct the planted clustering, given the ability to query for small pieces of local information about the graph, at a limited rate. We design and analyze clustering algorithms that work in this model, and show asymptotically tight upper and lower bounds on their accuracy. Finally, we perform simulations, which demonstrate that our main asymptotic results hold true also in practice.

Community Detection on Evolving Graphs / Anagnostopoulos, Aristidis; Lacki, Jakub; Lattanzi, Silvio; Leonardi, Stefano; Mahdian, Mohammad. - 29:(2016), pp. 3522-3530. (Intervento presentato al convegno 30th Annual Conference on Neural Information Processing Systems 2016 tenutosi a Barcelona; Spain).

Community Detection on Evolving Graphs

ANAGNOSTOPOULOS, ARISTIDIS
;
LEONARDI, Stefano;
2016

Abstract

Clustering is a fundamental step in many information-retrieval and data-mining applications. Detecting clusters in graphs is also a key tool for finding the community structure in social and behavioral networks. In many of these applications, the input graph evolves over time in a continual and decentralized manner, and, to maintain a good clustering, the clustering algorithm needs to repeatedly probe the graph. Furthermore, there are often limitations on the frequency of such probes, either imposed explicitly by the online platform (e.g., in the case of crawling proprietary social networks like twitter) or implicitly because of resource limitations (e.g., in the case of crawling the web). In this paper, we study a model of clustering on evolving graphs that captures this aspect of the problem. Our model is based on the classical stochastic block model, which has been used to assess rigorously the quality of various static clustering methods. In our model, the algorithm is supposed to reconstruct the planted clustering, given the ability to query for small pieces of local information about the graph, at a limited rate. We design and analyze clustering algorithms that work in this model, and show asymptotically tight upper and lower bounds on their accuracy. Finally, we perform simulations, which demonstrate that our main asymptotic results hold true also in practice.
2016
30th Annual Conference on Neural Information Processing Systems 2016
Stochastic Blockmodels; Stochastic systems; Models; stochastic block
04 Pubblicazione in atti di convegno::04b Atto di convegno in volume
Community Detection on Evolving Graphs / Anagnostopoulos, Aristidis; Lacki, Jakub; Lattanzi, Silvio; Leonardi, Stefano; Mahdian, Mohammad. - 29:(2016), pp. 3522-3530. (Intervento presentato al convegno 30th Annual Conference on Neural Information Processing Systems 2016 tenutosi a Barcelona; Spain).
File allegati a questo prodotto
File Dimensione Formato  
Anagnostopoulos_Community-detection_2016.pdf

accesso aperto

Note: https://papers.nips.cc/paper/6173-community-detection-on-evolving-graphs.pdf
Tipologia: Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza: Tutti i diritti riservati (All rights reserved)
Dimensione 261.94 kB
Formato Adobe PDF
261.94 kB Adobe PDF

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/936170
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 14
  • ???jsp.display-item.citation.isi??? 0
social impact