Detecting communities in a network, based only on the adjacency matrix, is a problem of interest to several scientific disciplines. Recently, Zhang and Moore have introduced an algorithm [Proc. Natl. Acad. Sci. USA 111, 18144 (2014)], called mod-bp, that avoids overfitting the data by optimizing a weighted average of modularity (a popular goodness-of-fit measure in community detection) and entropy (i.e., number of configurations with a given modularity). The adjustment of the relative weight, the “temperature” of the model, is crucial for getting a correct result from mod-bp. In this work we study the many phase transitions that mod-bp may undergo by changing the two parameters of the algorithm: the temperature T and the maximum number of groups q. We introduce a new set of order parameters that allow us to determine the actual number of groups qˆ , and we observe on both synthetic and real networks the existence of phases with any qˆ ∈ {1,q}, which were unknown before. We discuss how to interpret the results of mod-bp and how to make the optimal choice for the problem of detecting significant communities.

Multiple phases in modularity-based community detection / Schülke, Christophe; RICCI TERSENGHI, Federico. - In: PHYSICAL REVIEW E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS. - ISSN 1539-3755. - STAMPA. - 92:4(2015), p. 042804. [10.1103/PhysRevE.92.042804]

Multiple phases in modularity-based community detection

RICCI TERSENGHI, Federico
2015

Abstract

Detecting communities in a network, based only on the adjacency matrix, is a problem of interest to several scientific disciplines. Recently, Zhang and Moore have introduced an algorithm [Proc. Natl. Acad. Sci. USA 111, 18144 (2014)], called mod-bp, that avoids overfitting the data by optimizing a weighted average of modularity (a popular goodness-of-fit measure in community detection) and entropy (i.e., number of configurations with a given modularity). The adjustment of the relative weight, the “temperature” of the model, is crucial for getting a correct result from mod-bp. In this work we study the many phase transitions that mod-bp may undergo by changing the two parameters of the algorithm: the temperature T and the maximum number of groups q. We introduce a new set of order parameters that allow us to determine the actual number of groups qˆ , and we observe on both synthetic and real networks the existence of phases with any qˆ ∈ {1,q}, which were unknown before. We discuss how to interpret the results of mod-bp and how to make the optimal choice for the problem of detecting significant communities.
2015
Condensed Matter Physics; Statistical and Nonlinear Physics; Statistics and Probability
01 Pubblicazione su rivista::01a Articolo in rivista
Multiple phases in modularity-based community detection / Schülke, Christophe; RICCI TERSENGHI, Federico. - In: PHYSICAL REVIEW E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS. - ISSN 1539-3755. - STAMPA. - 92:4(2015), p. 042804. [10.1103/PhysRevE.92.042804]
File allegati a questo prodotto
File Dimensione Formato  
Schulke_Multiple_2015.pdf

solo utenti autorizzati

Tipologia: Documento in Post-print (versione successiva alla peer review e accettata per la pubblicazione)
Licenza: Tutti i diritti riservati (All rights reserved)
Dimensione 701.78 kB
Formato Adobe PDF
701.78 kB Adobe PDF   Contatta l'autore

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/868046
Citazioni
  • ???jsp.display-item.citation.pmc??? 1
  • Scopus 11
  • ???jsp.display-item.citation.isi??? 11
social impact