In this paper, we propose a new algorithm for the joint clustering of two sets of statistical units, N and M, that are also equipped with an adjacency structure represented by a bipartite network. Our model is based on the fuzzy Partitioning Around Medoids (PAM) algorithm and combines it with techniques for community detection in bipartite complex networks based on Barber modularity maximization. The goal is to produce a partition of N∪M into clusters, each of which is identified by two medoids, one in N and one in M, representing the typical units of the cluster in each set. The clusters are optimized so that units in the same cluster both have similar attribute values and are likely to be adjacent. We test the algorithm on both simulated and real data to show how it captures a wide range of interactions between the distribution of the attributes and the network structure.
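To make the network side of the abstract concrete, the following is a minimal sketch of Barber modularity for a hard partition of a bipartite network: the fraction of edges that fall inside clusters, minus the fraction expected when degrees are kept fixed. It is not the algorithm of the paper, which works with fuzzy memberships and medoids and uses modularity as a regularization term. The function name `barber_modularity`, its arguments, and the toy network are assumptions made for illustration only.

```python
from collections import defaultdict


def barber_modularity(edges, cluster_n, cluster_m):
    """Barber modularity of a hard partition of a bipartite network.

    Q = (1/m) * sum_{i in N, j in M} (A_ij - k_i * d_j / m) * [c_i == c_j]

    edges     : iterable of (i, j) pairs with i in N and j in M (no repeats)
    cluster_n : dict mapping each unit of N to a cluster label
    cluster_m : dict mapping each unit of M to a cluster label
    (All names here are hypothetical and chosen for this sketch.)
    """
    edges = list(edges)
    m = len(edges)
    if m == 0:
        raise ValueError("the network has no edges")

    observed = 0                     # edges joining units in the same cluster
    degree_sum_n = defaultdict(int)  # total degree of N-units per cluster
    degree_sum_m = defaultdict(int)  # total degree of M-units per cluster
    for i, j in edges:
        degree_sum_n[cluster_n[i]] += 1
        degree_sum_m[cluster_m[j]] += 1
        if cluster_n[i] == cluster_m[j]:
            observed += 1

    # Number of within-cluster edges expected under the null model, times m.
    expected = sum(
        total * degree_sum_m.get(label, 0)
        for label, total in degree_sum_n.items()
    )
    return observed / m - expected / m**2


# Toy network: two disconnected blocks, {n1, n2, m1} and {n3, n4, m2}.
edges = [("n1", "m1"), ("n2", "m1"), ("n3", "m2"), ("n4", "m2")]
split_n = {"n1": 0, "n2": 0, "n3": 1, "n4": 1}
split_m = {"m1": 0, "m2": 1}
merged_n = {unit: 0 for unit in split_n}
merged_m = {unit: 0 for unit in split_m}

print(barber_modularity(edges, split_n, split_m))    # 0.5
print(barber_modularity(edges, merged_n, merged_m))  # 0.0
```

On this toy network, the partition that follows the two blocks scores 0.5, while putting every unit in a single cluster scores 0, which is why maximizing the quantity favors clusters whose units are likely to be adjacent.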
Fuzzy clustering with Barber modularity regularization / D'Urso, Pierpaolo; De Giovanni, Livia; Federico, Lorenzo; Vitale, Vincenzina. - In: STATISTICS AND COMPUTING. - ISSN 0960-3174. - 34:6(2024). [10.1007/s11222-024-10495-5]
Fuzzy clustering with Barber modularity regularization
D'Urso, Pierpaolo; Vitale, Vincenzina
2024
File | Size | Format | |
---|---|---|---|
s11222-024-10495-5.pdf (open access; type: publisher's version, i.e. the published version with the publisher's layout; license: all rights reserved) | 1.29 MB | Adobe PDF | |
Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.