In this paper, two fuzzy clustering methods for spatial interval-valued data are proposed, i.e. the fuzzy C-Medoids clustering of spatial interval-valued data with and without entropy regularization. Both methods are based on the Partitioning Around Medoids (PAM) algorithm, inheriting the great advantage of obtaining non-fictitious representative units for each cluster. In both methods, the units are endowed with a relation of contiguity, represented by a symmetric binary matrix. This can be intended both as contiguity in a physical space and as a more abstract notion of contiguity. The performances of the methods are proved by simulation, testing the methods with different contiguity matrices associated to natural clusters of units. In order to show the effectiveness of the methods in empirical studies, three applications are presented: the clustering of municipalities based on interval-valued pollutants levels, the clustering of European fact-checkers based on interval-valued data on the average number of impressions received by their tweets and the clustering of the residential zones of the city of Rome based on the interval of price values.

Fuzzy clustering of spatial interval-valued data / D'Urso, P.; De Giovanni, L.; Federico, L.; Vitale, V.. - In: SPATIAL STATISTICS. - ISSN 2211-6753. - 57:(2023), p. 100764. [10.1016/j.spasta.2023.100764]

Fuzzy clustering of spatial interval-valued data

D'Urso P.
;
Vitale V.
2023

Abstract

In this paper, two fuzzy clustering methods for spatial interval-valued data are proposed, i.e. the fuzzy C-Medoids clustering of spatial interval-valued data with and without entropy regularization. Both methods are based on the Partitioning Around Medoids (PAM) algorithm, inheriting the great advantage of obtaining non-fictitious representative units for each cluster. In both methods, the units are endowed with a relation of contiguity, represented by a symmetric binary matrix. This can be intended both as contiguity in a physical space and as a more abstract notion of contiguity. The performances of the methods are proved by simulation, testing the methods with different contiguity matrices associated to natural clusters of units. In order to show the effectiveness of the methods in empirical studies, three applications are presented: the clustering of municipalities based on interval-valued pollutants levels, the clustering of European fact-checkers based on interval-valued data on the average number of impressions received by their tweets and the clustering of the residential zones of the city of Rome based on the interval of price values.
2023
spatial imprecise data; fuzzy clustering; partitioning around medoids; entropy; environmental data; Networks
01 Pubblicazione su rivista::01a Articolo in rivista
Fuzzy clustering of spatial interval-valued data / D'Urso, P.; De Giovanni, L.; Federico, L.; Vitale, V.. - In: SPATIAL STATISTICS. - ISSN 2211-6753. - 57:(2023), p. 100764. [10.1016/j.spasta.2023.100764]
File allegati a questo prodotto
File Dimensione Formato  
1-s2.0-S2211675323000398-main.pdf

accesso aperto

Tipologia: Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza: Tutti i diritti riservati (All rights reserved)
Dimensione 2.71 MB
Formato Adobe PDF
2.71 MB Adobe PDF

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1689675
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 6
  • ???jsp.display-item.citation.isi??? 4
social impact