The detection of patterns in multivariate time series is a relevant task, especially for large datasets. In this paper, four clustering models for multivariate time series are proposed, with the following characteristics. First, the Partitioning Around Medoids (PAM) framework is considered. Among the different approaches to the clustering of multivariate time series, the observation-based is adopted. To cope with the complexity of the features of each multivariate time series and the associated assignment uncertainty a fuzzy clustering approach is adopted. Finally, to neutralize the effect of possible outliers, a robust metric approach is used, i.e., the exponential transformation of dissimilarity measures. The proposed models are robust extensions of the Fuzzy C-Medoids clustering algorithm for multivariate time series. With respect to the management of the time behaviour, four variants are proposed: the Cross-Sectional Fuzzy C-Medoids clustering model with exponential transformation (CS-Exp-FCMd) classifies the multivariate time series taking into account their respective instantaneous features; the Longitudinal Fuzzy C-Medoids clustering model with exponential transformation (L-Exp-FCMd) takes into account the evolutive (longitudinal) features; the Mixed Fuzzy C-Medoids clustering model with exponential transformation (M-Exp-FCMd) which consider simultaneously both the instantaneous and the longitudinal features in the clustering process; the Dynamic Time Warping-based Fuzzy C-Medoids model with exponential transformation (DTW-Exp-FCMd) uses the Dynamic Time Warping (DTW) distance. Three simulation studies show the clustering performance of the proposed models in presence of outliers, compared to their non-robust counterparts, and to other models proposed in the literature. An application on real-world data on the concentration of three pollutants in nineteen stations in the Metropolitan City of Rome shows the relevance of the robustness to outliers in the identification of the clusters.

Robust fuzzy clustering of multivariate time trajectories / D'Urso, Pierpaolo; De Giovanni, Livia; Massari, Riccardo. - In: INTERNATIONAL JOURNAL OF APPROXIMATE REASONING. - ISSN 0888-613X. - STAMPA. - 99:(2018), pp. 12-38. [10.1016/j.ijar.2018.05.002]

Robust fuzzy clustering of multivariate time trajectories

D'Urso, Pierpaolo;De Giovanni, Livia;Massari, Riccardo
2018

Abstract

The detection of patterns in multivariate time series is a relevant task, especially for large datasets. In this paper, four clustering models for multivariate time series are proposed, with the following characteristics. First, the Partitioning Around Medoids (PAM) framework is considered. Among the different approaches to the clustering of multivariate time series, the observation-based is adopted. To cope with the complexity of the features of each multivariate time series and the associated assignment uncertainty a fuzzy clustering approach is adopted. Finally, to neutralize the effect of possible outliers, a robust metric approach is used, i.e., the exponential transformation of dissimilarity measures. The proposed models are robust extensions of the Fuzzy C-Medoids clustering algorithm for multivariate time series. With respect to the management of the time behaviour, four variants are proposed: the Cross-Sectional Fuzzy C-Medoids clustering model with exponential transformation (CS-Exp-FCMd) classifies the multivariate time series taking into account their respective instantaneous features; the Longitudinal Fuzzy C-Medoids clustering model with exponential transformation (L-Exp-FCMd) takes into account the evolutive (longitudinal) features; the Mixed Fuzzy C-Medoids clustering model with exponential transformation (M-Exp-FCMd) which consider simultaneously both the instantaneous and the longitudinal features in the clustering process; the Dynamic Time Warping-based Fuzzy C-Medoids model with exponential transformation (DTW-Exp-FCMd) uses the Dynamic Time Warping (DTW) distance. Three simulation studies show the clustering performance of the proposed models in presence of outliers, compared to their non-robust counterparts, and to other models proposed in the literature. An application on real-world data on the concentration of three pollutants in nineteen stations in the Metropolitan City of Rome shows the relevance of the robustness to outliers in the identification of the clusters.
2018
Cross-sectional and longitudinal clustering; Dynamic time warping; Exponential distance; Outlier time trajectory; Partitioning around medoids; Robust fuzzy clustering; Software; Theoretical Computer Science; Artificial Intelligence; Applied Mathematics
01 Pubblicazione su rivista::01a Articolo in rivista
Robust fuzzy clustering of multivariate time trajectories / D'Urso, Pierpaolo; De Giovanni, Livia; Massari, Riccardo. - In: INTERNATIONAL JOURNAL OF APPROXIMATE REASONING. - ISSN 0888-613X. - STAMPA. - 99:(2018), pp. 12-38. [10.1016/j.ijar.2018.05.002]
File allegati a questo prodotto
File Dimensione Formato  
2018IJA.pdf

solo gestori archivio

Tipologia: Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza: Tutti i diritti riservati (All rights reserved)
Dimensione 4.65 MB
Formato Adobe PDF
4.65 MB Adobe PDF   Contatta l'autore

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1135174
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 47
  • ???jsp.display-item.citation.isi??? 38
social impact