Robust fuzzy clustering of multivariate time series is addressed when the clustering purpose is grouping together series generated from similar stochastic processes. Robustness to the presence of anomalous series is attained by considering three well-known robust versions of a fuzzy C-means model based on a spectral dissimilarity measure with high discriminatory power. The dissimilarity measure compares principal component scores obtained from estimates of quantile cross-spectral densities, and the robust techniques follow the so-called metric, noise and trimmed approaches. The metric approach incorporates in the objective function a distance aimed at neutralizing the effect of the outliers, the noise approach builds an artificial cluster expected to contain the outlying series, and the trimmed approach removes the most atypical series in the dataset. As result, the proposed clustering methods take advantage of both the robust nature of these techniques and the capability of the quantile cross-spectral density to identify complex dependence structures. An extensive simulation study including multivariate linear, nonlinear and GARCH processes shows that the algorithms are substantially effective in coping with the presence of outlying series, clearly outperforming other alternative procedures. Two specific applications regarding financial and environmental series illustrate the usefulness of the presented methods. (c) 2022 The Author(s). Published by Elsevier Inc.
Quantile-based fuzzy C-means clustering of multivariate time series: Robust techniques / Lopez-Oriona, A; D'Urso, P; Vilar, Ja; Lafuente-Rego, B. - In: INTERNATIONAL JOURNAL OF APPROXIMATE REASONING. - ISSN 0888-613X. - 150:(2022), pp. 55-82. [10.1016/j.ijar.2022.07.010]
Quantile-based fuzzy C-means clustering of multivariate time series: Robust techniques
D'Urso, P;
2022
Abstract
Robust fuzzy clustering of multivariate time series is addressed when the clustering purpose is grouping together series generated from similar stochastic processes. Robustness to the presence of anomalous series is attained by considering three well-known robust versions of a fuzzy C-means model based on a spectral dissimilarity measure with high discriminatory power. The dissimilarity measure compares principal component scores obtained from estimates of quantile cross-spectral densities, and the robust techniques follow the so-called metric, noise and trimmed approaches. The metric approach incorporates in the objective function a distance aimed at neutralizing the effect of the outliers, the noise approach builds an artificial cluster expected to contain the outlying series, and the trimmed approach removes the most atypical series in the dataset. As result, the proposed clustering methods take advantage of both the robust nature of these techniques and the capability of the quantile cross-spectral density to identify complex dependence structures. An extensive simulation study including multivariate linear, nonlinear and GARCH processes shows that the algorithms are substantially effective in coping with the presence of outlying series, clearly outperforming other alternative procedures. Two specific applications regarding financial and environmental series illustrate the usefulness of the presented methods. (c) 2022 The Author(s). Published by Elsevier Inc.File | Dimensione | Formato | |
---|---|---|---|
1-s2.0-S0888613X22001128-main.pdf
accesso aperto
Tipologia:
Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza:
Tutti i diritti riservati (All rights reserved)
Dimensione
1.92 MB
Formato
Adobe PDF
|
1.92 MB | Adobe PDF |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.