A key issue in cluster analysis is determining a proper dissimilarity measure between two data objects, and many pairwise dissimilarities have been proposed to deal with time series. Assuming that the clustering purpose is to group series according to the underlying dependence structures, a detailed study of the behavior in clustering of a dissimilarity based on comparing estimated quantile autocovariance functions (QAF) is carried out. Quantile autocovariances provide information about the serial dependence structure that other conventional features are not able to capture, which suggests great potential to perform clustering of series. The asymptotic behavior of the sample quantile autocovariances is studied and an algorithm to determine optimal combinations of lags and pairs of quantile levels to perform clustering is introduced. The proposed metric is used to perform hard and soft partitioning-based clustering. First, a broad simulation study examines the behavior of the proposed metric in crisp clustering with the PAM procedure. A novel fuzzy C-medoids algorithm based on the QAF-dissimilarity is then proposed and compared with other fuzzy procedures in a new simulation study conducted to cluster fuzzy scenarios involving AR and GARCH models. In all cases, the QAF-based procedures outperform or are highly competitive with a range of dissimilarities reported in the literature, particularly exhibiting high capability to cluster conditionally heteroskedastic time series and robustness to the distributional form of the errors. Two specific applications involving air quality data and financial time series illustrate the usefulness of the proposed procedures.
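To make the idea of comparing estimated quantile autocovariances concrete, the following is a minimal sketch and not the authors' implementation. It assumes the usual sample estimator, in which the quantile autocovariance at lag l for levels (τ, τ′) is the empirical joint frequency of X_t ≤ q̂_τ and X_{t+l} ≤ q̂_τ′ minus ττ′, and it assumes the dissimilarity is the squared Euclidean distance between the resulting feature vectors. The function names `sample_qaf` and `qaf_dissimilarity`, and the default lags and quantile levels, are hypothetical choices for illustration; the abstract does not specify them.

```python
import numpy as np


def sample_qaf(x, lags=(1,), taus=(0.1, 0.5, 0.9)):
    """Vector of sample quantile autocovariances of a series x.

    Assumed estimator (not taken from the abstract): for each lag l and each
    pair (tau, tau'), the mean of I(x_t <= q_tau) * I(x_{t+l} <= q_tau')
    over t, minus tau * tau'.
    """
    x = np.asarray(x, dtype=float)
    n = x.size
    taus_arr = np.asarray(taus, dtype=float)
    q = np.quantile(x, taus_arr)                      # empirical quantiles
    ind = (x[:, None] <= q[None, :]).astype(float)    # shape (n, len(taus))
    feats = []
    for lag in lags:
        # joint[i, j] estimates P(x_t <= q_i, x_{t+lag} <= q_j)
        joint = ind[: n - lag].T @ ind[lag:] / (n - lag)
        feats.append((joint - np.outer(taus_arr, taus_arr)).ravel())
    return np.concatenate(feats)


def qaf_dissimilarity(x, y, **kwargs):
    """Squared Euclidean distance between the QAF feature vectors (assumed form)."""
    diff = sample_qaf(x, **kwargs) - sample_qaf(y, **kwargs)
    return float(diff @ diff)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    noise = rng.standard_normal(500)
    # Hypothetical AR(1) series with coefficient 0.8, for illustration only.
    ar = np.zeros(500)
    for t in range(1, 500):
        ar[t] = 0.8 * ar[t - 1] + rng.standard_normal()
    print(qaf_dissimilarity(noise, ar))
```

A matrix of such pairwise dissimilarities could then be passed to any medoid-based procedure, which is how a PAM or fuzzy C-medoids algorithm would consume it.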
Quantile autocovariances: a powerful tool for hard and soft partitional clustering of time series / Vilar, J. A.; Lafuente, B.; D'Urso, Pierpaolo. - In: FUZZY SETS AND SYSTEMS. - ISSN 0165-0114. - (2017).
Quantile autocovariances: a powerful tool for hard and soft partitional clustering of time series
D'URSO, Pierpaolo
2017
File | Size | Format
---|---|---
FSS (Quantile autocovariances a powerful tool for hard and soft partitional clustering of time series).pdf (open access; type: publisher's version, published with the publisher's layout; license: Creative Commons) | 2.56 MB | Adobe PDF
Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.