Methods for clustering univariate time series often rely on choosing some features relevant for the problem at hand and seeking for clusters according to their measurements, for instance the autoregressive coefficients, spectral measures, time delays at some selected frequencies and special characteristics such as trend, seasonality, etc. In this context some interesting features based on indexes of goodness-of-fit seem worth of special attention. Similar approaches have been suggested for clustering sets of multivariate time series. For example, clusters of regional economies may be formed based on sets of macroeconomic time series for each country. In a multivariate framework, however, the features of interest are more difficult to extract than for univariate time series. Indeed multivariate time series may differ not only for structure or pairwise correlation but for dimensionality and internal correlation as well. We propose some measures of predictability and interpolability as indexes of goodness-of-fit for multivariate time series that may serve as useful features to find clusters in the data. The capability of a clustering methods in distinguishing clusters of multivariate time series may be evaluated by using several cluster internal validity criteria. As each criterion is known to measure some special characteristics of the extracted features, multiobjective clustering methods and a genetic algorithm implementation are used to perform such evaluation. The concept of Pareto optimality in multiobjective genetic algorithms is used to perform simultaneous search over multiple criteria. The advantage in using genetic algorithms for multiobjective optimizationresides in the circumstance that genetic algorithms maintain a population of solutions most of them non-dominated in the Pareto sense so that the whole Pareto front may be provided in a single run. The effectiveness of the measures of predictability and interpolability in conjunction with the multiobjective genetic optimization procedure for outlining the cluster structure of a set of multivariate time series will be studied on a set of real time series data. Furthermore, a simulation experiment will be presented to compare the performance of the proposed procedure with procedures arising from alternative approaches.

Clustering multivariate time series by genetic multiobjective optimization / S., Bandyopadhyay; Baragona, Roberto; U., Maulik. - In: METRON. - ISSN 0026-1424. - 68:2(2010), pp. 161-183.

Clustering multivariate time series by genetic multiobjective optimization

BARAGONA, Roberto;
2010

Abstract

Methods for clustering univariate time series often rely on choosing some features relevant for the problem at hand and seeking for clusters according to their measurements, for instance the autoregressive coefficients, spectral measures, time delays at some selected frequencies and special characteristics such as trend, seasonality, etc. In this context some interesting features based on indexes of goodness-of-fit seem worth of special attention. Similar approaches have been suggested for clustering sets of multivariate time series. For example, clusters of regional economies may be formed based on sets of macroeconomic time series for each country. In a multivariate framework, however, the features of interest are more difficult to extract than for univariate time series. Indeed multivariate time series may differ not only for structure or pairwise correlation but for dimensionality and internal correlation as well. We propose some measures of predictability and interpolability as indexes of goodness-of-fit for multivariate time series that may serve as useful features to find clusters in the data. The capability of a clustering methods in distinguishing clusters of multivariate time series may be evaluated by using several cluster internal validity criteria. As each criterion is known to measure some special characteristics of the extracted features, multiobjective clustering methods and a genetic algorithm implementation are used to perform such evaluation. The concept of Pareto optimality in multiobjective genetic algorithms is used to perform simultaneous search over multiple criteria. The advantage in using genetic algorithms for multiobjective optimizationresides in the circumstance that genetic algorithms maintain a population of solutions most of them non-dominated in the Pareto sense so that the whole Pareto front may be provided in a single run. The effectiveness of the measures of predictability and interpolability in conjunction with the multiobjective genetic optimization procedure for outlining the cluster structure of a set of multivariate time series will be studied on a set of real time series data. Furthermore, a simulation experiment will be presented to compare the performance of the proposed procedure with procedures arising from alternative approaches.
2010
Clustering; Cluster validity index; Genetic algorithms; Multiobjective optimization; Time series; Pareto optimality
01 Pubblicazione su rivista::01a Articolo in rivista
Clustering multivariate time series by genetic multiobjective optimization / S., Bandyopadhyay; Baragona, Roberto; U., Maulik. - In: METRON. - ISSN 0026-1424. - 68:2(2010), pp. 161-183.
File allegati a questo prodotto
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/99406
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 10
  • ???jsp.display-item.citation.isi??? ND
social impact