Clustering Curves on a Reduced Subspace / Gattone, S A; Rocci, R. - In: JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS. - ISSN 1537-2715. - 21:2(2012), pp. 361-379.
Clustering Curves on a Reduced Subspace
Rocci R
2012
Abstract
The aim of this article is to propose a procedure to cluster functional observations in a subspace of reduced dimension. The dimensional reduction is obtained by constraining the cluster centroids to lie in a subspace which preserves the maximum amount of discriminative information contained in the original data. The model is estimated by using penalized least squares to take into account the functional nature of the data. The smoothing is carried out within the clustering and its amount is adaptively calibrated. A simulation study shows how the combination of these two elements, feature extraction and automatic data-driven smoothing, improves the performance of clustering by reducing irrelevant and redundant information in the data. The effectiveness of the proposal is demonstrated by an application to a real dataset regarding a speech recognition problem. Implementation details of the algorithm together with computer code are available in the online supplements.

File | Size | Format | Access |
---|---|---|---|
Gattone_ Clustering-curves_2012.pdf | 772.96 kB | Adobe PDF | Archive managers only |

Type: Publisher's version (published version with the publisher's layout)
License: All rights reserved
Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.