A novel clustering model, CPclus, for three-way data concerning a set of objects on which variables are measured by different subjects is proposed. The main aim of the proposal is to simultaneously summarize the objects through clusters and both variables and subjects through components. The object clusters are found by adopting a K-means-based strategy where the centroids are reduced according to the Candecomp/Parafac model in order to exploit the three-way structure of the data. The clustering process is carried out in order to reveal between-cluster differences in mean. Least-squares fitting is performed by using an iterative alternating least-squares algorithm. Model selection is addressed by considering an elbow-based method. An extensive simulation study and some real-life applications show the effectiveness of the proposal, also in comparison with its potential competitors.

CPclus: Candecomp/Parafac Clustering Model for Three-Way Data / Vicari, Donatella; Giordani, Paolo. - In: JOURNAL OF CLASSIFICATION. - ISSN 0176-4268. - 40:(2023), pp. 432-465. [10.1007/s00357-023-09440-4]

CPclus: Candecomp/Parafac Clustering Model for Three-Way Data

Vicari, Donatella
;
Giordani, Paolo
2023

Abstract

A novel clustering model, CPclus, for three-way data concerning a set of objects on which variables are measured by different subjects is proposed. The main aim of the proposal is to simultaneously summarize the objects through clusters and both variables and subjects through components. The object clusters are found by adopting a K-means-based strategy where the centroids are reduced according to the Candecomp/Parafac model in order to exploit the three-way structure of the data. The clustering process is carried out in order to reveal between-cluster differences in mean. Least-squares fitting is performed by using an iterative alternating least-squares algorithm. Model selection is addressed by considering an elbow-based method. An extensive simulation study and some real-life applications show the effectiveness of the proposal, also in comparison with its potential competitors.
2023
three-way data; simultaneous dimension reduction; K-means; Candecomp/ Parafac
01 Pubblicazione su rivista::01a Articolo in rivista
CPclus: Candecomp/Parafac Clustering Model for Three-Way Data / Vicari, Donatella; Giordani, Paolo. - In: JOURNAL OF CLASSIFICATION. - ISSN 0176-4268. - 40:(2023), pp. 432-465. [10.1007/s00357-023-09440-4]
File allegati a questo prodotto
File Dimensione Formato  
Vicari_CPclus_2023.pdf

solo gestori archivio

Note: Articolo CPclus
Tipologia: Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza: Tutti i diritti riservati (All rights reserved)
Dimensione 1.5 MB
Formato Adobe PDF
1.5 MB Adobe PDF   Contatta l'autore
Vicari_CPclus_2023.pdf

solo gestori archivio

Note: Materiale Supplementare CPclus
Tipologia: Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza: Tutti i diritti riservati (All rights reserved)
Dimensione 336.29 kB
Formato Adobe PDF
336.29 kB Adobe PDF   Contatta l'autore

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1683586
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
social impact