Finite mixture of Gaussians are often used to classify two- (units and variables) or three- (units, variables and occasions) way data. However, two issues arise: model complexity and capturing the true cluster structure. Indeed, a large number of variables and/or occasions implies a large number of model parameters; while the existence of noise variables (and/or occasions) could mask the true cluster structure. The approach adopted in the present paper is to reduce the number of model parameters by identifying a sub-space containing the information needed to classify the observations. This should also help in identifying noise variables and/or occasions. The maximum likelihood model estimation is carried out through an EM-like algorithm. The effectiveness of the proposal is assessed through a simulation study and an application to real data.

Mixture models for simultaneous classification and reduction of three-way data / Rocci, Roberto; Vichi, Maurizio; Ranalli, Monia. - In: COMPUTATIONAL STATISTICS. - ISSN 0943-4062. - (2024), pp. 1-39. [10.1007/s00180-024-01478-1]

Mixture models for simultaneous classification and reduction of three-way data

Rocci, Roberto;Vichi, Maurizio;Ranalli, Monia
2024

Abstract

Finite mixture of Gaussians are often used to classify two- (units and variables) or three- (units, variables and occasions) way data. However, two issues arise: model complexity and capturing the true cluster structure. Indeed, a large number of variables and/or occasions implies a large number of model parameters; while the existence of noise variables (and/or occasions) could mask the true cluster structure. The approach adopted in the present paper is to reduce the number of model parameters by identifying a sub-space containing the information needed to classify the observations. This should also help in identifying noise variables and/or occasions. The maximum likelihood model estimation is carried out through an EM-like algorithm. The effectiveness of the proposal is assessed through a simulation study and an application to real data.
2024
three-way data; cluster analysis; dimensionality reduction; mixture models; Tucker2
01 Pubblicazione su rivista::01a Articolo in rivista
Mixture models for simultaneous classification and reduction of three-way data / Rocci, Roberto; Vichi, Maurizio; Ranalli, Monia. - In: COMPUTATIONAL STATISTICS. - ISSN 0943-4062. - (2024), pp. 1-39. [10.1007/s00180-024-01478-1]
File allegati a questo prodotto
File Dimensione Formato  
Rocci_mixture-models_2024.pdf

solo gestori archivio

Tipologia: Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza: Tutti i diritti riservati (All rights reserved)
Dimensione 2.46 MB
Formato Adobe PDF
2.46 MB Adobe PDF   Contatta l'autore

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1718260
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 2
  • ???jsp.display-item.citation.isi??? 0
social impact