In real applications, it is very common to have the true clustering structure masked by the presence of noise variables and/or dimensions. A mixture model is proposed for simultaneous clustering and dimensionality reduction of mixed-type data: the continuous and the ordinal variables are assumed to follow a Gaussian mixture model, where, as regards the ordinal variables, it is only partially observed. To recognize discriminative and noise dimensions, the variables are considered to be linear combinations of two independent sets of latent factors where only one contains the information about the cluster structure while the other one contains noise dimensions. In order to overcome computational issues, the parameter estimation is carried out through an EM-like algorithm maximizing a composite log-likelihood based on low-dimensional margins.

Simultaneous clustering and dimensional reduction of mixed-type data / Ranalli, Monia; Rocci, Roberto. - (2018), pp. 179-186. (Intervento presentato al convegno Advances in Statistical Modelling of Ordinal Data ASMOD 2018 tenutosi a Naples) [10.6093/978-88-6887-042-3].

Simultaneous clustering and dimensional reduction of mixed-type data

Ranalli Monia;Rocci Roberto
2018

Abstract

In real applications, it is very common to have the true clustering structure masked by the presence of noise variables and/or dimensions. A mixture model is proposed for simultaneous clustering and dimensionality reduction of mixed-type data: the continuous and the ordinal variables are assumed to follow a Gaussian mixture model, where, as regards the ordinal variables, it is only partially observed. To recognize discriminative and noise dimensions, the variables are considered to be linear combinations of two independent sets of latent factors where only one contains the information about the cluster structure while the other one contains noise dimensions. In order to overcome computational issues, the parameter estimation is carried out through an EM-like algorithm maximizing a composite log-likelihood based on low-dimensional margins.
2018
Advances in Statistical Modelling of Ordinal Data ASMOD 2018
Mixture models, Composite likelihood, EM algorithm
04 Pubblicazione in atti di convegno::04b Atto di convegno in volume
Simultaneous clustering and dimensional reduction of mixed-type data / Ranalli, Monia; Rocci, Roberto. - (2018), pp. 179-186. (Intervento presentato al convegno Advances in Statistical Modelling of Ordinal Data ASMOD 2018 tenutosi a Naples) [10.6093/978-88-6887-042-3].
File allegati a questo prodotto
File Dimensione Formato  
Ranalli_simultaneous-clustering_2018.pdf

accesso aperto

Tipologia: Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza: Creative commons
Dimensione 2.66 MB
Formato Adobe PDF
2.66 MB Adobe PDF

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1347172
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact