Mixture models represent a powerful statistical tool for clustering observations, an essential task in many fields. A multivariate factor analysis regression model (MFARM) can be used to explore the relationship between the observations and predictors, especially when the predictors' matrices are high dimensional. The disadvantages of MFARMs are generally related to the potential difficulty in interpretability of the resulting factors. A finite mixture of MFARMs is proposed for clustering both observations and predictors. In particular, by replacing the factor loading matrix with a binary row-stochastic matrix in the factor analyzer structure, the predictors can be clustered into groups such that a predictor is only associated with one of the factors. An alternating expectation-conditional maximization algorithm is used for parameter estimation. Application of the proposed approach to both simulated and real datasets is presented and discussed.

CLUSTERING VIA A FINITE MIXTURE OF DISJOINT FACTOR ANALYZER / Qin, Xiaoke; Martella, Francesca; Subedi, Sanjeena. - (2024). (Intervento presentato al convegno the 2024 Classification Society Annual Meeting tenutosi a Kelowna, British Columbia, Canada).

CLUSTERING VIA A FINITE MIXTURE OF DISJOINT FACTOR ANALYZER.

Francesca Martella;
2024

Abstract

Mixture models represent a powerful statistical tool for clustering observations, an essential task in many fields. A multivariate factor analysis regression model (MFARM) can be used to explore the relationship between the observations and predictors, especially when the predictors' matrices are high dimensional. The disadvantages of MFARMs are generally related to the potential difficulty in interpretability of the resulting factors. A finite mixture of MFARMs is proposed for clustering both observations and predictors. In particular, by replacing the factor loading matrix with a binary row-stochastic matrix in the factor analyzer structure, the predictors can be clustered into groups such that a predictor is only associated with one of the factors. An alternating expectation-conditional maximization algorithm is used for parameter estimation. Application of the proposed approach to both simulated and real datasets is presented and discussed.
2024
File allegati a questo prodotto
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1717033
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact