A three-way three-mode data array X, where modes are units, variables, and occasions, is the data structure that can comprehensively and statistically analyze a collective phenomenon. When X has a large dimension, it is important to synthesize its information by identifying classes of similar occasions where units are described by a reduced set of LVs. In this paper, a simultaneous reduction of the occasions and variables of X is proposed. A fuzzy clustering of the occasions allows the identification of K clusters of multivariate data matrices that are within-cluster perceived similar. For each cluster, a consensus matrix with respect to the units is identified. Variablesin the cluster are correlated and maintain their covariance structure that can be synthesized for each consensus matrix by applying aSecond-Order Disjoint Factor Analysis. The proposal allows therefore to softly cluster occasions into K clusters and, for each consensus matrix, firstly identify a set of Q first-order factors and secondly identify a unique general factor, which can be considered as the most synthetic indicator summarizing the original J variables. The performance of the methodology is tested through a detailed simulation study. Finally, it is also applied to a real dataset, where its strength and usefulness are revealed.
Fuzzy clustering and dimensionality reduction of a three-way data matrix / Bombelli, Ilaria; Bottazzi Schenone, Mariaelena; Vichi, Maurizio. - In: STATISTICS. - ISSN 1029-4910. - (2025), pp. 1-34. [10.1080/02331888.2025.2589299]
Fuzzy clustering and dimensionality reduction of a three-way data matrix
ilaria bombelli
;mariaelena bottazzi schenone;maurizio vichi
2025
Abstract
A three-way three-mode data array X, where modes are units, variables, and occasions, is the data structure that can comprehensively and statistically analyze a collective phenomenon. When X has a large dimension, it is important to synthesize its information by identifying classes of similar occasions where units are described by a reduced set of LVs. In this paper, a simultaneous reduction of the occasions and variables of X is proposed. A fuzzy clustering of the occasions allows the identification of K clusters of multivariate data matrices that are within-cluster perceived similar. For each cluster, a consensus matrix with respect to the units is identified. Variablesin the cluster are correlated and maintain their covariance structure that can be synthesized for each consensus matrix by applying aSecond-Order Disjoint Factor Analysis. The proposal allows therefore to softly cluster occasions into K clusters and, for each consensus matrix, firstly identify a set of Q first-order factors and secondly identify a unique general factor, which can be considered as the most synthetic indicator summarizing the original J variables. The performance of the methodology is tested through a detailed simulation study. Finally, it is also applied to a real dataset, where its strength and usefulness are revealed.| File | Dimensione | Formato | |
|---|---|---|---|
|
Bombelli_fuzzy-clustering_2025.pdf
solo gestori archivio
Note: Full text
Tipologia:
Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza:
Tutti i diritti riservati (All rights reserved)
Dimensione
4.32 MB
Formato
Adobe PDF
|
4.32 MB | Adobe PDF | Contatta l'autore |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.


