The problem of missing data in building multidimensional composite indicators is a delicate problem which is often underrated. An imputation method particularly suitable for categorical data is proposed. This method is discussed in detail in the framework of nonlinear principal component analysis and compared to other missing data treatments which are commonly used in this analysis. Its performance vs. these other methods is evaluated throughout a simulation procedure performed on both an artificial case, varying the experimental conditions, and a real case. The proposed procedure is implemented using R.
An imputation method for categorical variables with application to nonlinear principal component analysis / Ferrari, P. A.; Annoni, P.; Barbiero, A.; Manzi, G.. - In: COMPUTATIONAL STATISTICS & DATA ANALYSIS. - ISSN 0167-9473. - 55:7(2011), pp. 2410-2420. [10.1016/j.csda.2011.02.007]
An imputation method for categorical variables with application to nonlinear principal component analysis
G. Manzi
2011
Abstract
The problem of missing data in building multidimensional composite indicators is a delicate problem which is often underrated. An imputation method particularly suitable for categorical data is proposed. This method is discussed in detail in the framework of nonlinear principal component analysis and compared to other missing data treatments which are commonly used in this analysis. Its performance vs. these other methods is evaluated throughout a simulation procedure performed on both an artificial case, varying the experimental conditions, and a real case. The proposed procedure is implemented using R.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.