Algorithmic imputation techniques for missing data: performance comparisons and development perspectives

N. Solaro; A. Barbiero; G. Manzi; P. A. Ferrari

2012

Abstract

In recent years, much research has been devoted to solving the problem of missing data imputation. Although most of the novel proposals are attractive in some respect, less attention has been paid to the question of when and why a particular method should be preferred over the others. This question is crucial in applications, since unsuitable solutions can heavily affect the reliability of statistical analyses. Starting from this observation, this work studies how well several algorithmic-type imputation methods perform in the case of quantitative data. We focus on three different approaches to imputation, based respectively on random forests, iterative PCA, and the forward procedure. The last of these was originally introduced for ordinal data, so we had to develop an original adaptation that allows it to handle missing quantitative values.


Year of publication: 2012

Conference name: JCS-CLADAG 12

Keywords: Multivariate exponential power distribution; Multivariate skew-normal distribution; Nearest neighbour; Principal component analysis; Random forest

Type: 04 Publication in conference proceedings::04b Conference paper in volume

Citation: Algorithmic imputation techniques for missing data: performance comparisons and development perspectives / Solaro, N.; Barbiero, A.; Manzi, G.; Ferrari, P. A. - (2012). (JCS-CLADAG 12, Anacapri).

Belongs to type: 04b Conference paper in volume

Use this identifier to cite or link to this document: https://hdl.handle.net/11573/1727335
