Research products catalogue

Missing data are a common issue in datasets used for socio-economic research, so implementing, applying, and evaluating imputation methods can benefit the economic and social sciences. This paper applies and compares the performance of different imputation procedures on a specific, original dataset on national public R&D funding, and identifies and evaluates the best method (among those proposed) for longitudinal data. The procedures shown here can be generalized to any social science context affected by missing data, including official socio-economic statistics. Our results indicate that how much the various imputation methods improve the estimates depends on the characteristics of the data. Linear Interpolation fits our data better, while Two-fold Fully Conditional Specification (FCS) seems to be the best approach when the missing values are not in consecutive years, compared to Multiple Imputation by Chained Equations (MICE) and Full Information Maximum Likelihood (FIML) procedures.
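To make the simplest of the named methods concrete, the following is a minimal sketch of linear interpolation on a single yearly series. It is not the paper's implementation: the abstract only names the method, so the function name `interpolate_linear`, the equal yearly spacing, the choice to leave leading and trailing gaps unfilled, and the funding figures are all assumptions made for illustration.

```python
from typing import List, Optional


def interpolate_linear(values: List[Optional[float]]) -> List[Optional[float]]:
    """Fill interior gaps (None) in an equally spaced yearly series.

    Leading and trailing gaps stay None, because there is no observation
    on both sides to interpolate between (an assumption of this sketch).
    """
    filled = list(values)
    observed = [i for i, v in enumerate(values) if v is not None]
    # Walk over each pair of consecutive observed positions and fill the gap between them.
    for left, right in zip(observed, observed[1:]):
        step = (values[right] - values[left]) / (right - left)
        for i in range(left + 1, right):
            filled[i] = values[left] + step * (i - left)
    return filled


# Hypothetical funding figures for one country over seven years; None marks a missing year.
series = [100.0, None, None, 130.0, 128.0, None, 140.0]
print(interpolate_linear(series))
# [100.0, 110.0, 120.0, 130.0, 128.0, 134.0, 140.0]
```

In a longitudinal dataset the same function would be applied separately to each unit (for example, each country), so that values are never interpolated across units.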

Imputation methods for estimating public R&D funding: evidence from longitudinal data / Zinilli, A.. - In: QUALITY & QUANTITY. - ISSN 0033-5177. - (2020). [10.1007/s11135-020-01023-4]

Imputation methods for estimating public R&D funding: evidence from longitudinal data

Antonio Zinilli (first author)

2020



	Publication year
	
				2020
			
	Keywords
	
				Imputation methods; R&D funding; Statistics
			
	Type
	
				01 Journal publication::01a Journal article
			


Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11573/1697946

