In this article, the quality of data produced by national statistical institutes and by governmental institutions is considered. In particular, the problem of measurement error is analyzed and an integrated Bayesian network decision support system based on non-parametric Bayesian networks is proposed for its detection and correction. Non-parametric Bayesian networks are graphical models expressing dependence structure via bivariate copulas associated to the edges of the graph. The network structure and the misreport probability are estimated using a validation sample. The Bayesian network model is proposed to decide: (i) which records have to be corrected; (ii) the kind and amount of correction to be adopted. The proposed correction procedure is applied to the Banca d’Italia Survey on Household Income and Wealth and, specifically, the bond amounts are analyzed. Finally, the sensitivity of the conditional distribution of the true value random variable given the observed one to different evidence configurations is studied.

Towards an Integrated Bayesian Network Approach to Measurement Error Detection and Correction / Marella, D.; Vicard, P.. - In: COMMUNICATIONS IN STATISTICS. SIMULATION AND COMPUTATION. - ISSN 0361-0918. - (2017).

Towards an Integrated Bayesian Network Approach to Measurement Error Detection and Correction

Marella D.;Vicard P.
2017

Abstract

In this article, the quality of data produced by national statistical institutes and by governmental institutions is considered. In particular, the problem of measurement error is analyzed and an integrated Bayesian network decision support system based on non-parametric Bayesian networks is proposed for its detection and correction. Non-parametric Bayesian networks are graphical models expressing dependence structure via bivariate copulas associated to the edges of the graph. The network structure and the misreport probability are estimated using a validation sample. The Bayesian network model is proposed to decide: (i) which records have to be corrected; (ii) the kind and amount of correction to be adopted. The proposed correction procedure is applied to the Banca d’Italia Survey on Household Income and Wealth and, specifically, the bond amounts are analyzed. Finally, the sensitivity of the conditional distribution of the true value random variable given the observed one to different evidence configurations is studied.
2017
Banca d’Italia Survey on Household Income and Wealth; Bayesian network; Data quality; Misreport probability; Non-parametric Bayesian belief network; Validation sample
01 Pubblicazione su rivista::01a Articolo in rivista
Towards an Integrated Bayesian Network Approach to Measurement Error Detection and Correction / Marella, D.; Vicard, P.. - In: COMMUNICATIONS IN STATISTICS. SIMULATION AND COMPUTATION. - ISSN 0361-0918. - (2017).
File allegati a questo prodotto
File Dimensione Formato  
com_stat2017.pdf

solo gestori archivio

Tipologia: Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza: Tutti i diritti riservati (All rights reserved)
Dimensione 1.1 MB
Formato Adobe PDF
1.1 MB Adobe PDF   Contatta l'autore

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1617489
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact