The problem of error detection is generally approached by formulating a set of rules that each household must respect in order to be declared correct. Afterwards, in the correction process, the incorrect values of erroneous records should be replaced by new correct ones with the purpose of restoring their unknown original values. The correction methodology is based on the use of correct records called donors. However, when the set of donors is very large, as in the case of a Census, the iterative comparison between every single erroneous record e and all possible donors could require unacceptable computational time. Therefore, we propose here a new approach for reducing the number of donors that must be examined. This is obtained by preventively dividing the large set of donors by means of a new clustering algorithm.

Data Clustering for Improving the Selection of Donors for Data Imputation / G., Bianchi; Bruni, Renato; R., Nucara; A., Reale. - STAMPA. - (2005).

Data Clustering for Improving the Selection of Donors for Data Imputation

BRUNI, Renato;
2005

Abstract

The problem of error detection is generally approached by formulating a set of rules that each household must respect in order to be declared correct. Afterwards, in the correction process, the incorrect values of erroneous records should be replaced by new correct ones with the purpose of restoring their unknown original values. The correction methodology is based on the use of correct records called donors. However, when the set of donors is very large, as in the case of a Census, the iterative comparison between every single erroneous record e and all possible donors could require unacceptable computational time. Therefore, we propose here a new approach for reducing the number of donors that must be examined. This is obtained by preventively dividing the large set of donors by means of a new clustering algorithm.
2005
Classification and Data Analysis 2005
887847066X
Clustering; Data Mining; Optimization
02 Pubblicazione su volume::02a Capitolo o Articolo
Data Clustering for Improving the Selection of Donors for Data Imputation / G., Bianchi; Bruni, Renato; R., Nucara; A., Reale. - STAMPA. - (2005).
File allegati a questo prodotto
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/498685
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact