Statistical matching aims to produce a single synthetic data file in which variables observed in different sample surveys are jointly recorded. Such a file is appropriate for further statistical analysis when the joint probability distribution of the variables of interest in the population coincides with the probability distribution of the same variables in the synthetic data file, or at least when the two distributions are “close enough”. The discrepancy between these distributions is called matching noise. In this paper, statistical matching methods based on hot-deck imputation procedures are investigated as a possible cause of matching noise. Two examples, in which data are generated from uniform and normal distributions, are discussed.
Matching noise: formalization of the problem and some examples / Conti, Pier Luigi; Scanu, Mauro. - In: RIVISTA DI STATISTICA UFFICIALE. - ISSN 1828-1982. - Print. - (2006), pp. 43-56.
Matching noise: formalization of the problem and some examples
CONTI, Pier Luigi
2006