Robust regression techniques are useful tools in sampling design for several reasons. When stratified samples are considered, three main issues must be addressed: the sample size, the determination of the strata bounds, and the allocation of the sample across the strata. Since the target variable Y, the object of the survey, is unknown, auxiliary information X, known for the entire population from which the sample is drawn, is used instead. Such information is helpful because it is typically strongly correlated with the target Y, although some discrepancies between the two variables may arise. The use of auxiliary information, combined with the choice of an appropriate statistical model for the relationship between Y and X, is crucial for determining the strata bounds, the sample size and the sampling rates for a chosen precision level of the estimates, as shown by Rivest (2002). However, this regression-based approach is highly sensitive to contaminated data. Since the key tool for stratified sampling is the measure of scale of Y conditional on the auxiliary X, this paper proposes a robust approach based on the S-estimator of the regression. The aim is to allow for robust determination of the sample size and strata bounds, together with optimal sample allocation. Simulation results based on data from the Construction sector of a Structural Business Survey illustrate the advantages of the proposed method.
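To make the roles of sample size, strata bounds and allocation concrete, the following is a minimal sketch and not the method of the paper. It assumes the strata bounds are already given, uses X itself as a stand-in for Y, applies the textbook Neyman allocation with a target coefficient of variation, and swaps in the scaled median absolute deviation as a simple robust scale in place of the S-estimator. The function names `mad_scale` and `neyman_design` are hypothetical.

```python
import math
import statistics


def mad_scale(values):
    """Robust scale: median absolute deviation, scaled by 1.4826 so that it
    is consistent with the standard deviation under normality.
    (Assumption: a simple stand-in, not the S-estimator of the paper.)"""
    med = statistics.median(values)
    return 1.4826 * statistics.median([abs(v - med) for v in values])


def neyman_design(x, bounds, target_cv):
    """Sample size and Neyman allocation for strata defined on x.

    bounds: increasing upper bounds of the strata; the last one should be
            math.inf so that every unit falls in some stratum.
    target_cv: required coefficient of variation of the estimated mean.
    Returns (n, allocation), where allocation holds fractional stratum sizes.
    Assumes every stratum is non-empty and has a positive scale.
    """
    strata = [[] for _ in bounds]
    for v in x:
        for h, b in enumerate(bounds):
            if v <= b:
                strata[h].append(v)
                break

    N = len(x)
    weights = [len(s) / N for s in strata]   # W_h = N_h / N
    scales = [mad_scale(s) for s in strata]  # S_h, robust scale per stratum

    mean = sum(x) / N
    target_var = (target_cv * mean) ** 2     # required variance of the mean

    ws = sum(w * s for w, s in zip(weights, scales))
    ws2 = sum(w * s * s for w, s in zip(weights, scales))

    # Classical sample size under Neyman allocation, with finite
    # population correction.
    n = math.ceil(ws ** 2 / (target_var + ws2 / N))

    # Neyman allocation: n_h proportional to W_h * S_h.
    allocation = [n * w * s / ws for w, s in zip(weights, scales)]
    return n, allocation


if __name__ == "__main__":
    population = list(range(1, 101))
    n, alloc = neyman_design(population, [50, math.inf], target_cv=0.05)
    print(n, alloc)
```

The sketch does not cap the allocation at the stratum size and does not search for the bounds themselves; both would be needed in a full stratification algorithm.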

Robust LH stratified sampling strategy / Bramati, Maria Caterina. - In: SURVEY RESEARCH METHODS. - ISSN 1864-3361. - ELECTRONIC. - 6:(2012).

Robust LH stratified sampling strategy

BRAMATI, Maria Caterina
2012

01 Journal publication::01a Journal article

Use this identifier to cite or link to this document: https://hdl.handle.net/11573/498606