Large amount of data are today available, that are easier and faster to collect than survey data,bringing new challenges. One of them is the nonprobability nature of these big data that maynot represent the target population properly and hence result in highly biased estimators. Inthis article two approaches for dealing with selection bias when the selection process isnonignorable are discussed. The first one, based on the empirical likelihood, does not requireparametric specification of the population model but the probability of being in thenonprobability sample needed to be modeled. Auxiliary information known for the populationor estimable from a probability sample can be incorporated as calibration constraints, thusenhancing the precision of the estimators. The second one is a mixed approach based on massimputation and propensity score adjustment requiring that the big data membership is knownthroughout a probability sample. Finally, two simulation experiments and an application toincome data are performed to evaluate the performance of the proposed estimators in terms ofrobustness and efficiency.
Adjusting for Selection Bias in Nonprobability Samples by Empirical Likelihood Approach / Marella, Daniela. - In: JOURNAL OF OFFICIAL STATISTICS. - ISSN 0282-423X. - (2023).
Adjusting for Selection Bias in Nonprobability Samples by Empirical Likelihood Approach
Daniela Marella
2023
Abstract
Large amount of data are today available, that are easier and faster to collect than survey data,bringing new challenges. One of them is the nonprobability nature of these big data that maynot represent the target population properly and hence result in highly biased estimators. Inthis article two approaches for dealing with selection bias when the selection process isnonignorable are discussed. The first one, based on the empirical likelihood, does not requireparametric specification of the population model but the probability of being in thenonprobability sample needed to be modeled. Auxiliary information known for the populationor estimable from a probability sample can be incorporated as calibration constraints, thusenhancing the precision of the estimators. The second one is a mixed approach based on massimputation and propensity score adjustment requiring that the big data membership is knownthroughout a probability sample. Finally, two simulation experiments and an application toincome data are performed to evaluate the performance of the proposed estimators in terms ofrobustness and efficiency.File | Dimensione | Formato | |
---|---|---|---|
JOS_2023.pdf
accesso aperto
Note: manoscritto
Tipologia:
Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza:
Tutti i diritti riservati (All rights reserved)
Dimensione
507.44 kB
Formato
Adobe PDF
|
507.44 kB | Adobe PDF |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.