Large amount of data are today available, that are easier and faster to collect than survey data,bringing new challenges. One of them is the nonprobability nature of these big data that maynot represent the target population properly and hence result in highly biased estimators. Inthis article two approaches for dealing with selection bias when the selection process isnonignorable are discussed. The first one, based on the empirical likelihood, does not requireparametric specification of the population model but the probability of being in thenonprobability sample needed to be modeled. Auxiliary information known for the populationor estimable from a probability sample can be incorporated as calibration constraints, thusenhancing the precision of the estimators. The second one is a mixed approach based on massimputation and propensity score adjustment requiring that the big data membership is knownthroughout a probability sample. Finally, two simulation experiments and an application toincome data are performed to evaluate the performance of the proposed estimators in terms ofrobustness and efficiency.

Adjusting for Selection Bias in Nonprobability Samples by Empirical Likelihood Approach / Marella, Daniela. - In: JOURNAL OF OFFICIAL STATISTICS. - ISSN 0282-423X. - (2023).

Adjusting for Selection Bias in Nonprobability Samples by Empirical Likelihood Approach

Daniela Marella
2023

Abstract

Large amount of data are today available, that are easier and faster to collect than survey data,bringing new challenges. One of them is the nonprobability nature of these big data that maynot represent the target population properly and hence result in highly biased estimators. Inthis article two approaches for dealing with selection bias when the selection process isnonignorable are discussed. The first one, based on the empirical likelihood, does not requireparametric specification of the population model but the probability of being in thenonprobability sample needed to be modeled. Auxiliary information known for the populationor estimable from a probability sample can be incorporated as calibration constraints, thusenhancing the precision of the estimators. The second one is a mixed approach based on massimputation and propensity score adjustment requiring that the big data membership is knownthroughout a probability sample. Finally, two simulation experiments and an application toincome data are performed to evaluate the performance of the proposed estimators in terms ofrobustness and efficiency.
2023
big data; informative sample; mass imputation
01 Pubblicazione su rivista::01a Articolo in rivista
Adjusting for Selection Bias in Nonprobability Samples by Empirical Likelihood Approach / Marella, Daniela. - In: JOURNAL OF OFFICIAL STATISTICS. - ISSN 0282-423X. - (2023).
File allegati a questo prodotto
File Dimensione Formato  
JOS_2023.pdf

accesso aperto

Note: manoscritto
Tipologia: Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza: Tutti i diritti riservati (All rights reserved)
Dimensione 507.44 kB
Formato Adobe PDF
507.44 kB Adobe PDF

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1684072
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
social impact