Recently, a method was proposed that combines multiple imputation and latent class analysis (MILC) to correct for misclassification in combined data sets. A multiply imputed data set is generated which can be used to estimate different statistics of interest in a straightforward manner and can ensure that uncertainty due to misclassification is incorporated in the estimate of the total variance. In this article, MILC is extended by using hidden Markov modeling so that it can handle longitudinal data and correspondingly create multiple imputations for multiple time points. Recently, many researchers have investigated the use of hidden Markov modeling to estimate employment status rates using a combined data set consisting of data originating from the Labor Force Survey (LFS) and register data; this combined data set is used for the setup of the simulation study performed in this article. Furthermore, the proposed method is applied to an Italian combined LFS-register data set. We demonstrate how the MILC method can be extended to create imputations of scores for multiple time points and thereby show how the method can be adapted to practical situations.

Combining Multiple Imputation and Hidden Markov Modeling to Obtain Consistent Estimates of Employment Status / Boeschoten, L.; Filipponi, D.; Varriale, R.. - In: JOURNAL OF SURVEY STATISTICS AND METHODOLOGY. - ISSN 2325-0984. - 9:3(2021), pp. 549-573. [10.1093/jssam/smz052]

Combining Multiple Imputation and Hidden Markov Modeling to Obtain Consistent Estimates of Employment Status

Varriale R.
2021

Abstract

Recently, a method was proposed that combines multiple imputation and latent class analysis (MILC) to correct for misclassification in combined data sets. A multiply imputed data set is generated which can be used to estimate different statistics of interest in a straightforward manner and can ensure that uncertainty due to misclassification is incorporated in the estimate of the total variance. In this article, MILC is extended by using hidden Markov modeling so that it can handle longitudinal data and correspondingly create multiple imputations for multiple time points. Recently, many researchers have investigated the use of hidden Markov modeling to estimate employment status rates using a combined data set consisting of data originating from the Labor Force Survey (LFS) and register data; this combined data set is used for the setup of the simulation study performed in this article. Furthermore, the proposed method is applied to an Italian combined LFS-register data set. We demonstrate how the MILC method can be extended to create imputations of scores for multiple time points and thereby show how the method can be adapted to practical situations.
2021
combined survey-register data; employment status; Hidden Markov model; multiple imputation
01 Pubblicazione su rivista::01a Articolo in rivista
Combining Multiple Imputation and Hidden Markov Modeling to Obtain Consistent Estimates of Employment Status / Boeschoten, L.; Filipponi, D.; Varriale, R.. - In: JOURNAL OF SURVEY STATISTICS AND METHODOLOGY. - ISSN 2325-0984. - 9:3(2021), pp. 549-573. [10.1093/jssam/smz052]
File allegati a questo prodotto
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1683703
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 4
  • ???jsp.display-item.citation.isi??? 4
social impact