Recently, a method was proposed that combines multiple imputation and latent class analysis (MILC) to correct for misclassification in combined data sets. A multiply imputed data set is generated which can be used to estimate different statistics of interest in a straightforward manner and can ensure that uncertainty due to misclassification is incorporated in the estimate of the total variance. In this article, MILC is extended by using hidden Markov modeling so that it can handle longitudinal data and correspondingly create multiple imputations for multiple time points. Recently, many researchers have investigated the use of hidden Markov modeling to estimate employment status rates using a combined data set consisting of data originating from the Labor Force Survey (LFS) and register data; this combined data set is used for the setup of the simulation study performed in this article. Furthermore, the proposed method is applied to an Italian combined LFS-register data set. We demonstrate how the MILC method can be extended to create imputations of scores for multiple time points and thereby show how the method can be adapted to practical situations.
Combining Multiple Imputation and Hidden Markov Modeling to Obtain Consistent Estimates of Employment Status / Boeschoten, L.; Filipponi, D.; Varriale, R.. - In: JOURNAL OF SURVEY STATISTICS AND METHODOLOGY. - ISSN 2325-0984. - 9:3(2021), pp. 549-573. [10.1093/jssam/smz052]
Combining Multiple Imputation and Hidden Markov Modeling to Obtain Consistent Estimates of Employment Status
Varriale R.
2021
Abstract
Recently, a method was proposed that combines multiple imputation and latent class analysis (MILC) to correct for misclassification in combined data sets. A multiply imputed data set is generated which can be used to estimate different statistics of interest in a straightforward manner and can ensure that uncertainty due to misclassification is incorporated in the estimate of the total variance. In this article, MILC is extended by using hidden Markov modeling so that it can handle longitudinal data and correspondingly create multiple imputations for multiple time points. Recently, many researchers have investigated the use of hidden Markov modeling to estimate employment status rates using a combined data set consisting of data originating from the Labor Force Survey (LFS) and register data; this combined data set is used for the setup of the simulation study performed in this article. Furthermore, the proposed method is applied to an Italian combined LFS-register data set. We demonstrate how the MILC method can be extended to create imputations of scores for multiple time points and thereby show how the method can be adapted to practical situations.File | Dimensione | Formato | |
---|---|---|---|
Boeshoten_combining-multiple-imputation_2021.pdf
accesso aperto
Tipologia:
Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza:
Creative commons
Dimensione
530.59 kB
Formato
Adobe PDF
|
530.59 kB | Adobe PDF |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.