The increased availability of a large amount of administrative information suggests investigating new methodological approaches for the production of estimates, based on integrating and combining administrative and statistical survey data. We show how latent variable models can be used for estimating employment proportions in the Italian regions using survey and administrative data, taking into account the deficiencies in the corresponding measurement process and the longitudinal structure of the data. To this purpose, we adopt a mixture of hidden Markov models, a longitudinal extension of latent class analysis; in the model specification, we consider covariates and unobserved heterogeneity of the latent process among the units of the population. The mixture of hidden Markov model is used to evaluate the accuracy of the available administrative sources, to define their use in the statistical production, and to produce statistics on employment status at both regional and national levels of aggregation of the entire population.

Multi-source inference via mixture of hidden Markov models: application to regional labour statistics in Italy / Filipponi, Danila; Guarnera, Ugo; Varriale, Roberta. - In: JOURNAL OF THE ROYAL STATISTICAL SOCIETY. SERIES A. STATISTICS IN SOCIETY. - ISSN 0964-1998. - 188:(2024), pp. 98-118.

Multi-source inference via mixture of hidden Markov models: application to regional labour statistics in Italy

Ugo Guarnera;Roberta Varriale
2024

Abstract

The increased availability of a large amount of administrative information suggests investigating new methodological approaches for the production of estimates, based on integrating and combining administrative and statistical survey data. We show how latent variable models can be used for estimating employment proportions in the Italian regions using survey and administrative data, taking into account the deficiencies in the corresponding measurement process and the longitudinal structure of the data. To this purpose, we adopt a mixture of hidden Markov models, a longitudinal extension of latent class analysis; in the model specification, we consider covariates and unobserved heterogeneity of the latent process among the units of the population. The mixture of hidden Markov model is used to evaluate the accuracy of the available administrative sources, to define their use in the statistical production, and to produce statistics on employment status at both regional and national levels of aggregation of the entire population.
2024
labour statistics; latent variable model; longitudinal data; mixture hidden Markov model; multi-source statistic
01 Pubblicazione su rivista::01a Articolo in rivista
Multi-source inference via mixture of hidden Markov models: application to regional labour statistics in Italy / Filipponi, Danila; Guarnera, Ugo; Varriale, Roberta. - In: JOURNAL OF THE ROYAL STATISTICAL SOCIETY. SERIES A. STATISTICS IN SOCIETY. - ISSN 0964-1998. - 188:(2024), pp. 98-118.
File allegati a questo prodotto
File Dimensione Formato  
Filipponi_multi-source-inference_2024.pdf

solo gestori archivio

Tipologia: Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza: Tutti i diritti riservati (All rights reserved)
Dimensione 1.02 MB
Formato Adobe PDF
1.02 MB Adobe PDF   Contatta l'autore

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1735702
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 1
  • ???jsp.display-item.citation.isi??? 1
social impact