The problem of underreported data is a frequently analyzed issue in statistics and concerns a wide range of phenomena, from medical to social aspects, and more. Traditional statistical models often fail to adequately address under-reporting biases, leading to systematic underestimation of true incidence rates. In this study, we analyze one of the main models developed within a hierarchical Bayesian framework to address and correct underreported spatio-temporal count data: the Pogit model. The proposed model assumes a latent Poisson distribution for true incidence counts and a Binomial observation mechanism to account for misreporting, incorporating spatially structured and unstructured random effects. To mitigate identifiability issues, we employ informative priors on the intercept of the logit model for the reporting probability. Through a simulation study, we analyze the sensitivity of parameter estimates to prior misspecification and evaluate the impact of using proxy covariates with varying degrees of correlation to the true reporting mechanism.
Under-reported data: a simulation study about parameter sensitivity / Panunzi, Greta; Polettini, Silvia; Arima, Serena. - (2025), pp. 599-606. ( 2025 Conference of the 12th Scientific Meeting of the Statistics for the Evaluation and Quality of Services Group of the Italian Statistical Society (SVQS) IES 2025 - Innovation & Society: Statistics and Data Science for Evaluation and Quality University of Padova Bressanone ).
Under-reported data: a simulation study about parameter sensitivity
Greta Panunzi
;Silvia Polettini;Serena Arima
2025
Abstract
The problem of underreported data is a frequently analyzed issue in statistics and concerns a wide range of phenomena, from medical to social aspects, and more. Traditional statistical models often fail to adequately address under-reporting biases, leading to systematic underestimation of true incidence rates. In this study, we analyze one of the main models developed within a hierarchical Bayesian framework to address and correct underreported spatio-temporal count data: the Pogit model. The proposed model assumes a latent Poisson distribution for true incidence counts and a Binomial observation mechanism to account for misreporting, incorporating spatially structured and unstructured random effects. To mitigate identifiability issues, we employ informative priors on the intercept of the logit model for the reporting probability. Through a simulation study, we analyze the sensitivity of parameter estimates to prior misspecification and evaluate the impact of using proxy covariates with varying degrees of correlation to the true reporting mechanism.| File | Dimensione | Formato | |
|---|---|---|---|
|
Book_IES25_Definitivo_extract.pdf
solo gestori archivio
Tipologia:
Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza:
Tutti i diritti riservati (All rights reserved)
Dimensione
3.07 MB
Formato
Adobe PDF
|
3.07 MB | Adobe PDF | Contatta l'autore |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.


