The Kernel Density Estimation (KDE) method is seen here as the first step of the Expectation Maximization (EM) algorithm for estimating the density of a latent variable when the initial guess is the uniform distribution. The properties of the first EM step are then investigated for different choices of the starting density. When the KDE itself is chosen the asymptotic bias of the EM update has the opposite value of KDE while the variance order is maintained. Thus, the average of the EM update with the KDE reduces the best achievable mean integrated square error from n-4/5to n-8/9. Another estimator that achieves higher order efficiency (HOE) is directly obtained by the EM update when the initial guess is the square root of the KDE. Moreover, both the average type and the square-root type of HOE estimators have a semiparametric twin estimator, obtained by adopting a parametric estimate in place of the initial guess and, of course, they achieve HOE when the parametric family is exact. These four HOE methods are then tested on simulations and compared to another pair of estimators known in literature which can be seen as a third way to bias-correct the EM update.
NEW METHODS FOR PARAMETRIC AND NONPARAMETRIC HIGH ORDER KERNEL DENSITY ESTIMATION / Pezzulli, Sergio. - (2019).
NEW METHODS FOR PARAMETRIC AND NONPARAMETRIC HIGH ORDER KERNEL DENSITY ESTIMATION
PEZZULLI, Sergio
2019
Abstract
The Kernel Density Estimation (KDE) method is seen here as the first step of the Expectation Maximization (EM) algorithm for estimating the density of a latent variable when the initial guess is the uniform distribution. The properties of the first EM step are then investigated for different choices of the starting density. When the KDE itself is chosen the asymptotic bias of the EM update has the opposite value of KDE while the variance order is maintained. Thus, the average of the EM update with the KDE reduces the best achievable mean integrated square error from n-4/5to n-8/9. Another estimator that achieves higher order efficiency (HOE) is directly obtained by the EM update when the initial guess is the square root of the KDE. Moreover, both the average type and the square-root type of HOE estimators have a semiparametric twin estimator, obtained by adopting a parametric estimate in place of the initial guess and, of course, they achieve HOE when the parametric family is exact. These four HOE methods are then tested on simulations and compared to another pair of estimators known in literature which can be seen as a third way to bias-correct the EM update.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.