Density estimation is a central topic in statistics and a fundamental task of machine learning. In this paper, we present an algorithm for approximating multivariate empirical densities with a piecewise constant distribution defined on a hyperrectangular-shaped partition of the domain. The piecewise constant distribution is constructed through a hierarchical bisection scheme, such that locally, the sample cannot be statistically distinguished from a uniform distribution. The Wasserstein distance has been used to measure the uniformity of the sample data points lying in each partition element. Since the resulting density estimator requires significantly less memory to be stored, it can be used in a situation where the information contained in a multivariate sample needs to be preserved, transferred or analysed.

Density estimation of multivariate samples using Wasserstein distance / Luini, E.; Arbenz, P.. - In: JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION. - ISSN 0094-9655. - 90:2(2020), pp. 181-210. [10.1080/00949655.2019.1675661]

Density estimation of multivariate samples using Wasserstein distance

Luini E.
Primo
;
2020

Abstract

Density estimation is a central topic in statistics and a fundamental task of machine learning. In this paper, we present an algorithm for approximating multivariate empirical densities with a piecewise constant distribution defined on a hyperrectangular-shaped partition of the domain. The piecewise constant distribution is constructed through a hierarchical bisection scheme, such that locally, the sample cannot be statistically distinguished from a uniform distribution. The Wasserstein distance has been used to measure the uniformity of the sample data points lying in each partition element. Since the resulting density estimator requires significantly less memory to be stored, it can be used in a situation where the information contained in a multivariate sample needs to be preserved, transferred or analysed.
2020
multivariate histogram; Nonparametric density estimation; piecewise constant distribution; Wasserstein distance
01 Pubblicazione su rivista::01a Articolo in rivista
Density estimation of multivariate samples using Wasserstein distance / Luini, E.; Arbenz, P.. - In: JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION. - ISSN 0094-9655. - 90:2(2020), pp. 181-210. [10.1080/00949655.2019.1675661]
File allegati a questo prodotto
File Dimensione Formato  
Luini_Density_2020.pdf

Open Access dal 23/01/2021

Tipologia: Documento in Post-print (versione successiva alla peer review e accettata per la pubblicazione)
Licenza: Tutti i diritti riservati (All rights reserved)
Dimensione 1.83 MB
Formato Adobe PDF
1.83 MB Adobe PDF

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1344363
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 3
  • ???jsp.display-item.citation.isi??? 3
social impact