Benford’s law is often used to support critical decisions related to data quality or the presence of data manipulations or even fraud in large datasets. However, many authors argue that conventional statistical tests will reject the null of data “Benford-ness” if applied in samples of the typical size in this kind of applications, even in the presence of tiny and practically unimportant deviations from Benford’s law. Therefore, they suggest using alternative criteria that, however, lack solid statis- tical foundations. This paper contributes to the debate on the “large n” (or “excess power”) problem in the context of Benford’s law test- ing. This issue is discussed in relation with the notion of severity testing for goodness of fit tests, with a specific focus on tests for conformity with Benford’s law. To do so, we also derive the asymptotic distribu- tion of the mean absolute deviation (MAD) statistic as well as an asymptotic standard normal test. Finally, the severity testing principle is applied to six controversial large datasets to assess their “Benford-ness”.

Severe Testing of Benford’s Law / Cerqueti, R.; Lupi, C.. - In: TEST. - ISSN 1133-0686. - (2023). [10.1007/s11749-023-00848-z]

Severe Testing of Benford’s Law

R. Cerqueti;
2023

Abstract

Benford’s law is often used to support critical decisions related to data quality or the presence of data manipulations or even fraud in large datasets. However, many authors argue that conventional statistical tests will reject the null of data “Benford-ness” if applied in samples of the typical size in this kind of applications, even in the presence of tiny and practically unimportant deviations from Benford’s law. Therefore, they suggest using alternative criteria that, however, lack solid statis- tical foundations. This paper contributes to the debate on the “large n” (or “excess power”) problem in the context of Benford’s law test- ing. This issue is discussed in relation with the notion of severity testing for goodness of fit tests, with a specific focus on tests for conformity with Benford’s law. To do so, we also derive the asymptotic distribu- tion of the mean absolute deviation (MAD) statistic as well as an asymptotic standard normal test. Finally, the severity testing principle is applied to six controversial large datasets to assess their “Benford-ness”.
2023
Benford’s law; data quality; fraud discovery; goodness of fit; Large n problem; Severity
01 Pubblicazione su rivista::01a Articolo in rivista
Severe Testing of Benford’s Law / Cerqueti, R.; Lupi, C.. - In: TEST. - ISSN 1133-0686. - (2023). [10.1007/s11749-023-00848-z]
File allegati a questo prodotto
File Dimensione Formato  
TEST - Cerqueti Lupi.pdf

solo gestori archivio

Tipologia: Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza: Tutti i diritti riservati (All rights reserved)
Dimensione 1.05 MB
Formato Adobe PDF
1.05 MB Adobe PDF   Contatta l'autore

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1670172
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 3
  • ???jsp.display-item.citation.isi??? 3
social impact