Research products catalogue

Inference through innovation processes tested in the authorship attribution task / Tani Raffaelli, G.; Lalli, M.; Tria, F.. - In: COMMUNICATIONS PHYSICS. - ISSN 2399-3650. - 7:1(2024). [10.1038/s42005-024-01714-6]

Inference through innovation processes tested in the authorship attribution task

Tani Raffaelli G.; Lalli M.; Tria F.

2024

Abstract

Urn models for innovation capture fundamental empirical laws shared by several real-world processes. The so-called urn model with triggering includes, as particular cases, the urn representation of the two-parameter Poisson-Dirichlet process and the Dirichlet process, seminal in Bayesian non-parametric inference. In this work, we leverage this connection to introduce a general approach for quantifying closeness between symbolic sequences and test it within the framework of the authorship attribution problem. The method demonstrates high accuracy when compared to other related methods in different scenarios, featuring a substantial gain in computational efficiency and theoretical transparency. Beyond the practical convenience, this work demonstrates how the recently established connection between urn models and non-parametric Bayesian inference can pave the way for designing more efficient inference methods. In particular, the hybrid approach that we propose allows us to relax the exchangeability hypothesis, which can be particularly relevant for systems exhibiting complex correlation patterns and non-stationary dynamics.

A class of urn-based models accounts for stochastic regularities observed in systems that exhibit innovation in diverse forms and temporal scales, from the appearance of new organisms to the evolution of language to daily new experiences. The authors investigate the predictive power of those models in inference problems, addressing the authorship attribution task as a case study.
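
As an illustration of the processes named in the abstract, the following is a minimal sketch of the standard sequential (urn-style) sampling rule of the two-parameter Poisson-Dirichlet process. It is not the authors' attribution method, and it does not implement the urn model with triggering itself. The function name sample_sequence and the parameter names alpha (discount) and theta (concentration) are assumptions chosen for this example. After t draws containing k distinct symbols, a new symbol appears with probability (theta + alpha * k) / (theta + t); otherwise an existing symbol j with count n_j is repeated with probability proportional to n_j - alpha. Setting alpha = 0 recovers the Dirichlet process.

```python
import random


def sample_sequence(n, alpha, theta, seed=0):
    """Draw n symbols from the two-parameter Poisson-Dirichlet predictive rule.

    Assumed parameter ranges: 0 <= alpha < 1 and theta > -alpha.
    Symbols are integer labels assigned in order of first appearance.
    """
    if not (0 <= alpha < 1) or theta <= -alpha:
        raise ValueError("require 0 <= alpha < 1 and theta > -alpha")
    rng = random.Random(seed)
    counts = []    # counts[j] = number of occurrences of symbol j so far
    sequence = []
    for t in range(n):
        k = len(counts)
        # The first draw is always a new symbol; afterwards a new symbol
        # appears with probability (theta + alpha * k) / (theta + t).
        if t == 0 or rng.random() < (theta + alpha * k) / (theta + t):
            counts.append(1)
            sequence.append(k)
        else:
            # Repeat an existing symbol with weight (count - alpha).
            weights = [c - alpha for c in counts]
            j = rng.choices(range(k), weights=weights)[0]
            counts[j] += 1
            sequence.append(j)
    return sequence


if __name__ == "__main__":
    seq = sample_sequence(1000, alpha=0.5, theta=1.0, seed=1)
    print("distinct symbols:", len(set(seq)))
```

With alpha > 0 the number of distinct symbols grows as a power of the sequence length, while with alpha = 0 it grows logarithmically, which is one way to see why the discount parameter matters when modelling text-like sequences.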

Full record

Year of publication: 2024
Keywords: innovation processes; inference; Polya urns; authorship attribution
Type: 01 Journal publication::01a Journal article
Citation: Inference through innovation processes tested in the authorship attribution task / Tani Raffaelli, G.; Lalli, M.; Tria, F.. - In: COMMUNICATIONS PHYSICS. - ISSN 2399-3650. - 7:1(2024). [10.1038/s42005-024-01714-6]
Belongs to type: 01a Journal article

Files attached to this product

File	Size	Format
Tani-Raffaelli_Inference-through_2024.pdf (open access; Note: journal article; Type: publisher's version, published with the publisher's layout; License: Creative Commons)	766.3 kB	Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11573/1720811
