Maverick: Efficient and Accurate Coreference Resolution Defying Recent Trends

Martinelli, Giuliano; Barba, Edoardo; Navigli, Roberto

doi:10.18653/v1/2024.acl-long.722

Large autoregressive generative models have emerged as the cornerstone for achieving the highest performance across several Natural Language Processing tasks. However, the urge to attain superior results has, at times, led to the premature replacement of carefully designed task-specific approaches without exhaustive experimentation. The Coreference Resolution task is no exception; all recent state-of-the-art solutions adopt large generative autoregressive models that outperform encoder-based discriminative systems. In this work, we challenge this recent trend by introducing Maverick, a carefully designed – yet simple – pipeline, which enables running a state-of-the-art Coreference Resolution system within the constraints of an academic budget, outperforming models with up to 13 billion parameters with as few as 500 million parameters. Maverick achieves state-of-the-art performance on the CoNLL-2012 benchmark, training with up to 0.006x the memory resources and obtaining a 170x faster inference compared to previous state-of-the-art systems. We extensively validate the robustness of the Maverick framework with an array of diverse experiments, reporting improvements over prior systems in data-scarce, long-document, and out-of-domain settings. We release our code and models for research purposes at https://github.com/SapienzaNLP/maverick-coref.

Maverick: Efficient and Accurate Coreference Resolution Defying Recent Trends / Martinelli, Giuliano; Barba, Edoardo; Navigli, Roberto. - Volume 1: Long Papers (2024), pp. 13380-13394. (Paper presented at the Association for Computational Linguistics conference, held in Bangkok, Thailand) [10.18653/v1/2024.acl-long.722].

Publication year: 2024
Conference name: Association for Computational Linguistics
Keywords: coreference resolution; information extraction; efficiency; natural language understanding; state of the art; fast
Type: 04 Publication in conference proceedings::04b Conference paper in volume

Files attached to this item

File	Size	Format
Martinelli_Efficient_2024.pdf (open access). Note: https://aclanthology.org/2024.acl-long.722.pdf. Type: Publisher's version (published version with the publisher's layout). License: All rights reserved	304.67 kB	Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11573/1719954
