x{C}o{R}e: Cross-context Coreference Resolution

Martinelli, Giuliano; Gatti, Bruno; Navigli, Roberto

doi:10.18653/v1/2025.emnlp-main.1737

Current coreference resolution systems are typically tailored for short- or medium-sized texts and struggle to scale to very long documents due to architectural limitations and implied memory costs.However, a few available solutions can be applied by inputting documents split into smaller windows. This is inherently similar to what happens in the cross-document setting, in which systems infer coreference relations between mentions that are found in separate documents.In this paper, we unify these two challenging settings under the general framework of cross-context coreference, and introduce xCoRe, a new unified approach designed to efficiently handle short-, long-, and cross-document coreference resolution.xCoRe adopts a three-step pipeline that first identifies mentions, then creates clusters within individual contexts, and finally merges clusters across contexts.In our experiments, we show that our formulation enables joint training on shared long- and cross-document resources, increasing data availability and particularly benefiting the challenging cross-document task.Our model achieves new state-of-the-art results on cross-document benchmarks and strong performance on long-document data, while retaining top-tier results on traditional datasets, positioning it as a robust, versatile solution that can be applied across all end-to-end coreference settings.We release our models and code at http://github.com/sapienzanlp/xcore.

x{C}o{R}e: Cross-context Coreference Resolution / Martinelli, Giuliano; Gatti, Bruno; Navigli, Roberto. - (2025), pp. 34252-34266. ( EMNLP Suzhou; China ) [10.18653/v1/2025.emnlp-main.1737].

x{C}o{R}e: Cross-context Coreference Resolution

Giuliano Martinelli^Primo;Bruno Gatti^Secondo;Roberto Navigli^Ultimo

2025

Abstract

Current coreference resolution systems are typically tailored for short- or medium-sized texts and struggle to scale to very long documents due to architectural limitations and implied memory costs.However, a few available solutions can be applied by inputting documents split into smaller windows. This is inherently similar to what happens in the cross-document setting, in which systems infer coreference relations between mentions that are found in separate documents.In this paper, we unify these two challenging settings under the general framework of cross-context coreference, and introduce xCoRe, a new unified approach designed to efficiently handle short-, long-, and cross-document coreference resolution.xCoRe adopts a three-step pipeline that first identifies mentions, then creates clusters within individual contexts, and finally merges clusters across contexts.In our experiments, we show that our formulation enables joint training on shared long- and cross-document resources, increasing data availability and particularly benefiting the challenging cross-document task.Our model achieves new state-of-the-art results on cross-document benchmarks and strong performance on long-document data, while retaining top-tier results on traditional datasets, positioning it as a robust, versatile solution that can be applied across all end-to-end coreference settings.We release our models and code at http://github.com/sapienzanlp/xcore.

Scheda breve

Scheda completa

	Anno di pubblicazione
	
				2025
			
	Nome convegno
	
				EMNLP
			
	Parole chiave
	
				coreference resolution; long-document; long-document coreference resolution; cross-document coreference resolution; information extraction
			
	Tipologia
	
				04 Pubblicazione in atti di convegno::04b Atto di convegno in volume
			
	Citazione
	
				x{C}o{R}e: Cross-context Coreference Resolution / Martinelli, Giuliano; Gatti, Bruno; Navigli, Roberto. - (2025), pp. 34252-34266. ( EMNLP Suzhou; China ) [10.18653/v1/2025.emnlp-main.1737].
			
	Appartiene alla tipologia:
	
				04b Atto di convegno in volume

File allegati a questo prodotto

File	Dimensione	Formato
Martinelli_xCoRe-Cross-context_2025.pdf accesso aperto Note: DOI: 10.18653/v1/2025.emnlp-main.1737 Tipologia: Documento in Post-print (versione successiva alla peer review e accettata per la pubblicazione) Licenza: Tutti i diritti riservati (All rights reserved) Dimensione 870.64 kB Formato Adobe PDF	870.64 kB	Adobe PDF

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1762314

Citazioni

ND

ND

ND

Catalogo dei prodotti della ricerca