Local models have recently attained astounding performances in Entity Disambiguation (ED), with generative and extractive formulations being the most promising research directions. However, previous works have so far limited their studies to using, as the textual representation of each candidate, only its Wikipedia title. Although certainly effective, this strategy presents a few critical issues, especially when titles are not sufficiently informative or distinguishable from one another. In this paper, we address this limitation and investigate the extent to which more expressive textual representations can mitigate it. We evaluate our approach thoroughly against standard benchmarks in ED and find extractive formulations to be particularly well-suited to such representations. We report a new state of the art on 2 out of the 6 benchmarks we consider and strongly improve the generalization capability over unseen patterns. We release our code, data and model checkpoints at https://github.com/SapienzaNLP/extend.

Entity Disambiguation with Entity Definitions / Procopio, Luigi; Conia, Simone; Barba, Edoardo; Navigli, Roberto. - (2023), pp. 1297-1303. (Intervento presentato al convegno European Association for Computational Linguistics tenutosi a Dubrovnik; Croatia) [10.18653/v1/2023.eacl-main.93].

Entity Disambiguation with Entity Definitions

Luigi Procopio
Primo
;
Simone Conia
;
Edoardo Barba
;
Roberto Navigli
2023

Abstract

Local models have recently attained astounding performances in Entity Disambiguation (ED), with generative and extractive formulations being the most promising research directions. However, previous works have so far limited their studies to using, as the textual representation of each candidate, only its Wikipedia title. Although certainly effective, this strategy presents a few critical issues, especially when titles are not sufficiently informative or distinguishable from one another. In this paper, we address this limitation and investigate the extent to which more expressive textual representations can mitigate it. We evaluate our approach thoroughly against standard benchmarks in ED and find extractive formulations to be particularly well-suited to such representations. We report a new state of the art on 2 out of the 6 benchmarks we consider and strongly improve the generalization capability over unseen patterns. We release our code, data and model checkpoints at https://github.com/SapienzaNLP/extend.
2023
European Association for Computational Linguistics
entity disambiguation; named entities; natural language processing
04 Pubblicazione in atti di convegno::04b Atto di convegno in volume
Entity Disambiguation with Entity Definitions / Procopio, Luigi; Conia, Simone; Barba, Edoardo; Navigli, Roberto. - (2023), pp. 1297-1303. (Intervento presentato al convegno European Association for Computational Linguistics tenutosi a Dubrovnik; Croatia) [10.18653/v1/2023.eacl-main.93].
File allegati a questo prodotto
File Dimensione Formato  
Procopio_Entity_2023.pdf

accesso aperto

Tipologia: Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza: Creative commons
Dimensione 599.53 kB
Formato Adobe PDF
599.53 kB Adobe PDF

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1685073
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 3
  • ???jsp.display-item.citation.isi??? 1
social impact