Word Sense Disambiguation (WSD) is the task of associating a word in a given context with its most suitable meaning among a set of possible candidates. While the task has recently witnessed renewed interest, with systems achieving performances above the estimated inter-annotator agreement, at the time of writing it still struggles to find downstream applications. We argue that one of the reasons behind this is the difficulty of applying WSD to plain text. Indeed, in the standard formulation, models work under the assumptions that a) all the spans to disambiguate have already been identified, and b) all the possible candidate senses of each span are provided, both of which are requirements that are far from trivial. In this work, we present a new task called Word Sense Linking (WSL) where, given an input text and a reference sense inventory, systems have to both identify which spans to disambiguate and then link them to their most suitable meaning.We put forward a transformer-based architecture for the task and thoroughly evaluate both its performance and those of state-of-the-art WSD systems scaled to WSL, iteratively relaxing the assumptions of WSD. We hope that our work will foster easier integration of lexical semantics into downstream applications.

Word Sense Linking: Disambiguating Outside the Sandbox / Bejgu, ANDREI STEFAN; Barba, Edoardo; Procopio, Luigi; Fernández-Castro, Alberte; Navigli, Roberto. - (2024), pp. 14332-14347. ( 62nd Annual Meeting of the Association-for-Computational-Linguistics (ACL) Bangkok ) [10.18653/v1/2024.findings-acl.851].

Word Sense Linking: Disambiguating Outside the Sandbox

Andrei Stefan Bejgu
Primo
;
Edoardo Barba
Secondo
;
Luigi Procopio
Penultimo
;
Roberto Navigli
Ultimo
2024

Abstract

Word Sense Disambiguation (WSD) is the task of associating a word in a given context with its most suitable meaning among a set of possible candidates. While the task has recently witnessed renewed interest, with systems achieving performances above the estimated inter-annotator agreement, at the time of writing it still struggles to find downstream applications. We argue that one of the reasons behind this is the difficulty of applying WSD to plain text. Indeed, in the standard formulation, models work under the assumptions that a) all the spans to disambiguate have already been identified, and b) all the possible candidate senses of each span are provided, both of which are requirements that are far from trivial. In this work, we present a new task called Word Sense Linking (WSL) where, given an input text and a reference sense inventory, systems have to both identify which spans to disambiguate and then link them to their most suitable meaning.We put forward a transformer-based architecture for the task and thoroughly evaluate both its performance and those of state-of-the-art WSD systems scaled to WSL, iteratively relaxing the assumptions of WSD. We hope that our work will foster easier integration of lexical semantics into downstream applications.
2024
62nd Annual Meeting of the Association-for-Computational-Linguistics (ACL)
Word Sense Linking; Word Sense Disambiguation; Semantics
04 Pubblicazione in atti di convegno::04b Atto di convegno in volume
Word Sense Linking: Disambiguating Outside the Sandbox / Bejgu, ANDREI STEFAN; Barba, Edoardo; Procopio, Luigi; Fernández-Castro, Alberte; Navigli, Roberto. - (2024), pp. 14332-14347. ( 62nd Annual Meeting of the Association-for-Computational-Linguistics (ACL) Bangkok ) [10.18653/v1/2024.findings-acl.851].
File allegati a questo prodotto
File Dimensione Formato  
Bejgu_Word-Sense_2024.pdf

accesso aperto

Note: https://aclanthology.org/2024.findings-acl.851/
Tipologia: Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza: Creative commons
Dimensione 385.5 kB
Formato Adobe PDF
385.5 kB Adobe PDF

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1717771
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
social impact