Word Sense Disambiguation (WSD) is the task of identifying the meaning of a word in a given context. It lies at the base of Natural Language Processing as it provides semantic information for words. In the last decade, great strides have been made in this field and much effort has been devoted to mitigate the knowledge acquisition bottleneck problem, i.e., the problem of semantically annotating texts at a large scale and in different languages. This issue is ubiquitous in WSD as it hinders the creation of both multilingual knowledge bases and manually-curated training sets. In this work, we first introduce the reader to the task of WSD through a short historical digression and then take the stock of the advancements to alleviate the knowledge acquisition bottleneck problem. In that, we survey the literature on manual, semi-automatic and automatic approaches to create English and multilingual corpora tagged with sense annotations and present a clear overview over supervised models for WSD. Finally, we provide our view over the future directions that we foresee for the field.

The Knowledge Acquisition Bottleneck Problem in Multilingual Word Sense Disambiguation / Pasini, Tommaso. - (2020), pp. 4936-4942. (Intervento presentato al convegno IJCAI International Joint Conferences on Artificial Intelligence Organization tenutosi a Yokohama) [10.24963/ijcai.2020/687].

The Knowledge Acquisition Bottleneck Problem in Multilingual Word Sense Disambiguation

pasini tommaso
Primo
2020

Abstract

Word Sense Disambiguation (WSD) is the task of identifying the meaning of a word in a given context. It lies at the base of Natural Language Processing as it provides semantic information for words. In the last decade, great strides have been made in this field and much effort has been devoted to mitigate the knowledge acquisition bottleneck problem, i.e., the problem of semantically annotating texts at a large scale and in different languages. This issue is ubiquitous in WSD as it hinders the creation of both multilingual knowledge bases and manually-curated training sets. In this work, we first introduce the reader to the task of WSD through a short historical digression and then take the stock of the advancements to alleviate the knowledge acquisition bottleneck problem. In that, we survey the literature on manual, semi-automatic and automatic approaches to create English and multilingual corpora tagged with sense annotations and present a clear overview over supervised models for WSD. Finally, we provide our view over the future directions that we foresee for the field.
2020
IJCAI International Joint Conferences on Artificial Intelligence Organization
Survey, Word Sense Disambiguation, Natural Language Processing, Word Sense Distribution
04 Pubblicazione in atti di convegno::04b Atto di convegno in volume
The Knowledge Acquisition Bottleneck Problem in Multilingual Word Sense Disambiguation / Pasini, Tommaso. - (2020), pp. 4936-4942. (Intervento presentato al convegno IJCAI International Joint Conferences on Artificial Intelligence Organization tenutosi a Yokohama) [10.24963/ijcai.2020/687].
File allegati a questo prodotto
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1431868
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 30
  • ???jsp.display-item.citation.isi??? ND
social impact