Word Sense Disambiguation models exist in many flavors. Even though supervised ones tend to perform best in terms of accuracy, they often lose ground to more flexible knowledge-based solutions, which do not require training by a word expert for every disambiguation target. To bridge this gap we adopt a different perspective and rely on sequence learning to frame the disambiguation problem: we propose and study in depth a series of end-to-end neural architectures directly tailored to the task, from bidirectional Long Short-Term Memory to encoder-decoder models. Our extensive evaluation over standard benchmarks and in multiple languages shows that sequence learning enables more versatile all-words models that consistently lead to state-of-the-art results, even against word experts with engineered features

Neural sequence learning models for Word Sense Disambiguation / Raganato, Alessandro; DELLI BOVI, Claudio; Navigli, Roberto. - ELETTRONICO. - 1:(2017), pp. 1156-1167. (Intervento presentato al convegno Conference on Empirical Methods in Natural Language Processing tenutosi a Copenaghen, Danimarca nel 7-11 Settembre 2017).

Neural sequence learning models for Word Sense Disambiguation

raganato, alessandro;DELLI BOVI, CLAUDIO;NAVIGLI, Roberto
2017

Abstract

Word Sense Disambiguation models exist in many flavors. Even though supervised ones tend to perform best in terms of accuracy, they often lose ground to more flexible knowledge-based solutions, which do not require training by a word expert for every disambiguation target. To bridge this gap we adopt a different perspective and rely on sequence learning to frame the disambiguation problem: we propose and study in depth a series of end-to-end neural architectures directly tailored to the task, from bidirectional Long Short-Term Memory to encoder-decoder models. Our extensive evaluation over standard benchmarks and in multiple languages shows that sequence learning enables more versatile all-words models that consistently lead to state-of-the-art results, even against word experts with engineered features
2017
Conference on Empirical Methods in Natural Language Processing
Word Sense Disambiguation; Sequence Modeling; Neural Networks; Deep Learning
04 Pubblicazione in atti di convegno::04b Atto di convegno in volume
Neural sequence learning models for Word Sense Disambiguation / Raganato, Alessandro; DELLI BOVI, Claudio; Navigli, Roberto. - ELETTRONICO. - 1:(2017), pp. 1156-1167. (Intervento presentato al convegno Conference on Empirical Methods in Natural Language Processing tenutosi a Copenaghen, Danimarca nel 7-11 Settembre 2017).
File allegati a questo prodotto
File Dimensione Formato  
Navigli_Neural_2017.pdf

accesso aperto

Tipologia: Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza: Tutti i diritti riservati (All rights reserved)
Dimensione 887.41 kB
Formato Adobe PDF
887.41 kB Adobe PDF

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1003054
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 162
  • ???jsp.display-item.citation.isi??? ND
social impact