Word Sense Disambiguation (WSD), ie, the task of assigning senses to words in context, has seen a surge of interest with the advent of neural models and a considerable increase in performance up to 80% F1 in English. However, when considering other languages, the availability of training data is limited, which hampers scaling WSD to many languages. To address this issue, we put forward MULTIMIRROR, a sense projection approach for multilingual WSD based on a novel neural discriminative model for word alignment: given as input a pair of parallel sentences, our model–trained with a low number of instances–is capable of jointly aligning, at the same time, all source and target tokens with each other, surpassing its competitors across several language combinations. We demonstrate that projecting senses from English by leveraging the alignments produced by our model leads a simple mBERT-powered classifier to achieve a new state of the art on established WSD datasets in French, German, Italian, Spanish and Japanese. We release our software and all our datasets at https://github. com/SapienzaNLP/multimirror.

MultiMirror: Neural Cross-lingual Word Alignment for Multilingual Word Sense Disambiguation / Procopio, Luigi; Barba, Edoardo; Martelli, Federico; Navigli, Roberto. - (2021), pp. 3915-3921. (Intervento presentato al convegno International Joint Conference on Artificial Intelligence tenutosi a Virtual, Online) [10.24963/ijcai.2021/539].

MultiMirror: Neural Cross-lingual Word Alignment for Multilingual Word Sense Disambiguation

Luigi Procopio
Primo
;
Edoardo Barba
Secondo
;
Federico Martelli
Penultimo
;
Roberto Navigli
Ultimo
2021

Abstract

Word Sense Disambiguation (WSD), ie, the task of assigning senses to words in context, has seen a surge of interest with the advent of neural models and a considerable increase in performance up to 80% F1 in English. However, when considering other languages, the availability of training data is limited, which hampers scaling WSD to many languages. To address this issue, we put forward MULTIMIRROR, a sense projection approach for multilingual WSD based on a novel neural discriminative model for word alignment: given as input a pair of parallel sentences, our model–trained with a low number of instances–is capable of jointly aligning, at the same time, all source and target tokens with each other, surpassing its competitors across several language combinations. We demonstrate that projecting senses from English by leveraging the alignments produced by our model leads a simple mBERT-powered classifier to achieve a new state of the art on established WSD datasets in French, German, Italian, Spanish and Japanese. We release our software and all our datasets at https://github. com/SapienzaNLP/multimirror.
2021
International Joint Conference on Artificial Intelligence
Word Sense Disambiguation; Cross-lingual Label Projection; Word Alignment
04 Pubblicazione in atti di convegno::04b Atto di convegno in volume
MultiMirror: Neural Cross-lingual Word Alignment for Multilingual Word Sense Disambiguation / Procopio, Luigi; Barba, Edoardo; Martelli, Federico; Navigli, Roberto. - (2021), pp. 3915-3921. (Intervento presentato al convegno International Joint Conference on Artificial Intelligence tenutosi a Virtual, Online) [10.24963/ijcai.2021/539].
File allegati a questo prodotto
File Dimensione Formato  
Procopio_MultiMirror_2021.pdf

accesso aperto

Tipologia: Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza: Creative commons
Dimensione 195.1 kB
Formato Adobe PDF
195.1 kB Adobe PDF

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1604122
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 12
  • ???jsp.display-item.citation.isi??? ND
social impact