Catalogo dei prodotti della ricerca

Word Sense Disambiguation (WSD) is the task of associating a word in context with one of its meanings. While many works in the past have focused on raising the state of the art, none has even come close to achieving an F-score in the 80% ballpark when using WordNet as its sense inventory. We contend that one of the main reasons for this failure is the excessively fine granularity of this inventory, resulting in senses that are hard to differentiate between, even for an experienced human annotator. In this paper we cope with this long-standing problem by introducing Coarse Sense Inventory (CSI), obtained by linking WordNet concepts to a new set of 45 labels. The results show that the coarse granularity of CSI leads a WSD model to achieve 85.9% F1, while maintaining a high expressive power. Our set of labels also exhibits ease of use in tagging and a descriptiveness that other coarse inventories lack, as demonstrated in two annotation tasks which we performed. Moreover, a few-shot evaluation proves that the class-based nature of CSI allows the model to generalise over unseen or under-represented word

CSI: a Coarse Sense Inventory for 85% Word Sense Disambiguation / Lacerra, Caterina; Bevilacqua, Michele; Pasini, Tommaso; Navigli, Roberto. - (2020), pp. 8123-8130. (Intervento presentato al convegno National Conference of the American Association for Artificial Intelligence tenutosi a New York, NY) [10.1609/aaai.v34i05.6324].

CSI: a Coarse Sense Inventory for 85% Word Sense Disambiguation

Caterina Lacerra;Michele Bevilacqua;Tommaso Pasini;Roberto Navigli

2020

Abstract

Word Sense Disambiguation (WSD) is the task of associating a word in context with one of its meanings. While many works in the past have focused on raising the state of the art, none has even come close to achieving an F-score in the 80% ballpark when using WordNet as its sense inventory. We contend that one of the main reasons for this failure is the excessively fine granularity of this inventory, resulting in senses that are hard to differentiate between, even for an experienced human annotator. In this paper we cope with this long-standing problem by introducing Coarse Sense Inventory (CSI), obtained by linking WordNet concepts to a new set of 45 labels. The results show that the coarse granularity of CSI leads a WSD model to achieve 85.9% F1, while maintaining a high expressive power. Our set of labels also exhibits ease of use in tagging and a descriptiveness that other coarse inventories lack, as demonstrated in two annotation tasks which we performed. Moreover, a few-shot evaluation proves that the class-based nature of CSI allows the model to generalise over unseen or under-represented word

Scheda breve

Scheda completa

	Anno di pubblicazione
	
				2020
			
	Nome convegno
	
				National Conference of the American Association for Artificial Intelligence
			
	Parole chiave
	
				word sense disambiguation; natural language processing; deep learning
			
	Tipologia
	
				04 Pubblicazione in atti di convegno::04b Atto di convegno in volume
			
	Citazione
	
				CSI: a Coarse Sense Inventory for 85% Word Sense Disambiguation / Lacerra, Caterina; Bevilacqua, Michele; Pasini, Tommaso; Navigli, Roberto. - (2020), pp. 8123-8130. (Intervento presentato al  convegno National Conference of the American Association for Artificial Intelligence tenutosi a New York, NY) [10.1609/aaai.v34i05.6324].
			
	Appartiene alla tipologia:
	
				04b Atto di convegno in volume

File allegati a questo prodotto

File	Dimensione	Formato
Lacerra_Csi_2020.pdf accesso aperto Tipologia: Versione editoriale (versione pubblicata con il layout dell'editore) Licenza: Tutti i diritti riservati (All rights reserved) Dimensione 514.64 kB Formato Adobe PDF	514.64 kB	Adobe PDF

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1351480

Citazioni

ND

22

14

social impact