Recently, generative approaches have been used effectively to provide definitions of words in their context. However, the opposite, i.e., generating a usage example given one or more words along with their definitions, has not yet been investigated. In this work, we introduce the novel task of Exemplification Modeling (ExMod), along with a sequence-to-sequence architecture and a training procedure for it. Starting from a set of (word, definition) pairs, our approach is capable of automatically generating high-quality sentences which express the requested semantics. As a result, we can drive the creation of sense-tagged data which cover the full range of meanings in any inventory of interest, and their interactions within sentences. Human annotators agree that the sentences generated are as fluent and semantically-coherent with the input definitions as the sentences in manually-annotated corpora. Indeed, when employed as training data for Word Sense Disambiguation, our examples enable the current state of the art to be outperformed, and higher results to be achieved than when using gold-standard datasets only. We release the pretrained model, the dataset and the software at https://github.com/SapienzaNLP/exmod.

Exemplification Modeling: Can You Give Me an Example, Please? / Barba, Edoardo; Procopio, Luigi; Lacerra, Caterina; Pasini, Tommaso; Navigli, Roberto. - (2021), pp. 3779-3785. (Paper presented at the International Joint Conference on Artificial Intelligence, held online) [10.24963/ijcai.2021/520].

Exemplification Modeling: Can You Give Me an Example, Please?

Barba, Edoardo; Procopio, Luigi; Lacerra, Caterina; Pasini, Tommaso; Navigli, Roberto
2021

International Joint Conference on Artificial Intelligence
Natural Language Processing; NLP; sequence-to-sequence; BART; Word Sense Disambiguation; WSD
04 Publication in conference proceedings::04b Conference paper in volume
Files attached to this item
File: Barba_Exemplification_2021.pdf
Access: open access
Type: Publisher's version (published version with the publisher's layout)
License: Creative Commons
Size: 311.38 kB
Format: Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11573/1604134
Citations
  • Scopus 13