We introduce EXTASEM!, a novel approach for the automatic learning of lexical taxonomies from domain terminologies. First, we exploit a very large semantic network to collect thousands of in-domain textual definitions. Second, we extract (hyponym, hypernym) pairs from each definition with a CRF-based algorithm trained on manually validated data. Finally, we introduce a graph induction procedure which constructs a full-fledged taxonomy where each edge is weighted according to its domain pertinence. EXTASEM! achieves state-of-the-art results in the following taxonomy evaluation experiments: (1) hypernym discovery, (2) reconstructing gold standard taxonomies, and (3) taxonomy quality according to structural measures. We release weighted taxonomies for six domains for the use and scrutiny of the community.

ExTaSem! Extending, Taxonomizing and Semantifying Domain Terminologies / Espinosa Anke, Luis; Saggion, Horacio; Ronzano, Francesco; Navigli, Roberto. - ELECTRONIC. - (2016), pp. 2594-2600. (Paper presented at The Thirtieth AAAI Conference on Artificial Intelligence, held in Phoenix, Arizona, February 12-17).

ExTaSem! Extending, Taxonomizing and Semantifying Domain Terminologies

NAVIGLI, ROBERTO
2016

2016
The Thirtieth AAAI Conference on Artificial Intelligence
automatic learning; Natural Language Processing; lexical taxonomies
04 Publication in conference proceedings::04b Conference paper in volume
Files attached to this item
File  Size  Format
Espinosa_ExTaSem!_2016.pdf  310.94 kB  Adobe PDF

open access

Type: Publisher's version (published version with the publisher's layout)
License: All rights reserved

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11573/845290
Citations
  • PMC: ND
  • Scopus: 18
  • ISI: 1