In this paper we present a graph-based approach aimed at learning a lexical taxonomy automatically starting from a domain corpus and the Web. Unlike many taxonomy learning approaches in the literature, our novel algorithm learns both concepts and relations entirely from scratch via the automated extraction of terms, definitions and hypernyms. This results in a very dense, cyclic and possibly disconnected hypernym graph. The algorithm then induces a taxonomy from the graph. Our experiments show that we obtain high-quality results, both when building brand-new taxonomies and when reconstructing WordNet sub-hierarchies.
A graph-based algorithm for inducing lexical taxonomies from scratch / Navigli, Roberto; Velardi, Paola; Faralli, Stefano. - STAMPA. - (2011), pp. 1872-1877. (Intervento presentato al convegno 22nd International Joint Conference on Artificial Intelligence, IJCAI 2011 tenutosi a Barcelona, Catalonia nel 16 July 2011 through 22 July 2011) [10.5591/978-1-57735-516-8/ijcai11-313].
A graph-based algorithm for inducing lexical taxonomies from scratch
NAVIGLI, ROBERTO;VELARDI, Paola;FARALLI, Stefano
2011
Abstract
In this paper we present a graph-based approach aimed at learning a lexical taxonomy automatically starting from a domain corpus and the Web. Unlike many taxonomy learning approaches in the literature, our novel algorithm learns both concepts and relations entirely from scratch via the automated extraction of terms, definitions and hypernyms. This results in a very dense, cyclic and possibly disconnected hypernym graph. The algorithm then induces a taxonomy from the graph. Our experiments show that we obtain high-quality results, both when building brand-new taxonomies and when reconstructing WordNet sub-hierarchies.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.