This paper presents a novel methodology for topic ontology learning from text documents. The proposed methodology, named OntoTermExtraction (Term Extraction for Ontology learning), is based on OntoGen, a semi-automated tool for topic ontology construction, upgraded by using an advanced terminology extraction tool in an iterative, semi-automated ontology construction process. This process consists of (a) document clustering to find the nodes in the topic ontology, (b) term extraction from document clusters, (c) populating the term vocabulary and keyword extraction, and (d) choosing the concept names by comparing the best-ranked terms with the extracted keywords. The approach was successfully used for generating the ontology of topics in Inductive Logic Programming, learned semi-automatically from papers indexed in the ILPnet2 publications database. © 2008 Springer Berlin Heidelberg.
Advancing Topic Ontology Learning trough term extraction / B., Fortuna; N., Lavrac; Velardi, Paola. - 5351:(2008), pp. 626-635. (Intervento presentato al convegno 10th Pacific Rim Int. Conf. On Artificial Intelligence (PRICAI2008 tenutosi a Hanoi; Viet Nam nel 15-19 december 2008) [10.1007/978-3-540-89197-0_57].
Advancing Topic Ontology Learning trough term extraction
VELARDI, Paola
2008
Abstract
This paper presents a novel methodology for topic ontology learning from text documents. The proposed methodology, named OntoTermExtraction (Term Extraction for Ontology learning), is based on OntoGen, a semi-automated tool for topic ontology construction, upgraded by using an advanced terminology extraction tool in an iterative, semi-automated ontology construction process. This process consists of (a) document clustering to find the nodes in the topic ontology, (b) term extraction from document clusters, (c) populating the term vocabulary and keyword extraction, and (d) choosing the concept names by comparing the best-ranked terms with the extracted keywords. The approach was successfully used for generating the ontology of topics in Inductive Logic Programming, learned semi-automatically from papers indexed in the ILPnet2 publications database. © 2008 Springer Berlin Heidelberg.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.