Definition extraction is the task of automatically identifying definitional sentences within texts. The task has proven useful in many research areas including ontology learning, relation extraction and question answering. However, current approaches - mostly focused on lexicosyntactic patterns - suffer from both low recall and precision, as definitional sentences occur in highly variable syntactic structures. In this paper, we propose Word- Class Lattices (WCLs), a generalization of word lattices that we use to model textual definitions. Lattices are learned from a dataset of definitions from Wikipedia. Our method is applied to the task of definition and hypernym extraction and compares favorably to other pattern generalization methods proposed in the literature. © 2010 Association for Computational Linguistics.
Learning Word-Class Lattices for definition and hypernym extraction / Navigli, Roberto; Velardi, Paola. - STAMPA. - (2010), pp. 1318-1327. (Intervento presentato al convegno 48th Annual Meeting of the Association for Computational Linguistics, ACL 2010 tenutosi a Uppsala nel 11 July 2010 through 16 July 2010).
Learning Word-Class Lattices for definition and hypernym extraction
NAVIGLI, ROBERTO;VELARDI, Paola
2010
Abstract
Definition extraction is the task of automatically identifying definitional sentences within texts. The task has proven useful in many research areas including ontology learning, relation extraction and question answering. However, current approaches - mostly focused on lexicosyntactic patterns - suffer from both low recall and precision, as definitional sentences occur in highly variable syntactic structures. In this paper, we propose Word- Class Lattices (WCLs), a generalization of word lattices that we use to model textual definitions. Lattices are learned from a dataset of definitions from Wikipedia. Our method is applied to the task of definition and hypernym extraction and compares favorably to other pattern generalization methods proposed in the literature. © 2010 Association for Computational Linguistics.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.