A step in establishing a Web community's knowledge domain involves collecting a glossary of domain-relevant terms that constitute the linguistic surface manifestation of domain concepts. TermExtractor and GlossExtractor are two two Web-mining-based applications that support glossary building by exploiting the Web's evolving nature to allow continuous updating of an emerging community's vocabulary. These tools acquire a glossary's two basic components, such as terms and definitions where the terms are harvested from domain text corpora and the definitions are extracted from different types of Web pages.
Mining the Web to create specialized glossaries / Velardi, Paola; Navigli, Roberto; P., D'Amadio. - In: IEEE INTELLIGENT SYSTEMS. - ISSN 1541-1672. - STAMPA. - 23:5(2008), pp. 18-25. [10.1109/mis.2008.88]
Mining the Web to create specialized glossaries
VELARDI, Paola;NAVIGLI, ROBERTO;
2008
Abstract
A step in establishing a Web community's knowledge domain involves collecting a glossary of domain-relevant terms that constitute the linguistic surface manifestation of domain concepts. TermExtractor and GlossExtractor are two two Web-mining-based applications that support glossary building by exploiting the Web's evolving nature to allow continuous updating of an emerging community's vocabulary. These tools acquire a glossary's two basic components, such as terms and definitions where the terms are harvested from domain text corpora and the definitions are extracted from different types of Web pages.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.