Catalogo dei prodotti della ricerca

Hypernymy relations (those where an hyponym term shares a "isa" relationship with his hypernym) play a key role for many Natural Language Processing (NLP) tasks, e.g. ontology learning, automatically building or extending knowledge bases, or word sense disambiguation and induction. In fact, such relations may provide the basis for the construction of more complex structures such as taxonomies, or be used as effective background knowledge for many word understanding applications. We present a publicly available database containing more than 400 million hypernymy relations we extracted from the CommonCrawl web corpus. We describe the infrastructure we developed to iterate over the web corpus for extracting the hypernymy relations and store them effectively into a large database. This collection of relations represents a rich source of knowledge and may be useful for many researchers. We offer the tuple dataset for public download and an Application Programming Interface (API) to help other researchers programmatically query the database.

A large database of hypernymy relations extracted from the web / Seitner, J.; Bizer, C.; Eckert, K.; Faralli, S.; Meusel, R.; Paulheim, H.; Ponzetto, S.. - (2016), pp. 360-367. ( 10th International Conference on Language Resources and Evaluation, LREC 2016 Grand Hotel Bernardin Conference Center, svn ).

A large database of hypernymy relations extracted from the web

Seitner J.^Co-primo;Bizer C.^Co-primo;Eckert K.^Co-primo;Faralli S.^Co-primo;Meusel R.^Co-primo;Paulheim H.^Co-primo;Ponzetto S.^Co-primo

2016

Abstract

Hypernymy relations (those where an hyponym term shares a "isa" relationship with his hypernym) play a key role for many Natural Language Processing (NLP) tasks, e.g. ontology learning, automatically building or extending knowledge bases, or word sense disambiguation and induction. In fact, such relations may provide the basis for the construction of more complex structures such as taxonomies, or be used as effective background knowledge for many word understanding applications. We present a publicly available database containing more than 400 million hypernymy relations we extracted from the CommonCrawl web corpus. We describe the infrastructure we developed to iterate over the web corpus for extracting the hypernymy relations and store them effectively into a large database. This collection of relations represents a rich source of knowledge and may be useful for many researchers. We offer the tuple dataset for public download and an Application Programming Interface (API) to help other researchers programmatically query the database.

Scheda breve

Scheda completa

	Anno di pubblicazione
	
				2016
			
	Nome convegno
	
				10th International Conference on Language Resources and Evaluation, LREC 2016
			
	Parole chiave
	
				Application Programming Interface; Big data; Hearst patterns; Hypernym extraction; Web crawling
			
	Tipologia
	
				04 Pubblicazione in atti di convegno::04b Atto di convegno in volume
			
	Citazione
	
				A large database of hypernymy relations extracted from the web / Seitner, J.; Bizer, C.; Eckert, K.; Faralli, S.; Meusel, R.; Paulheim, H.; Ponzetto, S.. - (2016), pp. 360-367. ( 10th International Conference on Language Resources and Evaluation, LREC 2016 Grand Hotel Bernardin Conference Center, svn ).
			
	Appartiene alla tipologia:
	
				04b Atto di convegno in volume

File allegati a questo prodotto

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1620653

Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni

ND

102

45

social impact