We present MultiWiBi, an approach to the automatic creation of two integrated taxonomies for Wikipedia pages and categories written in different languages. In order to create both taxonomies in an arbitrary language, we first build them in English and then project the two taxonomies to other languages automatically, without the help of language-specific resources or tools. The process crucially leverages a novel algorithm which exploits the information available in either one of the taxonomies to reinforce the creation of the other taxonomy. Our experiments show that the taxonomical information in MultiWiBi is characterized by a higher quality and coverage than state-of-the-art resources like DBpedia, YAGO, MENTA, WikiNet, LHD and WikiTaxonomy, also across languages. MultiWiBi is available online at http://wibitaxonomy.org/multiwibi.

MultiWiBi: The multilingual Wikipedia bitaxonomy project / Flati, Tiziano; Vannella, Daniele; Pasini, Tommaso; Navigli, Roberto. - In: ARTIFICIAL INTELLIGENCE. - ISSN 0004-3702. - STAMPA. - 241:December 2016(2016), pp. 66-102. [https://doi.org/10.1016/j.artint.2016.08.004]

MultiWiBi: The multilingual Wikipedia bitaxonomy project

FLATI, TIZIANO;VANNELLA, DANIELE;PASINI, TOMMASO;NAVIGLI, Roberto
2016

Abstract

We present MultiWiBi, an approach to the automatic creation of two integrated taxonomies for Wikipedia pages and categories written in different languages. In order to create both taxonomies in an arbitrary language, we first build them in English and then project the two taxonomies to other languages automatically, without the help of language-specific resources or tools. The process crucially leverages a novel algorithm which exploits the information available in either one of the taxonomies to reinforce the creation of the other taxonomy. Our experiments show that the taxonomical information in MultiWiBi is characterized by a higher quality and coverage than state-of-the-art resources like DBpedia, YAGO, MENTA, WikiNet, LHD and WikiTaxonomy, also across languages. MultiWiBi is available online at http://wibitaxonomy.org/multiwibi.
2016
taxonomy extraction; taxonomy induction; machine learning; natural language processing; collaborative resources; Wikipedia
01 Pubblicazione su rivista::01a Articolo in rivista
MultiWiBi: The multilingual Wikipedia bitaxonomy project / Flati, Tiziano; Vannella, Daniele; Pasini, Tommaso; Navigli, Roberto. - In: ARTIFICIAL INTELLIGENCE. - ISSN 0004-3702. - STAMPA. - 241:December 2016(2016), pp. 66-102. [https://doi.org/10.1016/j.artint.2016.08.004]
File allegati a questo prodotto
File Dimensione Formato  
Flati_MultiWiBi_2016.pdf

solo gestori archivio

Tipologia: Altro materiale allegato
Licenza: Tutti i diritti riservati (All rights reserved)
Dimensione 1.88 MB
Formato Adobe PDF
1.88 MB Adobe PDF   Contatta l'autore

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/960314
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 23
  • ???jsp.display-item.citation.isi??? 13
social impact