In several data-centric application domains, the need arises to extract valuable information from unstructured text documents. The recent paradigm of Ontology Mediated Information Extraction (OMIE) addresses this problem by taking into account the knowledge expressed by a domain ontology and reasoning over it to improve the quality of the extracted data. MASTRO SYSTEM-T is a novel tool for OMIE, developed by Sapienza University and IBM Almaden Research. In this work, we demonstrate its use for information extraction over real-world financial text documents from the U.S. EDGAR system.

Ontology Mediated Information Extraction with MASTRO SYSTEM-T / Lembo, Domenico; Li, Yunyao; Popa, Lucian; Qian, Kun; Scafoglieri, Federico. - 2721:(2020), pp. 256-261. (Paper presented at the International Semantic Web Conference, held in Athens, Greece).

Ontology Mediated Information Extraction with MASTRO SYSTEM-T

Lembo, Domenico; Scafoglieri, Federico
2020

International Semantic Web Conference
Information Extraction; Ontologies; Experiments
04 Publication in conference proceedings::04b Conference paper in volume
Files attached to this record
File: Lembo_Ontology_2020.pdf
Access: open access
Type: Publisher's version (published version with the publisher's layout)
License: Creative Commons
Size: 1.12 MB
Format: Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11573/1476963