Policy making has the strict requirement to rely on quantitative and high quality information. This paper will address the data quality issue for policy making by showing how to deal with Big Data quality in the different steps of a processing pipeline, with a focus on the integration of Big Data sources with traditional sources. In this respect, a relevant role is played by metadata and in particular by ontologies. Integration systems relying on ontologies enable indeed a formal quality evaluation of inaccuracy, inconsistency and incompleteness of integrated data. The paper will finally describe data confidentiality as a Big Data quality dimension, showing the main issues to be faced for its assurance.

My (Fair) Big Data / Catarci, Tiziana; Scannapieco, Monica; Console, Marco; Demetrescu, Camil. - STAMPA. - (2017), pp. 2974-2979. (Intervento presentato al convegno 5th IEEE International Conference on Big Data, Big Data 2017 tenutosi a Boston, Massachusetts; USA) [10.1109/BigData.2017.8258267].

My (Fair) Big Data

Catarci, Tiziana;Scannapieco, Monica;Console, Marco;Demetrescu, Camil
2017

Abstract

Policy making has the strict requirement to rely on quantitative and high quality information. This paper will address the data quality issue for policy making by showing how to deal with Big Data quality in the different steps of a processing pipeline, with a focus on the integration of Big Data sources with traditional sources. In this respect, a relevant role is played by metadata and in particular by ontologies. Integration systems relying on ontologies enable indeed a formal quality evaluation of inaccuracy, inconsistency and incompleteness of integrated data. The paper will finally describe data confidentiality as a Big Data quality dimension, showing the main issues to be faced for its assurance.
2017
5th IEEE International Conference on Big Data, Big Data 2017
Big Data;data integrity;meta data;ontologies (artificial intelligence);Big Data quality dimension;Big Data sources;data confidentiality;data quality issue;formal quality evaluation;high quality information;integrated data;integration systems;ontologies;policy making;processing pipeline;quantitative quality information;strict requirement;Big Data;Google;Metadata;Ontologies;Pipelines;Big Data confidentiality;Big Data pipeline;Quality-driven policies;ontology-based quality checking;cybersercurity
04 Pubblicazione in atti di convegno::04b Atto di convegno in volume
My (Fair) Big Data / Catarci, Tiziana; Scannapieco, Monica; Console, Marco; Demetrescu, Camil. - STAMPA. - (2017), pp. 2974-2979. (Intervento presentato al convegno 5th IEEE International Conference on Big Data, Big Data 2017 tenutosi a Boston, Massachusetts; USA) [10.1109/BigData.2017.8258267].
File allegati a questo prodotto
File Dimensione Formato  
Catarci_My-fair-big-data_2017.pdf

solo gestori archivio

Tipologia: Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza: Tutti i diritti riservati (All rights reserved)
Dimensione 914.7 kB
Formato Adobe PDF
914.7 kB Adobe PDF   Contatta l'autore

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1084780
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 9
  • ???jsp.display-item.citation.isi??? 5
social impact