Research products catalogue

A review of data abstraction / Cima, G., Console, M., Lenzerini, M., Poggi, A. - In: FRONTIERS IN ARTIFICIAL INTELLIGENCE. - ISSN 2624-8212. - 6:(2023). [10.3389/frai.2023.1085754]

A review of data abstraction

Cima G.; Console M.; Lenzerini M.; Poggi A.

2023

Abstract

It is well known that Artificial Intelligence (AI), and in particular Machine Learning (ML), is not effective without good data preparation, as also pointed out by the recent wave of data-centric AI. Data preparation is the process of gathering, transforming and cleaning raw data prior to processing and analysis. Since data nowadays often reside in distributed and heterogeneous sources, the first activity of data preparation is collecting data from suitable data sources and data services. It is thus essential that providers describe their data services in a way that makes them compliant with the FAIR guiding principles, i.e., makes them automatically Findable, Accessible, Interoperable, and Reusable. The notion of data abstraction has been introduced exactly to meet this need. Abstraction is a kind of reverse engineering task that automatically provides a semantic characterization of a data service made available by a provider. The goal of this paper is to review the results obtained so far in data abstraction by presenting the formal framework for its definition, reporting on the decidability and complexity of the main theoretical problems concerning abstraction, and discussing open issues and interesting directions for future research.

Publication year: 2023
Keywords: abstraction; automated reasoning; data integration; data preparation; knowledge representation
Type: 01 Journal publication::01g Review article (Review)
Citation: A review of data abstraction / Cima, G., Console, M., Lenzerini, M., Poggi, A. - In: FRONTIERS IN ARTIFICIAL INTELLIGENCE. - ISSN 2624-8212. - 6:(2023). [10.3389/frai.2023.1085754]
Belongs to type: 01g Review article (Review)

Files attached to this product

File	Size	Format
Cima_A-review_2023.pdf (open access). Note: DOI 10.3389/frai.2023.1085754. Type: Publisher's version (published version with the publisher's layout). License: Creative Commons	308.81 kB	Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11573/1691314
