Sharing, discovering, and integrating data are crucial tasks that pose many challenges and open research directions. Data owners need to know what data consumers want, and data consumers need to find datasets that are satisfactory for their tasks. Several data market platforms, or data marketplaces (DMs), have been used so far to facilitate data transactions between data owners and customers. However, current DMs are mostly shop windows, where customers have to rely on metadata that owners manually curate to discover useful datasets, and there is no automated mechanism for owners to determine whether their data could be merged with other datasets to satisfy customers' desiderata. The availability of novel artificial intelligence techniques for data management has sparked a renewed interest in proposing new DMs that stray from this conventional paradigm and overcome its limitations. This paper envisions a conceptual framework called DataStreet, where DMs can create personalized datasets by combining available datasets and presenting summarized statistics to help users make informed decisions. In our framework, owners share some of their data with a trusted DM, and customers provide a dataset template to fuel content-based (rather than metadata-based) search queries. Upon each query, the DM creates a preview of the personalized dataset through a flexible use of dataset discovery, integration, and value measurement, while ensuring owners' fair treatment and preserving privacy. The previewed datasets might not be pre-defined in the DM and are finally materialized upon a successful transaction.

Bridging the Gap between Buyers and Sellers in Data Marketplaces with Personalized Datasets / Firmani, Donatella; Mathew, Jerin George; Santoro, Donatello; Simonini, Giovanni; Zecchini, Luca. - (2023), pp. 525-534. (Paper presented at the 31st Italian Symposium on Advanced Database Systems, held in Galzignano Terme, Italy).

Bridging the Gap between Buyers and Sellers in Data Marketplaces with Personalized Datasets

Donatella Firmani; Jerin George Mathew
2023

31st Italian Symposium on Advanced Database Systems
data market; data integration; fairness; privacy
04 Publication in conference proceedings::04b Conference paper in volume
Files attached to this item

Lembo_postprint_Bridging_2023.pdf
Access: repository administrators only
Type: Publisher's version (published version with the publisher's layout)
License: All rights reserved
Size: 933.02 kB
Format: Adobe PDF

Lembo_Bridging_2023.pdf
Access: open access
Type: Publisher's version (published version with the publisher's layout)
License: Creative Commons
Size: 1.16 MB
Format: Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11573/1687834