Proprietary data is a valuable asset used to develop predictive algorithms that benefit a wide range of users, including customers, business owners, and decision-makers. Consequently, there is a growing interest in developing safe and robust techniques for sharing, learning models, and distributing predictions across a wide spectrum of potential stakeholders. However, a structured process to assess the value of data assets, and thus enabling collaborations among stakeholders, remains largely unexplored. This is particularly challenging when the data to be shared has a networked structure, where increasing the shared data samples potentially connects information observed by different data owners, providing new knowledge that is unavailable to any data owner individually. Here, we propose E-GraDE, a framework that assists organizations in assessing the value of their networked data to better address graph machine learning tasks. This framework includes a step-by-step analysis of the requirements of different stakeholders, such as the accuracy or fairness requisites of the models, ensuring a fair evaluation process and stronger alignment in the development of a data federation consortium. Additionally, we propose an approach to estimate the value of networked data to be shared while disclosing only a small fraction of the original information. We support our approach with extensive computational experiments, analysing each part of it through simulated use cases.

Value is in the Eye of the Beholder: A Framework for an Equitable Graph Data Evaluation / Nerini, F. P.; Bajardi, P.; Panisson, A.. - (2024), pp. 467-479. (Intervento presentato al convegno 6th ACM Conference on Fairness, Accountability, and Transparency (ACM FAccT) tenutosi a Rio De Janeiro; Brasil) [10.1145/3630106.3658919].

Value is in the Eye of the Beholder: A Framework for an Equitable Graph Data Evaluation

Nerini F. P.
;
2024

Abstract

Proprietary data is a valuable asset used to develop predictive algorithms that benefit a wide range of users, including customers, business owners, and decision-makers. Consequently, there is a growing interest in developing safe and robust techniques for sharing, learning models, and distributing predictions across a wide spectrum of potential stakeholders. However, a structured process to assess the value of data assets, and thus enabling collaborations among stakeholders, remains largely unexplored. This is particularly challenging when the data to be shared has a networked structure, where increasing the shared data samples potentially connects information observed by different data owners, providing new knowledge that is unavailable to any data owner individually. Here, we propose E-GraDE, a framework that assists organizations in assessing the value of their networked data to better address graph machine learning tasks. This framework includes a step-by-step analysis of the requirements of different stakeholders, such as the accuracy or fairness requisites of the models, ensuring a fair evaluation process and stronger alignment in the development of a data federation consortium. Additionally, we propose an approach to estimate the value of networked data to be shared while disclosing only a small fraction of the original information. We support our approach with extensive computational experiments, analysing each part of it through simulated use cases.
2024
6th ACM Conference on Fairness, Accountability, and Transparency (ACM FAccT)
Dataset Evaluation; Graph Datasets; Graph Neural Networks; Shapley Values
04 Pubblicazione in atti di convegno::04b Atto di convegno in volume
Value is in the Eye of the Beholder: A Framework for an Equitable Graph Data Evaluation / Nerini, F. P.; Bajardi, P.; Panisson, A.. - (2024), pp. 467-479. (Intervento presentato al convegno 6th ACM Conference on Fairness, Accountability, and Transparency (ACM FAccT) tenutosi a Rio De Janeiro; Brasil) [10.1145/3630106.3658919].
File allegati a questo prodotto
File Dimensione Formato  
Nerini_Value_2024.pdf

accesso aperto

Note: https://dl.acm.org/doi/pdf/10.1145/3630106.3658919
Tipologia: Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza: Tutti i diritti riservati (All rights reserved)
Dimensione 3.18 MB
Formato Adobe PDF
3.18 MB Adobe PDF

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1725347
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
social impact