We study fair clustering problems as proposed by Chierichetti et al. [CKLV17]. Here, points have a sensitive attribute and all clusters in the solution are required to be balanced with respect to it (to counteract any form of data-inherent bias). Previous algorithms for fair clustering do not scale well. We show how to model and compute so-called coresets for fair clustering problems, which can be used to significantly reduce the input data size. We prove that the coresets are composable [IMMM14] and show how to compute them in a streaming setting. This yields a streaming PTAS for fair k-means in the case of two colors (and exact balances). Furthermore, we extend techniques due to Chierichetti et al. [CKLV17] to obtain an approximation algorithm for k-means, which leads to a constant factor algorithm in the streaming model when combined with the coreset.

Fair Coresets and Streaming Algorithms for Fair k-means / Schmidt, M.; Schwiegelshohn, C.; Sohler, C.. - 11926:(2020), pp. 232-251. (Intervento presentato al convegno 17th International Workshop on Approximation and Online Algorithms, WAOA 2019 tenutosi a Munich; Germany) [10.1007/978-3-030-39479-0_16].

Fair Coresets and Streaming Algorithms for Fair k-means

Schwiegelshohn C.
;
Sohler C.
2020

Abstract

We study fair clustering problems as proposed by Chierichetti et al. [CKLV17]. Here, points have a sensitive attribute and all clusters in the solution are required to be balanced with respect to it (to counteract any form of data-inherent bias). Previous algorithms for fair clustering do not scale well. We show how to model and compute so-called coresets for fair clustering problems, which can be used to significantly reduce the input data size. We prove that the coresets are composable [IMMM14] and show how to compute them in a streaming setting. This yields a streaming PTAS for fair k-means in the case of two colors (and exact balances). Furthermore, we extend techniques due to Chierichetti et al. [CKLV17] to obtain an approximation algorithm for k-means, which leads to a constant factor algorithm in the streaming model when combined with the coreset.
2020
17th International Workshop on Approximation and Online Algorithms, WAOA 2019
Approximation algorithms; Algorithms; Facility location
04 Pubblicazione in atti di convegno::04b Atto di convegno in volume
Fair Coresets and Streaming Algorithms for Fair k-means / Schmidt, M.; Schwiegelshohn, C.; Sohler, C.. - 11926:(2020), pp. 232-251. (Intervento presentato al convegno 17th International Workshop on Approximation and Online Algorithms, WAOA 2019 tenutosi a Munich; Germany) [10.1007/978-3-030-39479-0_16].
File allegati a questo prodotto
File Dimensione Formato  
Schmidt_Fair_2020.pdf

solo gestori archivio

Tipologia: Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza: Tutti i diritti riservati (All rights reserved)
Dimensione 9.5 MB
Formato Adobe PDF
9.5 MB Adobe PDF   Contatta l'autore

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1390282
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 52
  • ???jsp.display-item.citation.isi??? 25
social impact