The minimum sum-of-squares clustering (MSSC), or k-means type clustering, has been recently extended to exploit prior knowledge on the cardinality of each cluster. Such knowledge is used to increase performance as well as solution quality. In this paper, we propose a global optimization approach based on the branch-and-cut technique to solve the cardinality-constrained MSSC. For the lower bound routine, we use the semidefinite programming (SDP) relaxation recently proposed by Rujeerapaiboon et al. (SIAM J Optim 29(2):1211–1239, 2019). However, this relaxation can be used in a branch-and-cut method only for small-size instances. Therefore, we derive a new SDP relaxation that scales better with the instance size and the number of clusters. In both cases, we strengthen the bound by adding polyhedral cuts. Benefiting from a tailored branching strategy which enforces pairwise constraints, we reduce the complexity of the problems arising in the children nodes. For the upper bound, instead, we present a local search procedure that exploits the solution of the SDP relaxation solved at each node. Computational results show that the proposed algorithm globally solves, for the first time, real-world instances of size 10 times larger than those solved by state-of-the-art exact methods.

Global optimization for cardinality-constrained minimum sum-of-squares clustering via semidefinite programming / Piccialli, Veronica; Sudoso, Antonio M.. - In: MATHEMATICAL PROGRAMMING. - ISSN 1436-4646. - (2023). [10.1007/s10107-023-02021-8]

Global optimization for cardinality-constrained minimum sum-of-squares clustering via semidefinite programming

Veronica Piccialli
;
Antonio M. Sudoso
2023

Abstract

The minimum sum-of-squares clustering (MSSC), or k-means type clustering, has been recently extended to exploit prior knowledge on the cardinality of each cluster. Such knowledge is used to increase performance as well as solution quality. In this paper, we propose a global optimization approach based on the branch-and-cut technique to solve the cardinality-constrained MSSC. For the lower bound routine, we use the semidefinite programming (SDP) relaxation recently proposed by Rujeerapaiboon et al. (SIAM J Optim 29(2):1211–1239, 2019). However, this relaxation can be used in a branch-and-cut method only for small-size instances. Therefore, we derive a new SDP relaxation that scales better with the instance size and the number of clusters. In both cases, we strengthen the bound by adding polyhedral cuts. Benefiting from a tailored branching strategy which enforces pairwise constraints, we reduce the complexity of the problems arising in the children nodes. For the upper bound, instead, we present a local search procedure that exploits the solution of the SDP relaxation solved at each node. Computational results show that the proposed algorithm globally solves, for the first time, real-world instances of size 10 times larger than those solved by state-of-the-art exact methods.
2023
Global optimization ; Constrained clustering ; Semidefinite programming ; Branch-and-cut ; Distance geometry
01 Pubblicazione su rivista::01a Articolo in rivista
Global optimization for cardinality-constrained minimum sum-of-squares clustering via semidefinite programming / Piccialli, Veronica; Sudoso, Antonio M.. - In: MATHEMATICAL PROGRAMMING. - ISSN 1436-4646. - (2023). [10.1007/s10107-023-02021-8]
File allegati a questo prodotto
File Dimensione Formato  
Piccialli_Global_2023.pdf

accesso aperto

Note: https://link.springer.com/content/pdf/10.1007/s10107-023-02021-8
Tipologia: Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza: Creative commons
Dimensione 982.44 kB
Formato Adobe PDF
982.44 kB Adobe PDF

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1690986
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 2
  • ???jsp.display-item.citation.isi??? 2
social impact