Biclustering, also called co-clustering, block clustering, or two-way clustering, involves the simultaneous clustering of both the rows and the columns of a data matrix into distinct groups such that the rows and columns within a group display similar patterns. As a model problem for biclustering, we consider the k-densest disjoint biclique problem, whose goal is to identify k disjoint complete bipartite subgraphs (called bicliques) of a given weighted complete bipartite graph such that the sum of their densities is maximized. To address this problem, we present a tailored branch-and-cut algorithm. For the upper-bound routine, we consider a semidefinite programming relaxation and propose valid inequalities to strengthen the bound. We solve this relaxation in a cutting-plane fashion using a first-order method. For the lower bound, we design a maximum weight matching rounding procedure that exploits the solution of the relaxation solved at each node. Computational results on both synthetic and real-world instances show that the proposed algorithm can solve instances approximately 20 times larger than those handled by general-purpose solvers.
A Semidefinite Programming-Based Branch-and-Cut Algorithm for Biclustering / Sudoso, Antonio M.. - In: INFORMS JOURNAL ON COMPUTING. - ISSN 1091-9856. - (2024). [10.1287/ijoc.2024.0683]
A Semidefinite Programming-Based Branch-and-Cut Algorithm for Biclustering
Sudoso, Antonio M.
2024
Abstract
Biclustering, also called co-clustering, block clustering, or two-way clustering, involves the simultaneous clustering of both the rows and the columns of a data matrix into distinct groups such that the rows and columns within a group display similar patterns. As a model problem for biclustering, we consider the k-densest disjoint biclique problem, whose goal is to identify k disjoint complete bipartite subgraphs (called bicliques) of a given weighted complete bipartite graph such that the sum of their densities is maximized. To address this problem, we present a tailored branch-and-cut algorithm. For the upper-bound routine, we consider a semidefinite programming relaxation and propose valid inequalities to strengthen the bound. We solve this relaxation in a cutting-plane fashion using a first-order method. For the lower bound, we design a maximum weight matching rounding procedure that exploits the solution of the relaxation solved at each node. Computational results on both synthetic and real-world instances show that the proposed algorithm can solve instances approximately 20 times larger than those handled by general-purpose solvers.| File | Dimensione | Formato | |
|---|---|---|---|
|
Sudoso_A-Semidefinite_2024.pdf
solo gestori archivio
Tipologia:
Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza:
Tutti i diritti riservati (All rights reserved)
Dimensione
3.43 MB
Formato
Adobe PDF
|
3.43 MB | Adobe PDF | Contatta l'autore |
|
Sudoso_preprint_An-SDP-based_2024.pdf
accesso aperto
Note: DOI10.1287/ijoc.2024.0683
Tipologia:
Documento in Post-print (versione successiva alla peer review e accettata per la pubblicazione)
Licenza:
Creative commons
Dimensione
853.5 kB
Formato
Adobe PDF
|
853.5 kB | Adobe PDF |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.


