Biclustering, also called co-clustering, block clustering, or two-way clustering, involves the simultaneous clustering of both the rows and the columns of a data matrix into distinct groups such that the rows and columns within a group display similar patterns. As a model problem for biclustering, we consider the k-densest disjoint biclique problem, whose goal is to identify k disjoint complete bipartite subgraphs (called bicliques) of a given weighted complete bipartite graph such that the sum of their densities is maximized. To address this problem, we present a tailored branch-and-cut algorithm. For the upper-bound routine, we consider a semidefinite programming relaxation and propose valid inequalities to strengthen the bound. We solve this relaxation in a cutting-plane fashion using a first-order method. For the lower bound, we design a maximum weight matching rounding procedure that exploits the solution of the relaxation solved at each node. Computational results on both synthetic and real-world instances show that the proposed algorithm can solve instances approximately 20 times larger than those handled by general-purpose solvers.

A Semidefinite Programming-Based Branch-and-Cut Algorithm for Biclustering / Sudoso, Antonio M.. - In: INFORMS JOURNAL ON COMPUTING. - ISSN 1091-9856. - (2024). [10.1287/ijoc.2024.0683]

A Semidefinite Programming-Based Branch-and-Cut Algorithm for Biclustering

Sudoso, Antonio M.
2024

Abstract

Biclustering, also called co-clustering, block clustering, or two-way clustering, involves the simultaneous clustering of both the rows and the columns of a data matrix into distinct groups such that the rows and columns within a group display similar patterns. As a model problem for biclustering, we consider the k-densest disjoint biclique problem, whose goal is to identify k disjoint complete bipartite subgraphs (called bicliques) of a given weighted complete bipartite graph such that the sum of their densities is maximized. To address this problem, we present a tailored branch-and-cut algorithm. For the upper-bound routine, we consider a semidefinite programming relaxation and propose valid inequalities to strengthen the bound. We solve this relaxation in a cutting-plane fashion using a first-order method. For the lower bound, we design a maximum weight matching rounding procedure that exploits the solution of the relaxation solved at each node. Computational results on both synthetic and real-world instances show that the proposed algorithm can solve instances approximately 20 times larger than those handled by general-purpose solvers.
2024
biclustering; semidefinite programming; branch and cut; global optimization
01 Pubblicazione su rivista::01a Articolo in rivista
A Semidefinite Programming-Based Branch-and-Cut Algorithm for Biclustering / Sudoso, Antonio M.. - In: INFORMS JOURNAL ON COMPUTING. - ISSN 1091-9856. - (2024). [10.1287/ijoc.2024.0683]
File allegati a questo prodotto
File Dimensione Formato  
Sudoso_A-Semidefinite_2024.pdf

solo gestori archivio

Tipologia: Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza: Tutti i diritti riservati (All rights reserved)
Dimensione 3.43 MB
Formato Adobe PDF
3.43 MB Adobe PDF   Contatta l'autore
Sudoso_preprint_An-SDP-based_2024.pdf

accesso aperto

Note: DOI10.1287/ijoc.2024.0683
Tipologia: Documento in Post-print (versione successiva alla peer review e accettata per la pubblicazione)
Licenza: Creative commons
Dimensione 853.5 kB
Formato Adobe PDF
853.5 kB Adobe PDF

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1732971
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? 1
social impact