The complexity of boolean failure identification

Galesi, N.; Ranjbar, F.

doi:10.1016/j.tcs.2025.115536

We consider the problem of identifying failure nodes in networks under the Boolean Network Tomography (BNT) approach, which is based on end-to-end measurements routed in a network along paths and producing a boolean (failure/not-failure) outcome. Such end-to-end measurements paths are usually described by an incidence boolean matrix M with m rows (the measurements paths) and n columns (the nodes of the network). A key notion used in practice in this approach is that of k-identifiability. Loosely speaking, a set of m boolean measurements paths over n nodes is k-identifiable, where k is a non-negative integer, if, whenever there are fewer than k+1 failures, it is always possible to identify unambiguously and uniquely which nodes are failing. Following the focus of some recent results analyzing maximal identifiability from a theoretical point of view, this work establishes the complexity of the optimization problem that determines the maximal k for which a set of measurement paths is k-identifiable (MID). We prove that such problem is NP-hard by a reduction from the Minimum Hitting Set problem and we prove that its decision version is in NP. We further consider the following extremal combinatoric question, which is also of practical relevance: given the number n of nodes of the network and a non-negative integer value k for the identifiability, what is the minimal number m of measurement paths over the n nodes to consider in such a way that the maximal identifiability is at least k? A folklore result shows that to have maximal identifiability at least 1, then m≥log(n+1) (or, equivalently, that if n>2m−1, then the maximal identifiability is less than or equal 0). In this work we answer this question for each n∈N and for each k≥2, proving that, there is constant C such that if n>Cm1+[Formula presented], then the maximal identifiability value is strictly smaller than k (and when k=2, n>Cmm suffices). Finally, we study upper and lower bounds on the number of unambiguously identifiable nodes, introducing new identifiability conditions which strictly imply and are strictly implied by unambiguous identifiability. We use these new conditions to design algorithmic heuristics to count defective nodes in a fine-grained way. In particular we introduce a random model to study lower bounds on the number of unambiguously identifiable defective nodes and we use this model to estimate lower bounds on the number of identifiable nodes on real networks by a maximum likelihood estimate approach.

The complexity of boolean failure identification / Galesi, N., Ranjbar, F.. - In: THEORETICAL COMPUTER SCIENCE. - ISSN 0304-3975. - 1056:21 November(2025). [10.1016/j.tcs.2025.115536]

The complexity of boolean failure identification

Galesi N.^{Primo

Membro del Collaboration Group};

2025

Abstract

We consider the problem of identifying failure nodes in networks under the Boolean Network Tomography (BNT) approach, which is based on end-to-end measurements routed in a network along paths and producing a boolean (failure/not-failure) outcome. Such end-to-end measurements paths are usually described by an incidence boolean matrix M with m rows (the measurements paths) and n columns (the nodes of the network). A key notion used in practice in this approach is that of k-identifiability. Loosely speaking, a set of m boolean measurements paths over n nodes is k-identifiable, where k is a non-negative integer, if, whenever there are fewer than k+1 failures, it is always possible to identify unambiguously and uniquely which nodes are failing. Following the focus of some recent results analyzing maximal identifiability from a theoretical point of view, this work establishes the complexity of the optimization problem that determines the maximal k for which a set of measurement paths is k-identifiable (MID). We prove that such problem is NP-hard by a reduction from the Minimum Hitting Set problem and we prove that its decision version is in NP. We further consider the following extremal combinatoric question, which is also of practical relevance: given the number n of nodes of the network and a non-negative integer value k for the identifiability, what is the minimal number m of measurement paths over the n nodes to consider in such a way that the maximal identifiability is at least k? A folklore result shows that to have maximal identifiability at least 1, then m≥log(n+1) (or, equivalently, that if n>2m−1, then the maximal identifiability is less than or equal 0). In this work we answer this question for each n∈N and for each k≥2, proving that, there is constant C such that if n>Cm1+[Formula presented], then the maximal identifiability value is strictly smaller than k (and when k=2, n>Cmm suffices). Finally, we study upper and lower bounds on the number of unambiguously identifiable nodes, introducing new identifiability conditions which strictly imply and are strictly implied by unambiguous identifiability. We use these new conditions to design algorithmic heuristics to count defective nodes in a fine-grained way. In particular we introduce a random model to study lower bounds on the number of unambiguously identifiable defective nodes and we use this model to estimate lower bounds on the number of identifiable nodes on real networks by a maximum likelihood estimate approach.

Scheda breve

Scheda completa

	Anno di pubblicazione
	
				2025
			
	Parole chiave
	
				Boolean failure identification; Boolean network tomography; Group testing; NP-hardness
			
	Tipologia
	
				01 Pubblicazione su rivista::01a Articolo in rivista
			
	Citazione
	
				The complexity of boolean failure identification / Galesi, N., Ranjbar, F.. - In: THEORETICAL COMPUTER SCIENCE. - ISSN 0304-3975. - 1056:21 November(2025). [10.1016/j.tcs.2025.115536]
			
	Appartiene alla tipologia:
	
				01a Articolo in rivista

File allegati a questo prodotto

File	Dimensione	Formato
Galesi_The-complexity_2025.pdf accesso aperto Note: https://doi.org/10.1016/j.tcs.2025.115536 Tipologia: Versione editoriale (versione pubblicata con il layout dell'editore) Licenza: Creative commons Dimensione 3.61 MB Formato Adobe PDF	3.61 MB	Adobe PDF

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1767069

Citazioni

ND

0

0

Catalogo dei prodotti della ricerca