By reinterpreting a robust discriminative classifier as Energy-based Model (EBM), we offer a new take on the dynamics of adversarial training (AT). Our analysis of the energy landscape during AT reveals that untargeted attacks generate adversarial images much more in-distribution (lower energy) than the original data from the point of view of the model. Conversely, we observe the opposite for targeted attacks. On the ground of our thorough analysis, we present new theoretical and practical results that show how interpreting AT energy dynamics unlocks a better understanding: (1) AT dynamic is governed by three phases and robust overfitting occurs in the third phase with a drastic divergence between natural and adversarial energies (2) by rewriting TRADES loss in terms of energies, we show that TRADES implicitly alleviates overfitting by means of aligning the natural energy with the adversarial one (3) we empirically show that all recent state-of-the-art robust classifiers are smoothing the energy landscape and we reconcile a variety of studies about understanding AT and weighting the loss function under the umbrella of EBMs. Motivated by rigorous evidence, we propose Weighted Energy Adversarial Training (WEAT), a novel sample weighting scheme that yields robust accuracy matching the state-of-the-art on multiple benchmarks such as CIFAR-10 and SVHN and going beyond in CIFAR-100 and Tiny-ImageNet. We further show that robust classifiers vary in the intensity and quality of their generative capabilities, and offer a simple method to push this capability, reaching a remarkable Inception Score (IS) and FID using a robust classifier without training for generative modeling. The code to reproduce our results is available at https://github.com/OmnAI-Lab/Robust-Classifiers-under-the-lens-of-EBM

Shedding More Light on Robust Classifiers under the lens of Energy-based Models / Mirza, Mujtaba Hussain; Briglia, Maria Rosaria; Beadini, Senad; Masi, Iacopo. - 15061:(2025), pp. 451-468. ( European Conference on Computer Vision Milan, Italy ) [10.1007/978-3-031-72646-0_26].

Shedding More Light on Robust Classifiers under the lens of Energy-based Models

Mujtaba Hussain Mirza
;
Maria Rosaria Briglia;Iacopo Masi
2025

Abstract

By reinterpreting a robust discriminative classifier as Energy-based Model (EBM), we offer a new take on the dynamics of adversarial training (AT). Our analysis of the energy landscape during AT reveals that untargeted attacks generate adversarial images much more in-distribution (lower energy) than the original data from the point of view of the model. Conversely, we observe the opposite for targeted attacks. On the ground of our thorough analysis, we present new theoretical and practical results that show how interpreting AT energy dynamics unlocks a better understanding: (1) AT dynamic is governed by three phases and robust overfitting occurs in the third phase with a drastic divergence between natural and adversarial energies (2) by rewriting TRADES loss in terms of energies, we show that TRADES implicitly alleviates overfitting by means of aligning the natural energy with the adversarial one (3) we empirically show that all recent state-of-the-art robust classifiers are smoothing the energy landscape and we reconcile a variety of studies about understanding AT and weighting the loss function under the umbrella of EBMs. Motivated by rigorous evidence, we propose Weighted Energy Adversarial Training (WEAT), a novel sample weighting scheme that yields robust accuracy matching the state-of-the-art on multiple benchmarks such as CIFAR-10 and SVHN and going beyond in CIFAR-100 and Tiny-ImageNet. We further show that robust classifiers vary in the intensity and quality of their generative capabilities, and offer a simple method to push this capability, reaching a remarkable Inception Score (IS) and FID using a robust classifier without training for generative modeling. The code to reproduce our results is available at https://github.com/OmnAI-Lab/Robust-Classifiers-under-the-lens-of-EBM
2025
European Conference on Computer Vision
robustness; adversarial training; energy-based models; deep learning
04 Pubblicazione in atti di convegno::04b Atto di convegno in volume
Shedding More Light on Robust Classifiers under the lens of Energy-based Models / Mirza, Mujtaba Hussain; Briglia, Maria Rosaria; Beadini, Senad; Masi, Iacopo. - 15061:(2025), pp. 451-468. ( European Conference on Computer Vision Milan, Italy ) [10.1007/978-3-031-72646-0_26].
File allegati a questo prodotto
File Dimensione Formato  
Mirza_preprint_Shedding_2025.pdf

accesso aperto

Note: https://link.springer.com/chapter/10.1007/978-3-031-72646-0_26
Tipologia: Documento in Pre-print (manoscritto inviato all'editore, precedente alla peer review)
Licenza: Tutti i diritti riservati (All rights reserved)
Dimensione 9.78 MB
Formato Adobe PDF
9.78 MB Adobe PDF
Mirza_Shedding_2025.pdf

solo gestori archivio

Tipologia: Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza: Tutti i diritti riservati (All rights reserved)
Dimensione 2.2 MB
Formato Adobe PDF
2.2 MB Adobe PDF   Contatta l'autore

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1717654
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
social impact