We offer a method for one-shot mask-guided image synthesis that allows controlling manipulations of a single image by inverting a quasi-robust classifier equipped with strong regularizers. Our proposed method, entitled MAGIC, leverages structured gradients from a pre-trained quasi-robust classifier to better preserve the input semantics while preserving its classification accuracy, thereby guaranteeing credibility in the synthesis. Unlike current methods that use complex primitives to supervise the process or use attention maps as a weak supervisory signal, MAGIC aggregates gradients over the input, driven by a guide binary mask that enforces a strong, spatial prior. MAGIC implements a series of manipulations with a single framework achieving shape and location control, intense non-rigid shape deformations, and copy/move operations in the presence of repeating objects and gives users firm control over the synthesis by requiring to simply specify binary guide masks. Our study and findings are supported by various qualitative comparisons with the state-of-the-art on the same images sampled from ImageNet and quantitative analysis using machine perception along with a user survey of 100+ participants that endorse our synthesis quality.

MAGIC: Mask-Guided Image Synthesis by Inverting a Quasi-Robust Classifier / Rouhsedaghat, Mozhdeh; Monajatipoor, Masoud; Jay Kuo, C. -C.; Masi, Iacopo. - (2023). (Intervento presentato al convegno Thirty-Seventh AAAI Conference on Artificial Intelligence (AAAI-22) tenutosi a Washington DC, USA).

MAGIC: Mask-Guided Image Synthesis by Inverting a Quasi-Robust Classifier

Iacopo Masi
Ultimo
Supervision
2023

Abstract

We offer a method for one-shot mask-guided image synthesis that allows controlling manipulations of a single image by inverting a quasi-robust classifier equipped with strong regularizers. Our proposed method, entitled MAGIC, leverages structured gradients from a pre-trained quasi-robust classifier to better preserve the input semantics while preserving its classification accuracy, thereby guaranteeing credibility in the synthesis. Unlike current methods that use complex primitives to supervise the process or use attention maps as a weak supervisory signal, MAGIC aggregates gradients over the input, driven by a guide binary mask that enforces a strong, spatial prior. MAGIC implements a series of manipulations with a single framework achieving shape and location control, intense non-rigid shape deformations, and copy/move operations in the presence of repeating objects and gives users firm control over the synthesis by requiring to simply specify binary guide masks. Our study and findings are supported by various qualitative comparisons with the state-of-the-art on the same images sampled from ImageNet and quantitative analysis using machine perception along with a user survey of 100+ participants that endorse our synthesis quality.
2023
Thirty-Seventh AAAI Conference on Artificial Intelligence (AAAI-22)
deep learning; model inversion; image synthesis
04 Pubblicazione in atti di convegno::04b Atto di convegno in volume
MAGIC: Mask-Guided Image Synthesis by Inverting a Quasi-Robust Classifier / Rouhsedaghat, Mozhdeh; Monajatipoor, Masoud; Jay Kuo, C. -C.; Masi, Iacopo. - (2023). (Intervento presentato al convegno Thirty-Seventh AAAI Conference on Artificial Intelligence (AAAI-22) tenutosi a Washington DC, USA).
File allegati a questo prodotto
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1664202
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact