Generative adversarial networks (GANs) have become widespread models for complex density estimation tasks such as image generation or image-to-image synthesis. At the same time, GAN training can suffer from stability or convergence problems that sometimes hinder effective deployment. In this paper we investigate whether GAN training can be improved by endowing the neural network models with activation functions that are more flexible than the commonly used rectified linear unit (or its variants). In particular, we evaluate training a deep convolutional GAN in which all hidden activation functions are replaced with a version of the kernel activation function (KAF), a recently proposed technique for learning non-parametric nonlinearities during the optimization process. In a thorough empirical evaluation on multiple image generation benchmarks, we show that the resulting architectures learn to generate visually pleasing images in a fraction of the number of epochs, eventually converging to a better solution, even when we equalize (or even lower) the number of free parameters. Overall, this points to the importance of investigating better and more flexible architectures in the context of GANs.
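
The abstract names the kernel activation function but does not define it. As a rough illustration only, the sketch below assumes the form usually given for a KAF: a weighted sum of Gaussian kernels centred on a fixed, evenly spaced dictionary, where only the mixing coefficients are learned. The dictionary size (20), its range ([-3, 3]), the bandwidth heuristic, and the names `make_dictionary` and `kaf` are all assumptions made for this example and are not taken from this record.

```python
import math


def make_dictionary(num_points=20, boundary=3.0):
    """Fixed, evenly spaced dictionary points in [-boundary, boundary].

    Both the size and the range are assumed values for illustration.
    """
    step = 2.0 * boundary / (num_points - 1)
    return [-boundary + i * step for i in range(num_points)]


def kaf(s, alpha, dictionary, gamma):
    """Evaluate f(s) = sum_i alpha[i] * exp(-gamma * (s - d_i)^2).

    `alpha` holds the mixing coefficients, which would be the trainable
    parameters of the activation; `dictionary` stays fixed.
    """
    return sum(a * math.exp(-gamma * (s - d) ** 2)
               for a, d in zip(alpha, dictionary))


dictionary = make_dictionary()
step = dictionary[1] - dictionary[0]

# Assumed bandwidth heuristic tied to the dictionary spacing.
gamma = 1.0 / (6.0 * step ** 2)

# Hand-picked coefficients for the demo; in a network they would be
# initialised and then updated by the optimizer like any other weight.
alpha = [0.0] * len(dictionary)
alpha[10] = 1.0

for s in (-1.0, 0.0, dictionary[10], 1.0):
    print(f"f({s:+.3f}) = {kaf(s, alpha, dictionary, gamma):.4f}")
```

With a single non-zero coefficient the function reduces to one Gaussian bump centred on the corresponding dictionary point. Changing several coefficients reshapes the nonlinearity without altering the layer's weights, which is the kind of flexibility the abstract refers to.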

Flexible generative adversarial networks with non-parametric activation functions / Grassucci, E.; Scardapane, S.; Comminiello, D.; Uncini, A. - (2021), pp. 67-77. - Smart Innovation, Systems and Technologies. [10.1007/978-981-15-5093-5_7].

Flexible generative adversarial networks with non-parametric activation functions

Grassucci E.; Scardapane S.; Comminiello D.; Uncini A.
2021

Series: Smart Innovation, Systems and Technologies
ISBN: 978-981-15-5092-8
ISBN: 978-981-15-5093-5
Keywords: activation function; generative adversarial network; image; neural network
Type: 02 Publication in volume::02a Chapter or Article
Files attached to this record
File: Grassucci_Flexible_2021.pdf
Access: archive managers only
Type: Publisher's version (published version with the publisher's layout)
License: All rights reserved
Size: 2.21 MB
Format: Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11573/1547542
Citations
  • Scopus 3