Flexible generative adversarial networks with non-parametric activation functions

Grassucci, E.; Scardapane, S.; Comminiello, D.; Uncini, A.

doi:10.1007/978-981-15-5093-5_7

Generative adversarial networks (GANs) have become widespread models for complex density estimation tasks such as image generation or image-to-image synthesis. At the same time, training of GANs can suffer from several problems, either of stability or convergence, sometimes hindering their effective deployment. In this paper we investigate whether we can improve GAN training by endowing the neural network models with more flexible activation functions than the commonly used rectified linear unit (or its variants). In particular, we evaluate training a deep convolutional GAN in which all hidden activation functions are replaced with a version of the kernel activation function (KAF), a recently proposed technique for learning non-parametric nonlinearities during the optimization process. In a thorough empirical evaluation on multiple image generation benchmarks, we show that the resulting architectures learn to generate visually pleasing images in a fraction of the number of epochs, eventually converging to a better solution, even when we equalize (or even lower) the number of free parameters. Overall, this points to the importance of investigating better and more flexible architectures in the context of GANs.
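To make the idea of a kernel activation function concrete, the following is a minimal sketch of one scalar KAF, written as a weighted sum of Gaussian kernels centred on a fixed, evenly spaced dictionary. The abstract only says that "a version of" the KAF is used, so everything here is an assumption for illustration: the Gaussian kernel, the dictionary size and range, the bandwidth heuristic, and the function names `make_dictionary` and `kaf` are not taken from this record. In a real network the mixing coefficients `alpha` would be learned per neuron by gradient descent; here they are set by hand.

```python
import math

def make_dictionary(size=20, bound=3.0):
    """Return `size` evenly spaced points on [-bound, bound] (assumed layout)."""
    step = 2.0 * bound / (size - 1)
    return [-bound + i * step for i in range(size)]

def kaf(s, alpha, dictionary, gamma):
    """Evaluate f(s) = sum_i alpha_i * exp(-gamma * (s - d_i)^2) for a scalar s.

    alpha      : mixing coefficients (the trainable part in a real model)
    dictionary : fixed kernel centres d_i
    gamma      : Gaussian kernel bandwidth parameter
    """
    return sum(a * math.exp(-gamma * (s - d) ** 2)
               for a, d in zip(alpha, dictionary))

dictionary = make_dictionary()
spacing = dictionary[1] - dictionary[0]
# Assumed bandwidth heuristic tied to the dictionary spacing; not stated in this record.
gamma = 1.0 / (6.0 * spacing ** 2)

# Hand-picked coefficients: a single active kernel gives a Gaussian bump.
alpha = [0.0] * len(dictionary)
alpha[3] = 1.0
peak = kaf(dictionary[3], alpha, dictionary, gamma)
```

Because the shape of the function is controlled entirely by `alpha`, changing those coefficients changes the nonlinearity itself, which is the flexibility that a fixed rectified linear unit lacks.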

Flexible generative adversarial networks with non-parametric activation functions / Grassucci, E.; Scardapane, S.; Comminiello, D.; Uncini, A.. - (2021), pp. 67-77. - SMART INNOVATION, SYSTEMS AND TECHNOLOGIES. [10.1007/978-981-15-5093-5_7].

Publication year: 2021
Volume title: Smart Innovation, Systems and Technologies
ISBN: 978-981-15-5092-8; 978-981-15-5093-5
Keywords: activation function; generative adversarial network; image; neural network
Type: 02 Publication in volume::02a Chapter or Article

Files attached to this record

File	Size	Format
Grassucci_Flexible_2021.pdf (access: archive managers only; type: publisher's version, published with the publisher's layout; license: All rights reserved)	2.21 MB	Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11573/1547542
