Recently, neural networks (NN) with an infinite number of layers have been introduced. Especially for these very large NN the training procedure is very expensive. Hence, there is interest to study their robustness with respect to input data to avoid unnecessarily retraining the network. Typically, model-based statistical inference methods, e.g. Bayesian neural networks, are used to quantify uncertainties. Here, we consider a special class of residual neural networks and we study the case, when the number of layers can be arbitrarily large. Then, kinetic theory allows to interpret the network as a dynamical system, described by a partial differential equation. We study the robustness of the mean-field neural network with respect to perturbations in initial data by applying UQ approaches on the loss functions.

Spectral methods to study the robustness of residual neural networks with infinite layers / Trimborn, Torsten; Gerster, Stephan; Visconti, Giuseppe. - In: FOUNDATIONS OF DATA SCIENCE. - ISSN 2639-8001. - 0:0(2019), pp. 0-0. [10.3934/fods.2020012]

Spectral methods to study the robustness of residual neural networks with infinite layers

Visconti, Giuseppe
2019

Abstract

Recently, neural networks (NN) with an infinite number of layers have been introduced. Especially for these very large NN the training procedure is very expensive. Hence, there is interest to study their robustness with respect to input data to avoid unnecessarily retraining the network. Typically, model-based statistical inference methods, e.g. Bayesian neural networks, are used to quantify uncertainties. Here, we consider a special class of residual neural networks and we study the case, when the number of layers can be arbitrarily large. Then, kinetic theory allows to interpret the network as a dynamical system, described by a partial differential equation. We study the robustness of the mean-field neural network with respect to perturbations in initial data by applying UQ approaches on the loss functions.
2019
Neural networks; kinetic equations; uncertainty quantification; stochastic Galerkin; machine learning application
01 Pubblicazione su rivista::01a Articolo in rivista
Spectral methods to study the robustness of residual neural networks with infinite layers / Trimborn, Torsten; Gerster, Stephan; Visconti, Giuseppe. - In: FOUNDATIONS OF DATA SCIENCE. - ISSN 2639-8001. - 0:0(2019), pp. 0-0. [10.3934/fods.2020012]
File allegati a questo prodotto
File Dimensione Formato  
Trimborn_Spectral-methods_2020.pdf

accesso aperto

Tipologia: Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza: Tutti i diritti riservati (All rights reserved)
Dimensione 2.16 MB
Formato Adobe PDF
2.16 MB Adobe PDF

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1553444
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 2
  • ???jsp.display-item.citation.isi??? 3
social impact