In this paper, making use of the signal-flow-graph (SFG) representation and its known properties, we derive a new general method for backward gradient computation of a system output or cost function with respect to past (or present) system parameters. The system can be any causal, in general nonlinear and time-variant dynamic system represented by a SFG, in particular any feedforward or recurrent neural network. In this work we use discrete time notation, but the same theory holds for the continuous time case. The gradient is obtained by the analysis of two SFGs, the original one and its adjoint. This method can be used both for online and off-line learning. In the latter case using the mean square error cost function, our approach particularises to Wan's method (1996) that is not suited for online training of recurrent networks. Computer simulations of nonlinear dynamic systems identification will also be presented to assess the performance of the algorithm resulting from the application of the proposed method in the case of locally recurrent neural networks.

Signal-flow-graph derivation of on-line gradient learning algorithms / Campolucci, P; Marchegiani, A; Uncini, Aurelio; Piazza, F.. - 3:(1997), pp. 1884-1889. [10.1109/ICNN.1997.614186]

Signal-flow-graph derivation of on-line gradient learning algorithms

UNCINI, Aurelio;
1997

Abstract

In this paper, making use of the signal-flow-graph (SFG) representation and its known properties, we derive a new general method for backward gradient computation of a system output or cost function with respect to past (or present) system parameters. The system can be any causal, in general nonlinear and time-variant dynamic system represented by a SFG, in particular any feedforward or recurrent neural network. In this work we use discrete time notation, but the same theory holds for the continuous time case. The gradient is obtained by the analysis of two SFGs, the original one and its adjoint. This method can be used both for online and off-line learning. In the latter case using the mean square error cost function, our approach particularises to Wan's method (1996) that is not suited for online training of recurrent networks. Computer simulations of nonlinear dynamic systems identification will also be presented to assess the performance of the algorithm resulting from the application of the proposed method in the case of locally recurrent neural networks.
File allegati a questo prodotto
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/212625
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 5
  • ???jsp.display-item.citation.isi??? ND
social impact