We consider a three-layer restricted Boltzmann machine, where the two visible layers (encoding for input and output, respectively) are made of binary neurons while the hidden layer is made of Gaussian neurons, and we show a formal equivalence with a Hopfield model. The machine architecture allows for different learning and operational modes: when all neurons are free to evolve we recover a standard Hopfield model whose size corresponds to the overall size of visible neurons; when input neurons are clamped we recover a Hopfield model, whose size corresponds to the size of the output layer, endowed with an external field as well as additional slow noise. The former stems from the signal provided by the input layer and tends to favour retrieval, the latter can be related to the statistical properties of the training set and tends to impair the retrieval performance of the network. We address this model by rigorous techniques, finding an explicit expression for its free-energy, whence a phase-diagram showing the performance of the system as parameters are tuned.

Learning and Retrieval Operational Modes for Three-Layer Restricted Boltzmann Machines / Agliari, E.; Sebastiani, G.. - In: JOURNAL OF STATISTICAL PHYSICS. - ISSN 0022-4715. - 185:2(2021). [10.1007/s10955-021-02841-y]

Learning and Retrieval Operational Modes for Three-Layer Restricted Boltzmann Machines

Agliari E.
;
Sebastiani G.
2021

Abstract

We consider a three-layer restricted Boltzmann machine, where the two visible layers (encoding for input and output, respectively) are made of binary neurons while the hidden layer is made of Gaussian neurons, and we show a formal equivalence with a Hopfield model. The machine architecture allows for different learning and operational modes: when all neurons are free to evolve we recover a standard Hopfield model whose size corresponds to the overall size of visible neurons; when input neurons are clamped we recover a Hopfield model, whose size corresponds to the size of the output layer, endowed with an external field as well as additional slow noise. The former stems from the signal provided by the input layer and tends to favour retrieval, the latter can be related to the statistical properties of the training set and tends to impair the retrieval performance of the network. We address this model by rigorous techniques, finding an explicit expression for its free-energy, whence a phase-diagram showing the performance of the system as parameters are tuned.
2021
Boltzmann machine; disordered systems; Hopfield model
01 Pubblicazione su rivista::01a Articolo in rivista
Learning and Retrieval Operational Modes for Three-Layer Restricted Boltzmann Machines / Agliari, E.; Sebastiani, G.. - In: JOURNAL OF STATISTICAL PHYSICS. - ISSN 0022-4715. - 185:2(2021). [10.1007/s10955-021-02841-y]
File allegati a questo prodotto
File Dimensione Formato  
Agliari_Learning_2021.pdf

accesso aperto

Tipologia: Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza: Tutti i diritti riservati (All rights reserved)
Dimensione 857.66 kB
Formato Adobe PDF
857.66 kB Adobe PDF

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1586700
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 4
  • ???jsp.display-item.citation.isi??? 2
social impact