Despite the widespread use of gradient-based algorithms for optimizing high-dimensional non-convex functions, understanding their ability of finding good minima instead of being trapped in spurious ones remains to a large extent an open problem. Here we focus on gradient flow dynamics for phase retrieval from random measurements. When the ratio of the number of measurements over the input dimension is small the dynamics remains trapped in spurious minima with large basins of attraction. We find analytically that above a critical ratio those critical points become unstable developing a negative direction toward the signal. By numerical experiments we show that in this regime the gradient flow algorithm is not trapped; it drifts away from the spurious critical points along the unstable direction and succeeds in finding the global minimum. Using tools from statistical physics we characterize this phenomenon, which is related to a BBP-type transition in the Hessian of the spurious minima.

Complex Dynamics in Simple Neural Networks: Understanding Gradient Flow in Phase Retrieval / Mannelli Stefano, Sarao; Biroli, Giulio; Cammarota, Chiara; Krzakala, Florent; Urbani, Pierfrancesco; Zdeborová, Lenka. - (2020). (Intervento presentato al convegno 2020 Conference on Neural Information Processing Systems-NeurIPS 2020 tenutosi a Vancouver Canada (online participation only)).

Complex Dynamics in Simple Neural Networks: Understanding Gradient Flow in Phase Retrieval

Chiara Cammarota
Supervision
;
2020

Abstract

Despite the widespread use of gradient-based algorithms for optimizing high-dimensional non-convex functions, understanding their ability of finding good minima instead of being trapped in spurious ones remains to a large extent an open problem. Here we focus on gradient flow dynamics for phase retrieval from random measurements. When the ratio of the number of measurements over the input dimension is small the dynamics remains trapped in spurious minima with large basins of attraction. We find analytically that above a critical ratio those critical points become unstable developing a negative direction toward the signal. By numerical experiments we show that in this regime the gradient flow algorithm is not trapped; it drifts away from the spurious critical points along the unstable direction and succeeds in finding the global minimum. Using tools from statistical physics we characterize this phenomenon, which is related to a BBP-type transition in the Hessian of the spurious minima.
2020
2020 Conference on Neural Information Processing Systems-NeurIPS 2020
inference, algorithms, risk landscape
04 Pubblicazione in atti di convegno::04b Atto di convegno in volume
Complex Dynamics in Simple Neural Networks: Understanding Gradient Flow in Phase Retrieval / Mannelli Stefano, Sarao; Biroli, Giulio; Cammarota, Chiara; Krzakala, Florent; Urbani, Pierfrancesco; Zdeborová, Lenka. - (2020). (Intervento presentato al convegno 2020 Conference on Neural Information Processing Systems-NeurIPS 2020 tenutosi a Vancouver Canada (online participation only)).
File allegati a questo prodotto
File Dimensione Formato  
Sarao Mannelli_Complex Dynamics_2020.pdf

accesso aperto

Tipologia: Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza: Tutti i diritti riservati (All rights reserved)
Dimensione 1.22 MB
Formato Adobe PDF
1.22 MB Adobe PDF
Mannelli_Complex_2020.pdf

accesso aperto

Tipologia: Documento in Pre-print (manoscritto inviato all'editore, precedente alla peer review)
Licenza: Creative commons
Dimensione 602.16 kB
Formato Adobe PDF
602.16 kB Adobe PDF

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1472265
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 9
  • ???jsp.display-item.citation.isi??? ND
social impact