• Editors' Suggestion

Star-Shaped Space of Solutions of the Spherical Negative Perceptron

Brandon Livio Annesi, Clarissa Lauditi, Carlo Lucibello, Enrico M. Malatesta, Gabriele Perugini, Fabrizio Pittorino, and Luca Saglietti
Phys. Rev. Lett. 131, 227301 – Published 29 November 2023

Abstract

Empirical studies on the landscape of neural networks have shown that low-energy configurations are often found in complex connected structures, where zero-energy paths between pairs of distant solutions can be constructed. Here, we consider the spherical negative perceptron, a prototypical nonconvex neural network model framed as a continuous constraint satisfaction problem. We introduce a general analytical method for computing energy barriers in the simplex with vertex configurations sampled from the equilibrium. We find that in the overparametrized regime the solution manifold displays simple connectivity properties. There exists a large geodesically convex component that is attractive for a wide range of optimization dynamics. Inside this region we identify a subset of atypical high-margin solutions that are geodesically connected with most other solutions, giving rise to a star-shaped geometry. We analytically characterize the organization of the connected space of solutions and show numerical evidence of a transition, at larger constraint densities, where the aforementioned simple geodesic connectivity breaks down.

  • Figure
  • Figure
  • Figure
  • Figure
  • Figure
  • Received 17 May 2023
  • Revised 5 September 2023
  • Accepted 8 November 2023

DOI:https://doi.org/10.1103/PhysRevLett.131.227301

© 2023 American Physical Society

Physics Subject Headings (PhySH)

Statistical Physics & Thermodynamics

Authors & Affiliations

Brandon Livio Annesi1, Clarissa Lauditi2, Carlo Lucibello1,3, Enrico M. Malatesta1,3, Gabriele Perugini1, Fabrizio Pittorino4,3, and Luca Saglietti1,3

  • 1Department of Computing Sciences, Bocconi University, 20136 Milano, Italy
  • 2Department of Applied Science and Technology, Politecnico di Torino, 10129 Torino, Italy
  • 3Bocconi Institute for Data Science and Analytics, 20136 Milano, Italy
  • 4Department of Electronics, Information, and Bioengineering, Politecnico di Milano, 20125 Milano, Italy

Article Text (Subscription Required)

Click to Expand

Supplemental Material (Subscription Required)

Click to Expand

References (Subscription Required)

Click to Expand
Issue

Vol. 131, Iss. 22 — 1 December 2023

Reuse & Permissions
Access Options
Author publication services for translation and copyediting assistance advertisement

Authorization Required


×
×

Images

×

Sign up to receive regular email alerts from Physical Review Letters

Log In

Cancel
×

Search


Article Lookup

Paste a citation or DOI

Enter a citation
×