Role of Synaptic Stochasticity in Training Low-Precision Neural Networks

Carlo Baldassi, Federica Gerace, Hilbert J. Kappen, Carlo Lucibello, Luca Saglietti, Enzo Tartaglione, and Riccardo Zecchina
Phys. Rev. Lett. 120, 268103 – Published 29 June 2018
PDFHTMLExport Citation

Abstract

Stochasticity and limited precision of synaptic weights in neural network models are key aspects of both biological and hardware modeling of learning processes. Here we show that a neural network model with stochastic binary weights naturally gives prominence to exponentially rare dense regions of solutions with a number of desirable properties such as robustness and good generalization performance, while typical solutions are isolated and hard to find. Binary solutions of the standard perceptron problem are obtained from a simple gradient descent procedure on a set of real values parametrizing a probability distribution over the binary synapses. Both analytical and numerical results are presented. An algorithmic extension that allows to train discrete deep neural networks is also investigated.

  • Figure
  • Figure
  • Received 27 October 2017
  • Revised 19 March 2018

DOI:https://doi.org/10.1103/PhysRevLett.120.268103

© 2018 American Physical Society

Physics Subject Headings (PhySH)

Condensed Matter, Materials & Applied PhysicsStatistical Physics & ThermodynamicsNetworksGeneral Physics

Authors & Affiliations

Carlo Baldassi1,2,3, Federica Gerace2,4, Hilbert J. Kappen5, Carlo Lucibello2,4, Luca Saglietti2,4, Enzo Tartaglione2,4, and Riccardo Zecchina1,2,6

  • 1Bocconi Institute for Data Science and Analytics, Bocconi University, Milano 20136, Italy
  • 2Italian Institute for Genomic Medicine, Torino 10126, Italy
  • 3Istituto Nazionale di Fisica Nucleare, Sezione di Torino, Torino 10129, Italy
  • 4Department of Applied Science and Technology, Politecnico di Torino, Torino 10129, Italy
  • 5Radboud University Nijmegen, Donders Institute for Brain, Cognition and Behaviour 6525 EZ Nijmegen, Netherlands
  • 6International Centre for Theoretical Physics, Trieste 34151, Italy

Article Text (Subscription Required)

Click to Expand

Supplemental Material (Subscription Required)

Click to Expand

References (Subscription Required)

Click to Expand
Issue

Vol. 120, Iss. 26 — 29 June 2018

Reuse & Permissions
Access Options
CHORUS

Article Available via CHORUS

Download Accepted Manuscript
Author publication services for translation and copyediting assistance advertisement

Authorization Required


×
×

Images

×

Sign up to receive regular email alerts from Physical Review Letters

Log In

Cancel
×

Search


Article Lookup

Paste a citation or DOI

Enter a citation
×