Role of Synaptic Stochasticity in Training Low-Precision Neural Networks

Carlo Baldassi, Federica Gerace, Hilbert J. Kappen, Carlo Lucibello, Luca Saglietti, Enzo Tartaglione, and Riccardo Zecchina

Phys. Rev. Lett. 120, 268103 – Published 29 June 2018

Abstract

Stochasticity and limited precision of synaptic weights in neural network models are key aspects of both biological and hardware modeling of learning processes. Here we show that a neural network model with stochastic binary weights naturally gives prominence to exponentially rare dense regions of solutions with a number of desirable properties such as robustness and good generalization performance, while typical solutions are isolated and hard to find. Binary solutions of the standard perceptron problem are obtained from a simple gradient descent procedure on a set of real values parametrizing a probability distribution over the binary synapses. Both analytical and numerical results are presented. An algorithmic extension that allows to train discrete deep neural networks is also investigated.

Received 27 October 2017
Revised 19 March 2018

DOI:https://doi.org/10.1103/PhysRevLett.120.268103

Physics Subject Headings (PhySH)

Artificial neural networks Disordered systems

Condensed Matter, Materials & Applied PhysicsStatistical Physics & ThermodynamicsNetworksGeneral Physics

Authors & Affiliations

Carlo Baldassi^1,2,3, Federica Gerace^2,4, Hilbert J. Kappen⁵, Carlo Lucibello^2,4, Luca Saglietti^2,4, Enzo Tartaglione^2,4, and Riccardo Zecchina^1,2,6

¹Bocconi Institute for Data Science and Analytics, Bocconi University, Milano 20136, Italy
²Italian Institute for Genomic Medicine, Torino 10126, Italy
³Istituto Nazionale di Fisica Nucleare, Sezione di Torino, Torino 10129, Italy
⁴Department of Applied Science and Technology, Politecnico di Torino, Torino 10129, Italy
⁵Radboud University Nijmegen, Donders Institute for Brain, Cognition and Behaviour 6525 EZ Nijmegen, Netherlands
⁶International Centre for Theoretical Physics, Trieste 34151, Italy