• Open Access

Marvels and Pitfalls of the Langevin Algorithm in Noisy High-Dimensional Inference

Stefano Sarao Mannelli, Giulio Biroli, Chiara Cammarota, Florent Krzakala, Pierfrancesco Urbani, and Lenka Zdeborová
Phys. Rev. X 10, 011057 – Published 5 March 2020

Abstract

Gradient-descent-based algorithms and their stochastic versions have widespread applications in machine learning and statistical inference. In this work, we carry out an analytic study of the performance of the algorithm most commonly considered in physics, the Langevin algorithm, in the context of noisy high-dimensional inference. We employ the Langevin algorithm to sample the posterior probability measure for the spiked mixed matrix-tensor model. The typical behavior of this algorithm is described by a system of integrodifferential equations that we call the Langevin state evolution, whose solution is compared with the one of the state evolution of approximate message passing (AMP). Our results show that, remarkably, the algorithmic threshold of the Langevin algorithm is suboptimal with respect to the one given by AMP. This phenomenon is due to the residual glassiness present in that region of parameters. We also present a simple heuristic expression of the transition line, which appears to be in agreement with the numerical results.

  • Figure
  • Figure
  • Figure
  • Figure
  • Figure
  • Figure
  • Figure
15 More
  • Received 27 June 2019
  • Revised 22 November 2019
  • Accepted 6 January 2020

DOI:https://doi.org/10.1103/PhysRevX.10.011057

Published by the American Physical Society under the terms of the Creative Commons Attribution 4.0 International license. Further distribution of this work must maintain attribution to the author(s) and the published article’s title, journal citation, and DOI.

Published by the American Physical Society

Physics Subject Headings (PhySH)

Statistical Physics & Thermodynamics

Authors & Affiliations

Stefano Sarao Mannelli1, Giulio Biroli2, Chiara Cammarota3, Florent Krzakala4, Pierfrancesco Urbani1, and Lenka Zdeborová1

  • 1Université Paris-Saclay, CNRS, CEA, Institut de Physique Théorique, 91191, Gif-sur-Yvette, France
  • 2Laboratoire de Physique de l’Ecole Normale Supérieure ENS, Université PSL, CNRS, Sorbonne Université, Université Paris-Diderot, Sorbonne Paris Cité Paris, France
  • 3Department of Mathematics, King’s College London, Strand London WC2R 2LS, United Kingdom
  • 4Laboratoire de Physique Statistique, CNRS & Université Pierre & Marie Curie & Ecole Normale Supérieure & PSL Université, 75005 Paris, France

Popular Summary

Many tasks of current interest in science and technology are solved using machine-learning methods. While machine-learning methods work impressively well in many cases of practical interest, so far, theoretical understanding of the properties of the algorithms is poor. Our work is driven by the hope that better theoretical understanding will lead to an improvement of existing methods. We introduce a model for which the performance of one of the commonly considered algorithms, the Langevin algorithm, can be fully analyzed, and we locate the region of parameters for which the algorithm reaches optimal performance.

Using methods designed to study the dynamics of glassy materials, we write and analyze a closed set of equations describing the quality of the result found by the Langevin algorithm. We deduce a very simple formula for a threshold of signal-to-noise ratio below which the Langevin algorithm fails. Comparing it to the performance of an alternative algorithm known as approximate message passing, we show that, remarkably, the algorithmic threshold of the Langevin algorithm is largely suboptimal.

We hope that the insight gained in our work can be extended to a better understanding of similar “gradient-based” algorithms in models that are closer to currently used neural networks.

Key Image

Article Text

Click to Expand

References

Click to Expand
Issue

Vol. 10, Iss. 1 — January - March 2020

Subject Areas
Reuse & Permissions
Author publication services for translation and copyediting assistance advertisement

Authorization Required


×
×

Images

×

Sign up to receive regular email alerts from Physical Review X

Reuse & Permissions

It is not necessary to obtain permission to reuse this article or its components as it is available under the terms of the Creative Commons Attribution 4.0 International license. This license permits unrestricted use, distribution, and reproduction in any medium, provided attribution to the author(s) and the published article's title, journal citation, and DOI are maintained. Please note that some figures may have been included with permission from other third parties. It is your responsibility to obtain the proper permission from the rights holder directly for these figures.

×

Log In

Cancel
×

Search


Article Lookup

Paste a citation or DOI

Enter a citation
×