Reinforcement learning meets minority game: Toward optimal resource allocation

Si-Ping Zhang, Jia-Qi Dong, Li Liu, Zi-Gang Huang, Liang Huang, and Ying-Cheng Lai
Phys. Rev. E 99, 032302 – Published 6 March 2019

Abstract

The main point of this paper is to provide an affirmative answer through exploiting reinforcement learning (RL) in artificial intelligence (AI) for eliminating herding without any external control in complex resource allocation systems. In particular, we demonstrate that when agents are empowered with RL (e.g., the popular Q-learning algorithm in AI) in that they get familiar with the unknown game environment gradually and attempt to deliver the optimal actions to maximize the payoff, herding can effectively be eliminated. Furthermore, computations reveal the striking phenomenon that, regardless of the initial state, the system evolves persistently and relentlessly toward the optimal state in which all resources are used efficiently. However, the evolution process is not without interruptions: there are large fluctuations that occur but only intermittently in time. The statistical distribution of the time between two successive fluctuating events is found to depend on the parity of the evolution, i.e., whether the number of time steps in between is odd or even. We develop a physical analysis and derive mean-field equations to gain an understanding of these phenomena. Since AI is becoming increasingly widespread, we expect our RL empowered minority game system to have broad applications.

  • Figure
  • Figure
  • Figure
  • Figure
  • Figure
  • Received 20 July 2018

DOI:https://doi.org/10.1103/PhysRevE.99.032302

©2019 American Physical Society

Physics Subject Headings (PhySH)

Nonlinear DynamicsStatistical Physics & ThermodynamicsNetworks

Authors & Affiliations

Si-Ping Zhang1,2, Jia-Qi Dong2,3, Li Liu4, Zi-Gang Huang1,*, Liang Huang2, and Ying-Cheng Lai5

  • 1The Key Laboratory of Biomedical Information Engineering of Ministry of Education, The Key Laboratory of Neuro-informatics & Rehabilitation Engineering of Ministry of Civil Affairs, and Institute of Health and Rehabilitation Science, School of Life Science and Technology, Xi'an Jiaotong University, Xi'an 710049, China
  • 2Institute of Computational Physics and Complex Systems, Lanzhou University, Lanzhou 730000, China
  • 3Institute of Theoretical Physics, Key Laboratory of Theoretical Physics, Chinese Academy of Sciences, P.O. Box 2735, Beijing 100190, China
  • 4School of Software Engineering, Chongqing University, Chongqing 400044, People's Republic of China
  • 5School of Electrical, Computer and Energy Engineering, Department of Physics, Arizona State University, Tempe, Arizona 85287, USA

  • *huangzg@xjtu.edu.cn

Article Text (Subscription Required)

Click to Expand

References (Subscription Required)

Click to Expand
Issue

Vol. 99, Iss. 3 — March 2019

Reuse & Permissions
Access Options
CHORUS

Article Available via CHORUS

Download Accepted Manuscript
Author publication services for translation and copyediting assistance advertisement

Authorization Required


×
×

Images

×

Sign up to receive regular email alerts from Physical Review E

Log In

Cancel
×

Search


Article Lookup

Paste a citation or DOI

Enter a citation
×