Abstract
We propose a systematic method based on reinforcement learning (RL) techniques to find the optimal path to minimize the total entropy production between two equilibrium states of open systems at the same temperature in a given fixed period. Benefiting from the generalization of the deep RL techniques, we provide a powerful tool to address this problem in quantum systems even with two-dimensional continuous controllable parameters. We successfully apply our method to the classical and quantum two-level systems.
- Received 4 November 2021
- Accepted 20 April 2022
DOI:https://doi.org/10.1103/PhysRevE.105.054123
©2022 American Physical Society