Abstract
The efficient calculation of rare-event kinetics in complex dynamical systems, such as the rate and pathways of ligand dissociation from a protein, is a generally unsolved problem. Markov state models can systematically integrate ensembles of short simulations and thus effectively parallelize the computational effort, but the rare events of interest still need to be spontaneously sampled in the data. Enhanced sampling approaches, such as parallel tempering or umbrella sampling, can accelerate the computation of equilibrium expectations massively, but sacrifice the ability to compute dynamical expectations. In this work we establish a principle to combine knowledge of the equilibrium distribution with kinetics from fast “downhill” relaxation trajectories using reversible Markov models. This approach is general, as it does not invoke any specific dynamical model and can provide accurate estimates of the rare-event kinetics. Large gains in sampling efficiency can be achieved whenever one direction of the process occurs more rapidly than its reverse, making the approach especially attractive for downhill processes such as folding and binding in biomolecules. Our method is implemented in the PyEMMA software.
7 More- Received 24 September 2014
DOI:https://doi.org/10.1103/PhysRevX.6.011009
This article is available under the terms of the Creative Commons Attribution 3.0 License. Further distribution of this work must maintain attribution to the author(s) and the published article’s title, journal citation, and DOI.
Published by the American Physical Society
Popular Summary
The dynamics of proteins—the biomolecules central to the functioning of living organisms—are often characterized by infrequent, large-scale transitions between two or more overall geometric structures. Between these rare transition events, the protein only exhibits small fluctuations around a definite overall structure. Precise information about the probabilities of this stochastic switching behavior is very useful for understanding the role or the function of a specific protein. However, obtaining this information is unfortunately very difficult; the required length and time scales cannot be resolved simultaneously in an experiment. Here, we show that it is possible to combine standard simulations and a second class of simulations, from which the interesting switching probabilities cannot be directly extracted, to obtain reliable values with reasonable computational times.
Simulating the dynamics of a protein on a computer can resolve the system on atomistic length and time scales, but the rare-event nature of the interesting transitions requires impossibly long running times in order to obtain reliable information about the relevant quantities. Here, we conduct simulations of “downhill” relaxation paths. We focus on an alanine-dipeptide molecule, which is commonly used to study molecular dynamics, and we also simulate the motion of a Brownian particle. We are able to compute the kinetics of rare events using simulations that are between 10 and 1 million times faster than conventional simulations that wait for rare events to occur.
We anticipate that our approach will help with the computer-aided design of new drugs in which the rare unbinding of the drug molecule is invariably a central obstacle.