Entropies of biosequences: The role of repeats

Hanspeter Herzel; Werner Ebeling; Armin O. Schmitt

doi:10.1103/PhysRevE.50.5061

Entropies of biosequences: The role of repeats

Hanspeter Herzel, Werner Ebeling, and Armin O. Schmitt

Phys. Rev. E 50, 5061 – Published 1 December 1994

Abstract

DNA sequences of higher organisms contain thousands of nearly identical dispersed repetitive sequences. In order to understand the effect of such repeats on word entropies, we construct a model that can be analyzed analytically. The hypothetical model sequences consist of independent equidistributed symbols with randomly interspersed repeats. As a conclusion, we predict that the entropy of DNA sequences measuring the information content is much lower than suggested by earlier empirical studies.

Received 23 May 1994

DOI:https://doi.org/10.1103/PhysRevE.50.5061

Authors & Affiliations

Hanspeter Herzel

Institute of Theoretical Physics, Technical University, Hardenbergstrasse 36, D-10623 Berlin, Germany

Werner Ebeling and Armin O. Schmitt

Institute of Physics, Humboldt University, Invalidenstrasse 110, D-10115 Berlin, Germany

References (Subscription Required)

Click to Expand

Issue

Vol. 50, Iss. 6 — December 1994

Reuse & Permissions

Access Options

Author publication services for translation and copyediting assistance advertisement

Physical Review E

covering statistical, nonlinear, biological, and soft matter physics