Explaining Zipf's law via a mental lexicon

Armen E. Allahverdyan, Weibing Deng, and Q. A. Wang

Phys. Rev. E 88, 062804 – Published 3 December 2013

Abstract

Zipf's law is the major regularity of statistical linguistics and has served as a prototype for rank-frequency relations and scaling laws in the natural sciences. Here we show that Zipf's law, together with its applicability to a single text and its generalizations to high and low frequencies (including hapax legomena), can be derived by assuming that words are drawn into the text with random probabilities. The a priori density of these probabilities relates, via Bayesian statistics, to the mental lexicon of the author who produced the text.
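As a rough illustration of the mechanism the abstract describes, the sketch below draws word probabilities at random from a prior density, generates a text by sampling words independently with those probabilities, and tabulates the resulting rank-frequency relation and the number of hapax legomena (words that occur exactly once). The abstract does not specify the prior, so the symmetric Dirichlet density used here is an assumption, as are the vocabulary size, the parameter `alpha`, the text length, and all function names. It is a toy simulation, not the authors' derivation.

```python
import random
from collections import Counter


def sample_word_probabilities(n_words, alpha, rng):
    """Draw a probability vector from a symmetric Dirichlet(alpha) prior.

    ASSUMPTION: the Dirichlet form is chosen for illustration only.
    Construction: normalize independent Gamma(alpha, 1) draws.
    """
    weights = [rng.gammavariate(alpha, 1.0) for _ in range(n_words)]
    total = sum(weights)
    return [w / total for w in weights]


def generate_text(probabilities, length, rng):
    """Draw `length` word indices independently with the given probabilities."""
    vocabulary = range(len(probabilities))
    return rng.choices(vocabulary, weights=probabilities, k=length)


def rank_frequency(text):
    """Return normalized word frequencies, sorted from most to least frequent."""
    counts = Counter(text)
    n = len(text)
    return [count / n for _, count in counts.most_common()]


def count_hapax(text):
    """Count the words that occur exactly once in the text."""
    counts = Counter(text)
    return sum(1 for count in counts.values() if count == 1)


# Hypothetical parameters, chosen only to keep the run small.
rng = random.Random(0)
probs = sample_word_probabilities(n_words=5000, alpha=1.0, rng=rng)
text = generate_text(probs, length=20000, rng=rng)
freqs = rank_frequency(text)
hapax = count_hapax(text)

# Print the frequency at a few ranks; plotting log(frequency) against
# log(rank) is the usual way to inspect a rank-frequency relation.
for rank in (1, 10, 100, 1000):
    if rank <= len(freqs):
        print(rank, freqs[rank - 1])
print("distinct words:", len(freqs), "hapax legomena:", hapax)
```

Varying `alpha` and the ratio of text length to vocabulary size changes the shape of the simulated rank-frequency curve, which makes it easy to explore how the choice of prior influences the outcome.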

Received 21 December 2012

DOI: https://doi.org/10.1103/PhysRevE.88.062804

Authors & Affiliations

Armen E. Allahverdyan¹,², Weibing Deng¹,³,⁴, and Q. A. Wang¹,³

¹Laboratoire de Physique Statistique et Systèmes Complexes, ISMANS, 44 ave. Bartholdi, 72000 Le Mans, France
²Yerevan Physics Institute, Alikhanian Brothers Street 2, Yerevan 375036, Armenia
³IMMM, UMR CNRS 6283, Université du Maine, 72085 Le Mans, France
⁴Complexity Science Center and Institute of Particle Physics, Hua-Zhong Normal University, Wuhan 430079, China

Issue

Vol. 88, Iss. 6 — December 2013

Physical Review E

covering statistical, nonlinear, biological, and soft matter physics