Network properties of written human language

A. P. Masucci and G. J. Rodgers
Phys. Rev. E 74, 026102 – Published 2 August 2006

Abstract

We investigate the nature of written human language within the framework of complex network theory. In particular, we analyze the topology of Orwell’s 1984 focusing on the local properties of the network, such as the properties of the nearest neighbors and the clustering coefficient. We find a composite power law behavior for both the average nearest neighbor’s degree and average clustering coefficient as a function of the vertex degree. This implies the existence of different functional classes of vertices. Furthermore, we find that the second order vertex correlations are an essential component of the network architecture. To model our empirical results we extend a previously introduced model for language due to Dorogovtsev and Mendes. We propose an accelerated growing network model that contains three growth mechanisms: linear preferential attachment, local preferential attachment, and the random growth of a predetermined small finite subset of initial vertices. We find that with these elementary stochastic rules we are able to produce a network showing syntacticlike structures.

  • Figure
  • Figure
  • Figure
  • Figure
  • Figure
  • Figure
  • Figure
2 More
  • Received 7 April 2006

DOI:https://doi.org/10.1103/PhysRevE.74.026102

©2006 American Physical Society

Authors & Affiliations

A. P. Masucci and G. J. Rodgers

  • Department of Mathematical Sciences, Brunel University, Uxbridge, Middlesex, UB8 3PH, United Kingdom

Article Text (Subscription Required)

Click to Expand

References (Subscription Required)

Click to Expand
Issue

Vol. 74, Iss. 2 — August 2006

Reuse & Permissions
Access Options
Author publication services for translation and copyediting assistance advertisement

Authorization Required


×
×

Images

×

Sign up to receive regular email alerts from Physical Review E

Log In

Cancel
×

Search


Article Lookup

Paste a citation or DOI

Enter a citation
×