Towards real-time community detection in large networks

Ian X. Y. Leung, Pan Hui, Pietro Liò, and Jon Crowcroft
Phys. Rev. E 79, 066107 – Published 16 June 2009

Abstract

The recent boom of large-scale online social networks (OSNs) both enables and necessitates the use of parallelizable and scalable computational techniques for their analysis. We examine the problem of real-time community detection and a recently proposed linear time—O(m) on a network with m edges—label propagation, or “epidemic” community detection algorithm. We identify characteristics and drawbacks of the algorithm and extend it by incorporating different heuristics to facilitate reliable and multifunctional real-time community detection. With limited computational resources, we employ the algorithm on OSN data with 1×106 nodes and about 58×106 directed edges. Experiments and benchmarks reveal that the extended algorithm is not only faster but its community detection accuracy compares favorably over popular modularity-gain optimization algorithms known to suffer from their resolution limits.

  • Figure
  • Figure
  • Figure
  • Figure
  • Figure
  • Figure
  • Figure
4 More
  • Received 2 September 2008

DOI:https://doi.org/10.1103/PhysRevE.79.066107

©2009 American Physical Society

Authors & Affiliations

Ian X. Y. Leung*, Pan Hui, Pietro Liò, and Jon Crowcroft§

  • Computer Laboratory, University of Cambridge, Cambridge CB3 0FD, United Kingdom

  • *ian.leung@cl.cam.ac.uk
  • pan.hui@cl.cam.ac.uk
  • pietro.lio@cl.cam.ac.uk
  • §jon.crowcroft@cl.cam.ac.uk

Article Text (Subscription Required)

Click to Expand

References (Subscription Required)

Click to Expand
Issue

Vol. 79, Iss. 6 — June 2009

Reuse & Permissions
Access Options
Author publication services for translation and copyediting assistance advertisement

Authorization Required


×
×

Images

×

Sign up to receive regular email alerts from Physical Review E

Log In

Cancel
×

Search


Article Lookup

Paste a citation or DOI

Enter a citation
×