Abstract
It was recently argued that sampling a network by traversing it with paths from a small number of sources, as with traceroutes on the Internet, creates a fundamental bias in observed topological features like the degree distribution. We examine this bias analytically and experimentally. For Erdős-Rényi random graphs with mean degree , we show analytically that such sampling gives an observed degree distribution for , despite the underlying distribution being Poissonian. For graphs whose degree distributions have power-law tails , sampling can significantly underestimate when the graph has a large excess (i.e., many more edges than vertices). We find that in order to accurately estimate , one must use a number of sources which grows linearly in the mean degree of the underlying graph. Finally, we comment on the accuracy of the published values of for the Internet.
- Received 4 October 2004
DOI:https://doi.org/10.1103/PhysRevLett.94.018701
©2005 American Physical Society