Abstract
To comprehend interconnected systems across the social and natural sciences, researchers have developed many powerful methods to identify functional modules. For example, with interaction data aggregated into a single network layer, flow-based methods have proven useful for identifying modular dynamics in weighted and directed networks that capture constraints on flow processes. However, many interconnected systems consist of agents or components that exhibit multiple layers of interactions, possibly from several different processes. Inevitably, representing this intricate network of networks as a single aggregated network leads to information loss and may obscure the actual organization. Here, we propose a method based on a compression of network flows that can identify modular flows both within and across layers in nonaggregated multilayer networks. Our numerical experiments on synthetic multilayer networks, with some layers originating from the same interaction process, show that the analysis fails in aggregated networks or when treating the layers separately, whereas the multilayer method can accurately identify modules across layers that originate from the same interaction process. We capitalize on our findings and reveal the community structure of two multilayer collaboration networks with topics as layers: scientists affiliated with the Pierre Auger Observatory and scientists publishing works on networks on the arXiv. Compared to conventional aggregated methods, the multilayer method uncovers connected topics and reveals smaller modules with more overlap that better capture the actual organization.
- Received 12 September 2014
DOI:https://doi.org/10.1103/PhysRevX.5.011027
This article is available under the terms of the Creative Commons Attribution 3.0 License. Further distribution of this work must maintain attribution to the author(s) and the published article’s title, journal citation, and DOI.
Published by the American Physical Society
Popular Summary
The convention of representing different types of interactions in a system with a single type of link is no longer the panacea of network science. Instead, temporal-, memory-, and multiplex-network representations have proven to be necessary to capture essential structural information in social and biological systems. Many tools that are useful in conventional network science have rapidly been generalized to multilayer networks, but generalizing community-detection algorithms has turned out to be a challenge characterized by two questions: What is a community in a multilayer network? How can it be identified? We demonstrate that the information-theoretic and flow-based community-detection method known as the map equation provides an effective answer to both questions.
We use numerical experiments to investigate the interconnectedness of complex networks, with the goal of identifying communities linked by some common attribute. These communities capture flow (of ideas, people, topics, etc.) within and across layers of the network. The map equation that we generalize measures how well a community partition can describe the flow, and it allows us to find the multilayer communities that reveal most regularities in the flow. We illustrate the mathematical machinery and test our method using two real-world examples: the cohort of scientists associated with the Pierre Auger Observatory (a center devoted to studying high-energy cosmic rays) and researchers publishing their studies on the website arXiv. We find that our algorithm is able to capture smaller communities than previous studies, effectively probing networks at a higher level of resolution.
We expect that our results will have applications to complex social and biological systems in which people, relationships, and cells, for instance, are linked in myriad ways.