4,228 research outputs found
Multilevel compression of random walks on networks reveals hierarchical organization in large integrated systems
To comprehend the hierarchical organization of large integrated systems, we
introduce the hierarchical map equation, which reveals multilevel structures in
networks. In this information-theoretic approach, we exploit the duality
between compression and pattern detection; by compressing a description of a
random walker as a proxy for real flow on a network, we find regularities in
the network that induce this system-wide flow. Finding the shortest multilevel
description of the random walker therefore gives us the best hierarchical
clustering of the network, the optimal number of levels and modular partition
at each level, with respect to the dynamics on the network. With a novel search
algorithm, we extract and illustrate the rich multilevel organization of
several large social and biological networks. For example, from the global air
traffic network we uncover countries and continents, and from the pattern of
scientific communication we reveal more than 100 scientific fields organized in
four major disciplines: life sciences, physical sciences, ecology and earth
sciences, and social sciences. In general, we find shallow hierarchical
structures in globally interconnected systems, such as neural networks, and
rich multilevel organizations in systems with highly separated regions, such as
road networks.Comment: 11 pages, 5 figures. For associated code, see
http://www.tp.umu.se/~rosvall/code.htm
An information-theoretic framework for resolving community structure in complex networks
To understand the structure of a large-scale biological, social, or
technological network, it can be helpful to decompose the network into smaller
subunits or modules. In this article, we develop an information-theoretic
foundation for the concept of modularity in networks. We identify the modules
of which the network is composed by finding an optimal compression of its
topology, capitalizing on regularities in its structure. We explain the
advantages of this approach and illustrate them by partitioning a number of
real-world and model networks.Comment: 5 pages, 4 figure
The map equation
Many real-world networks are so large that we must simplify their structure
before we can extract useful information about the systems they represent. As
the tools for doing these simplifications proliferate within the network
literature, researchers would benefit from some guidelines about which of the
so-called community detection algorithms are most appropriate for the
structures they are studying and the questions they are asking. Here we show
that different methods highlight different aspects of a network's structure and
that the the sort of information that we seek to extract about the system must
guide us in our decision. For example, many community detection algorithms,
including the popular modularity maximization approach, infer module
assignments from an underlying model of the network formation process. However,
we are not always as interested in how a system's network structure was formed,
as we are in how a network's extant structure influences the system's behavior.
To see how structure influences current behavior, we will recognize that links
in a network induce movement across the network and result in system-wide
interdependence. In doing so, we explicitly acknowledge that most networks
carry flow. To highlight and simplify the network structure with respect to
this flow, we use the map equation. We present an intuitive derivation of this
flow-based and information-theoretic method and provide an interactive on-line
application that anyone can use to explore the mechanics of the map equation.
We also describe an algorithm and provide source code to efficiently decompose
large weighted and directed networks based on the map equation.Comment: 9 pages and 3 figures, corrected typos. For associated Flash
application, see http://www.tp.umu.se/~rosvall/livemod/mapequation
Maps of random walks on complex networks reveal community structure
To comprehend the multipartite organization of large-scale biological and
social systems, we introduce a new information theoretic approach that reveals
community structure in weighted and directed networks. The method decomposes a
network into modules by optimally compressing a description of information
flows on the network. The result is a map that both simplifies and highlights
the regularities in the structure and their relationships. We illustrate the
method by making a map of scientific communication as captured in the citation
patterns of more than 6000 journals. We discover a multicentric organization
with fields that vary dramatically in size and degree of integration into the
network of science. Along the backbone of the network -- including physics,
chemistry, molecular biology, and medicine -- information flows
bidirectionally, but the map reveals a directional pattern of citation from the
applied fields to the basic sciences.Comment: 7 pages and 4 figures plus supporting material. For associated source
code, see http://www.tp.umu.se/~rosvall
- …