79 research outputs found
Universality of Zipf's Law
Zipf's law is the most common statistical distribution displaying scaling
behavior. Cities, populations or firms are just examples of this seemingly
universal law. Although many different models have been proposed, no general
theoretical explanation has been shown to exist for its universality. Here we
show that Zipf's law is, in fact, an inevitable outcome of a very general class
of stochastic systems. Borrowing concepts from Algorithmic Information Theory,
our derivation is based on the properties of the symbolic sequence obtained
through successive observations over a system with an unbounded number of
possible states. Specifically, we assume that the complexity of the description
of the system provided by the sequence of observations is the one expected for
a system evolving to a stable state between order and disorder. This result is
obtained from a small set of mild, physically relevant assumptions. The general
nature of our derivation and its model-free basis would explain the ubiquity of
such a law in real systems.Comment: 11 Pages, 2 figure
Detection of the elite structure in a virtual multiplex social system by means of a generalized -core
Elites are subgroups of individuals within a society that have the ability
and means to influence, lead, govern, and shape societies. Members of elites
are often well connected individuals, which enables them to impose their
influence to many and to quickly gather, process, and spread information. Here
we argue that elites are not only composed of highly connected individuals, but
also of intermediaries connecting hubs to form a cohesive and structured
elite-subgroup at the core of a social network. For this purpose we present a
generalization of the -core algorithm that allows to identify a social core
that is composed of well-connected hubs together with their `connectors'. We
show the validity of the idea in the framework of a virtual world defined by a
massive multiplayer online game, on which we have complete information of
various social networks. Exploiting this multiplex structure, we find that the
hubs of the generalized -core identify those individuals that are high
social performers in terms of a series of indicators that are available in the
game. In addition, using a combined strategy which involves the generalized
-core and the recently introduced -core, the elites of the different
'nations' present in the game are perfectly identified as modules of the
generalized -core. Interesting sudden shifts in the composition of the elite
cores are observed at deep levels. We show that elite detection with the
traditional -core is not possible in a reliable way. The proposed method
might be useful in a series of more general applications, such as community
detection.Comment: 13 figures, 3 tables, 19 pages. Accepted for publication in PLoS ON
Extreme robustness of scaling in sample space reducing processes explains Zipf's law in diffusion on directed networks
It has been shown recently that a specific class of path-dependent stochastic
processes, which reduce their sample space as they unfold, lead to exact
scaling laws in frequency and rank distributions. Such Sample Space Reducing
processes (SSRP) offer an alternative new mechanism to understand the emergence
of scaling in countless processes. The corresponding power law exponents were
shown to be related to noise levels in the process. Here we show that the
emergence of scaling is not limited to the simplest SSRPs, but holds for a huge
domain of stochastic processes that are characterized by non-uniform prior
distributions. We demonstrate mathematically that in the absence of noise the
scaling exponents converge to (Zipf's law) for almost all prior
distributions. As a consequence it becomes possible to fully understand
targeted diffusion on weighted directed networks and its associated scaling
laws law in node visit distributions. The presence of cycles can be properly
interpreted as playing the same role as noise in SSRPs and, accordingly,
determine the scaling exponents. The result that Zipf's law emerges as a
generic feature of diffusion on networks, regardless of its details, and that
the exponent of visiting times is related to the amount of cycles in a network
could be relevant for a series of applications in traffic-, transport- and
supply chain management.Comment: 11 pages, 5 figure
Understanding scaling through history-dependent processes with collapsing sample space
History-dependent processes are ubiquitous in natural and social systems.
Many such stochastic processes, especially those that are associated with
complex systems, become more constrained as they unfold, meaning that their
sample-space, or their set of possible outcomes, reduces as they age. We
demonstrate that these sample-space reducing (SSR) processes necessarily lead
to Zipf's law in the rank distributions of their outcomes. We show that by
adding noise to SSR processes the corresponding rank distributions remain exact
power-laws, , where the exponent directly corresponds to
the mixing ratio of the SSR process and noise. This allows us to give a precise
meaning to the scaling exponent in terms of the degree to how much a given
process reduces its sample-space as it unfolds. Noisy SSR processes further
allow us to explain a wide range of scaling exponents in frequency
distributions ranging from to . We discuss several
applications showing how SSR processes can be used to understand Zipf's law in
word frequencies, and how they are related to diffusion processes in directed
networks, or ageing processes such as in fragmentation processes. SSR processes
provide a new alternative to understand the origin of scaling in complex
systems without the recourse to multiplicative, preferential, or self-organised
critical processes.Comment: 7 pages, 5 figures in Proceedings of the National Academy of Sciences
USA (published ahead of print April 13, 2015
Sample space reducing cascading processes produce the full spectrum of scaling exponents
Sample Space Reducing (SSR) processes are simple stochastic processes that
offer a new route to understand scaling in path-dependent processes. Here we
define a cascading process that generalises the recently defined SSR processes
and is able to produce power laws with arbitrary exponents. We demonstrate
analytically that the frequency distributions of states are power laws with
exponents that coincide with the multiplication parameter of the cascading
process. In addition, we show that imposing energy conservation in SSR cascades
allows us to recover Fermi's classic result on the energy spectrum of cosmic
rays, with the universal exponent -2, which is independent of the
multiplication parameter of the cascade. Applications of the proposed process
include fragmentation processes or directed cascading diffusion on networks,
such as rumour or epidemic spreading.Comment: 10 pages, 6 figure
Decomposing information into copying versus transformation
In many real-world systems, information can be transmitted in two
qualitatively different ways: by copying or by transformation. Copying occurs
when messages are transmitted without modification, e.g., when an offspring
receives an unaltered copy of a gene from its parent. Transformation occurs
when messages are modified systematically during transmission, e.g., when
mutational biases occur during genetic replication. Standard
information-theoretic measures do not distinguish these two modes of
information transfer, although they may reflect different mechanisms and have
different functional consequences. Starting from a few simple axioms, we derive
a decomposition of mutual information into the information transmitted by
copying versus the information transmitted by transformation. We begin with a
decomposition that applies when the source and destination of the channel have
the same set of messages and a notion of message identity exists. We then
generalize our decomposition to other kinds of channels, which can involve
different source and destination sets and broader notions of similarity. In
addition, we show that copy information can be interpreted as the minimal work
needed by a physical copying process, which is relevant for understanding the
physics of replication. We use the proposed decomposition to explore a model of
amino acid substitution rates. Our results apply to any system in which the
fidelity of copying, rather than simple predictability, is of critical
relevance
- …