26,007 research outputs found
Local Tomography of Large Networks under the Low-Observability Regime
This article studies the problem of reconstructing the topology of a network
of interacting agents via observations of the state-evolution of the agents. We
focus on the large-scale network setting with the additional constraint of
observations, where only a small fraction of the agents can be
feasibly observed. The goal is to infer the underlying subnetwork of
interactions and we refer to this problem as . In order to
study the large-scale setting, we adopt a proper stochastic formulation where
the unobserved part of the network is modeled as an Erd\"{o}s-R\'enyi random
graph, while the observable subnetwork is left arbitrary. The main result of
this work is establishing that, under this setting, local tomography is
actually possible with high probability, provided that certain conditions on
the network model are met (such as stability and symmetry of the network
combination matrix). Remarkably, such conclusion is established under the
- , where the cardinality of the observable
subnetwork is fixed, while the size of the overall network scales to infinity.Comment: To appear in IEEE Transactions on Information Theor
Gaussian Belief with dynamic data and in dynamic network
In this paper we analyse Belief Propagation over a Gaussian model in a
dynamic environment. Recently, this has been proposed as a method to average
local measurement values by a distributed protocol ("Consensus Propagation",
Moallemi & Van Roy, 2006), where the average is available for read-out at every
single node. In the case that the underlying network is constant but the values
to be averaged fluctuate ("dynamic data"), convergence and accuracy are
determined by the spectral properties of an associated Ruelle-Perron-Frobenius
operator. For Gaussian models on Erdos-Renyi graphs, numerical computation
points to a spectral gap remaining in the large-size limit, implying
exceptionally good scalability. In a model where the underlying network also
fluctuates ("dynamic network"), averaging is more effective than in the dynamic
data case. Altogether, this implies very good performance of these methods in
very large systems, and opens a new field of statistical physics of large (and
dynamic) information systems.Comment: 5 pages, 7 figure
Probabilistic Graphical Model Representation in Phylogenetics
Recent years have seen a rapid expansion of the model space explored in
statistical phylogenetics, emphasizing the need for new approaches to
statistical model representation and software development. Clear communication
and representation of the chosen model is crucial for: (1) reproducibility of
an analysis, (2) model development and (3) software design. Moreover, a
unified, clear and understandable framework for model representation lowers the
barrier for beginners and non-specialists to grasp complex phylogenetic models,
including their assumptions and parameter/variable dependencies.
Graphical modeling is a unifying framework that has gained in popularity in
the statistical literature in recent years. The core idea is to break complex
models into conditionally independent distributions. The strength lies in the
comprehensibility, flexibility, and adaptability of this formalism, and the
large body of computational work based on it. Graphical models are well-suited
to teach statistical models, to facilitate communication among phylogeneticists
and in the development of generic software for simulation and statistical
inference.
Here, we provide an introduction to graphical models for phylogeneticists and
extend the standard graphical model representation to the realm of
phylogenetics. We introduce a new graphical model component, tree plates, to
capture the changing structure of the subgraph corresponding to a phylogenetic
tree. We describe a range of phylogenetic models using the graphical model
framework and introduce modules to simplify the representation of standard
components in large and complex models. Phylogenetic model graphs can be
readily used in simulation, maximum likelihood inference, and Bayesian
inference using, for example, Metropolis-Hastings or Gibbs sampling of the
posterior distribution
Active Topology Inference using Network Coding
Our goal is to infer the topology of a network when (i) we can send probes
between sources and receivers at the edge of the network and (ii) intermediate
nodes can perform simple network coding operations, i.e., additions. Our key
intuition is that network coding introduces topology-dependent correlation in
the observations at the receivers, which can be exploited to infer the
topology. For undirected tree topologies, we design hierarchical clustering
algorithms, building on our prior work. For directed acyclic graphs (DAGs),
first we decompose the topology into a number of two-source, two-receiver
(2-by-2) subnetwork components and then we merge these components to
reconstruct the topology. Our approach for DAGs builds on prior work on
tomography, and improves upon it by employing network coding to accurately
distinguish among all different 2-by-2 components. We evaluate our algorithms
through simulation of a number of realistic topologies and compare them to
active tomographic techniques without network coding. We also make connections
between our approach and alternatives, including passive inference, traceroute,
and packet marking
Spectral partitioning of time-varying networks with unobserved edges
We discuss a variant of `blind' community detection, in which we aim to
partition an unobserved network from the observation of a (dynamical) graph
signal defined on the network. We consider a scenario where our observed graph
signals are obtained by filtering white noise input, and the underlying network
is different for every observation. In this fashion, the filtered graph signals
can be interpreted as defined on a time-varying network. We model each of the
underlying network realizations as generated by an independent draw from a
latent stochastic blockmodel (SBM). To infer the partition of the latent SBM,
we propose a simple spectral algorithm for which we provide a theoretical
analysis and establish consistency guarantees for the recovery. We illustrate
our results using numerical experiments on synthetic and real data,
highlighting the efficacy of our approach.Comment: 5 pages, 2 figure
- …