20 research outputs found
Rescaling, thinning or complementing? On goodness-of-fit procedures for point process models and Generalized Linear Models
Generalized Linear Models (GLMs) are an increasingly popular framework for
modeling neural spike trains. They have been linked to the theory of stochastic
point processes and researchers have used this relation to assess
goodness-of-fit using methods from point-process theory, e.g. the
time-rescaling theorem. However, high neural firing rates or coarse
discretization lead to a breakdown of the assumptions necessary for this
connection. Here, we show how goodness-of-fit tests from point-process theory
can still be applied to GLMs by constructing equivalent surrogate point
processes out of time-series observations. Furthermore, two additional tests
based on thinning and complementing point processes are introduced. They
augment the instruments available for checking model adequacy of point
processes as well as discretized models.Comment: 9 pages, to appear in NIPS 2010 (Neural Information Processing
Systems), corrected missing referenc
Markov Network Structure Learning via Ensemble-of-Forests Models
Real world systems typically feature a variety of different dependency types
and topologies that complicate model selection for probabilistic graphical
models. We introduce the ensemble-of-forests model, a generalization of the
ensemble-of-trees model. Our model enables structure learning of Markov random
fields (MRF) with multiple connected components and arbitrary potentials. We
present two approximate inference techniques for this model and demonstrate
their performance on synthetic data. Our results suggest that the
ensemble-of-forests approach can accurately recover sparse, possibly
disconnected MRF topologies, even in presence of non-Gaussian dependencies
and/or low sample size. We applied the ensemble-of-forests model to learn the
structure of perturbed signaling networks of immune cells and found that these
frequently exhibit non-Gaussian dependencies with disconnected MRF topologies.
In summary, we expect that the ensemble-of-forests model will enable MRF
structure learning in other high dimensional real world settings that are
governed by non-trivial dependencies.Comment: 13 pages, 6 figure
A copula-based method to build diffusion models with prescribed marginal and serial dependence
This paper investigates the probabilistic properties that determine the
existence of space-time transformations between diffusion processes. We prove
that two diffusions are related by a monotone space-time transformation if and
only if they share the same serial dependence. The serial dependence of a
diffusion process is studied by means of its copula density and the effect of
monotone and non-monotone space-time transformations on the copula density is
discussed. This provides us a methodology to build diffusion models by freely
combining prescribed marginal behaviors and temporal dependence structures.
Explicit expressions of copula densities are provided for tractable models. A
possible application in neuroscience is sketched as a proof of concept
A study of dependency features of spike trains through copulas
Simultaneous recordings from many neurons hide important information and the
connections characterizing the network remain generally undiscovered despite
the progresses of statistical and machine learning techniques. Discerning the
presence of direct links between neuron from data is still a not completely
solved problem. To enlarge the number of tools for detecting the underlying
network structure, we propose here the use of copulas, pursuing on a research
direction we started in [1]. Here, we adapt their use to distinguish different
types of connections on a very simple network. Our proposal consists in
choosing suitable random intervals in pairs of spike trains determining the
shapes of their copulas. We show that this approach allows to detect different
types of dependencies. We illustrate the features of the proposed method on
synthetic data from suitably connected networks of two or three formal neurons
directly connected or influenced by the surrounding network. We show how a
smart choice of pairs of random times together with the use of empirical
copulas allows to discern between direct and un-direct interactions
A Semiparametric Bayesian Model for Detecting Synchrony Among Multiple Neurons
We propose a scalable semiparametric Bayesian model to capture dependencies
among multiple neurons by detecting their co-firing (possibly with some lag
time) patterns over time. After discretizing time so there is at most one spike
at each interval, the resulting sequence of 1's (spike) and 0's (silence) for
each neuron is modeled using the logistic function of a continuous latent
variable with a Gaussian process prior. For multiple neurons, the corresponding
marginal distributions are coupled to their joint probability distribution
using a parametric copula model. The advantages of our approach are as follows:
the nonparametric component (i.e., the Gaussian process model) provides a
flexible framework for modeling the underlying firing rates; the parametric
component (i.e., the copula model) allows us to make inference regarding both
contemporaneous and lagged relationships among neurons; using the copula model,
we construct multivariate probabilistic models by separating the modeling of
univariate marginal distributions from the modeling of dependence structure
among variables; our method is easy to implement using a computationally
efficient sampling algorithm that can be easily extended to high dimensional
problems. Using simulated data, we show that our approach could correctly
capture temporal dependencies in firing rates and identify synchronous neurons.
We also apply our model to spike train data obtained from prefrontal cortical
areas in rat's brain
Building population models for large-scale neural recordings: opportunities and pitfalls
Modern recording technologies now enable simultaneous recording from large
numbers of neurons. This has driven the development of new statistical models
for analyzing and interpreting neural population activity. Here we provide a
broad overview of recent developments in this area. We compare and contrast
different approaches, highlight strengths and limitations, and discuss
biological and mechanistic insights that these methods provide