839 research outputs found
Scalable software and models for large-scale extracellular recordings
The brain represents information about the world through the electrical activity of
populations of neurons. By placing an electrode near a neuron that is firing (spiking), it
is possible to detect the resulting extracellular action potential (EAP) that is transmitted
down an axon to other neurons. In this way, it is possible to monitor the communication
of a group of neurons to uncover how they encode and transmit information. As the
number of recorded neurons continues to increase, however, so do the data processing
and analysis challenges. It is crucial that scalable software and analysis tools are developed
and made available to the neuroscience community to keep up with the large
amounts of data that are already being gathered.
This thesis is composed of three pieces of work which I develop in order to better
process and analyze large-scale extracellular recordings. My work spans all stages of extracellular
analysis from the processing of raw electrical recordings to the development
of statistical models to reveal underlying structure in neural population activity.
In the first work, I focus on developing software to improve the comparison and adoption
of different computational approaches for spike sorting. When analyzing neural
recordings, most researchers are interested in the spiking activity of individual neurons,
which must be extracted from the raw electrical traces through a process called
spike sorting. Much development has been directed towards improving the performance
and automation of spike sorting. This continuous development, while essential,
has contributed to an over-saturation of new, incompatible tools that hinders rigorous
benchmarking and complicates reproducible analysis. To address these limitations, I
develop SpikeInterface, an open-source, Python framework designed to unify preexisting
spike sorting technologies into a single toolkit and to facilitate straightforward
benchmarking of different approaches. With this framework, I demonstrate that modern,
automated spike sorters have low agreement when analyzing the same dataset, i.e.
they find different numbers of neurons with different activity profiles; This result holds
true for a variety of simulated and real datasets. Also, I demonstrate that utilizing a
consensus-based approach to spike sorting, where the outputs of multiple spike sorters
are combined, can dramatically reduce the number of falsely detected neurons.
In the second work, I focus on developing an unsupervised machine learning approach
for determining the source location of individually detected spikes that are
recorded by high-density, microelectrode arrays. By localizing the source of individual
spikes, my method is able to determine the approximate position of the recorded neuriii
ons in relation to the microelectrode array. To allow my model to work with large-scale
datasets, I utilize deep neural networks, a family of machine learning algorithms that
can be trained to approximate complicated functions in a scalable fashion. I evaluate
my method on both simulated and real extracellular datasets, demonstrating that it is
more accurate than other commonly used methods. Also, I show that location estimates
for individual spikes can be utilized to improve the efficiency and accuracy of spike
sorting. After training, my method allows for localization of one million spikes in approximately
37 seconds on a TITAN X GPU, enabling real-time analysis of massive
extracellular datasets.
In my third and final presented work, I focus on developing an unsupervised machine
learning model that can uncover patterns of activity from neural populations
associated with a behaviour being performed. Specifically, I introduce Targeted Neural
Dynamical Modelling (TNDM), a statistical model that jointly models the neural activity
and any external behavioural variables. TNDM decomposes neural dynamics (i.e.
temporal activity patterns) into behaviourally relevant and behaviourally irrelevant dynamics;
the behaviourally relevant dynamics constitute all activity patterns required
to generate the behaviour of interest while behaviourally irrelevant dynamics may be
completely unrelated (e.g. other behavioural or brain states), or even related to behaviour
execution (e.g. dynamics that are associated with behaviour generally but are not
task specific). Again, I implement TNDM using a deep neural network to improve its
scalability and expressivity. On synthetic data and on real recordings from the premotor
(PMd) and primary motor cortex (M1) of a monkey performing a center-out reaching
task, I show that TNDM is able to extract low-dimensional neural dynamics that are
highly predictive of behaviour without sacrificing its fit to the neural data
Building population models for large-scale neural recordings: opportunities and pitfalls
Modern recording technologies now enable simultaneous recording from large
numbers of neurons. This has driven the development of new statistical models
for analyzing and interpreting neural population activity. Here we provide a
broad overview of recent developments in this area. We compare and contrast
different approaches, highlight strengths and limitations, and discuss
biological and mechanistic insights that these methods provide
SpikeInterface, a unified framework for spike sorting
Much development has been directed toward improving the performance and automation of spike sorting. This continuous development, while essential, has contributed to an over-saturation of new, incompatible tools that hinders rigorous benchmarking and complicates reproducible analysis. To address these limitations, we developed SpikeInterface, a Python framework designed to unify preexisting spike sorting technologies into a single codebase and to facilitate straightforward comparison and adoption of different approaches. With a few lines of code, researchers can reproducibly run, compare, and benchmark most modern spike sorting algorithms; pre-process, post-process, and visualize extracellular datasets; validate, curate, and export sorting outputs; and more. In this paper, we provide an overview of SpikeInterface and, with applications to real and simulated datasets, demonstrate how it can be utilized to reduce the burden of manual curation and to more comprehensively benchmark automated spike sorters.ISSN:2050-084
Scalable Spike Source Localization in Extracellular Recordings using Amortized Variational Inference
Targeted Neural Dynamical Modeling
Latent dynamics models have emerged as powerful tools for modeling and
interpreting neural population activity. Recently, there has been a focus on
incorporating simultaneously measured behaviour into these models to further
disentangle sources of neural variability in their latent space. These
approaches, however, are limited in their ability to capture the underlying
neural dynamics (e.g. linear) and in their ability to relate the learned
dynamics back to the observed behaviour (e.g. no time lag). To this end, we
introduce Targeted Neural Dynamical Modeling (TNDM), a nonlinear state-space
model that jointly models the neural activity and external behavioural
variables. TNDM decomposes neural dynamics into behaviourally relevant and
behaviourally irrelevant dynamics; the relevant dynamics are used to
reconstruct the behaviour through a flexible linear decoder and both sets of
dynamics are used to reconstruct the neural activity through a linear decoder
with no time lag. We implement TNDM as a sequential variational autoencoder and
validate it on simulated recordings and recordings taken from the premotor and
motor cortex of a monkey performing a center-out reaching task. We show that
TNDM is able to learn low-dimensional latent dynamics that are highly
predictive of behaviour without sacrificing its fit to the neural data
In the Beginning: The First Sources of Light and the Reionization of the Universe
The formation of the first stars and quasars marks the transformation of the
universe from its smooth initial state to its clumpy current state. In popular
cosmological models, the first sources of light began to form at redshift 30
and reionized most of the hydrogen in the universe by redshift 7. Current
observations are at the threshold of probing the hydrogen reionization epoch.
The study of high-redshift sources is likely to attract major attention in
observational and theoretical cosmology over the next decade.Comment: Final revision: 136 pages, including 42 figures; to be published in
Physics Reports 2001. References updated, and a few minor corrections made.
In this submission, several figures were compressed, resulting in just a
slight reduction in quality; a postscript file with the full figures is
available at http://www.cita.utoronto.ca/~barkana/review.htm
- …