145 research outputs found
Quantifying selection in immune receptor repertoires
The efficient recognition of pathogens by the adaptive immune system relies
on the diversity of receptors displayed at the surface of immune cells. T-cell
receptor diversity results from an initial random DNA editing process, called
VDJ recombination, followed by functional selection of cells according to the
interaction of their surface receptors with self and foreign antigenic
peptides. To quantify the effect of selection on the highly variable elements
of the receptor, we apply a probabilistic maximum likelihood approach to the
analysis of high-throughput sequence data from the -chain of human
T-cell receptors. We quantify selection factors for V and J gene choice, and
for the length and amino-acid composition of the variable region. Our approach
is necessary to disentangle the effects of selection from biases inherent in
the recombination process. Inferred selection factors differ little between
donors, or between naive and memory repertoires. The number of sequences shared
between donors is well-predicted by the model, indicating a purely stochastic
origin of such "public" sequences. We find a significant correlation between
biases induced by VDJ recombination and our inferred selection factors,
together with a reduction of diversity during selection. Both effects suggest
that natural selection acting on the recombination process has anticipated the
selection pressures experienced during somatic evolution
Predicting the spectrum of TCR repertoire sharing with a data-driven model of recombination
Despite the extreme diversity of T cell repertoires, many identical T-cell
receptor (TCR) sequences are found in a large number of individual mice and
humans. These widely-shared sequences, often referred to as `public', have been
suggested to be over-represented due to their potential immune functionality or
their ease of generation by V(D)J recombination. Here we show that even for
large cohorts the observed degree of sharing of TCR sequences between
individuals is well predicted by a model accounting for by the known
quantitative statistical biases in the generation process, together with a
simple model of thymic selection. Whether a sequence is shared by many
individuals is predicted to depend on the number of queried individuals and the
sampling depth, as well as on the sequence itself, in agreement with the data.
We introduce the degree of publicness conditional on the queried cohort size
and the size of the sampled repertoires. Based on these observations we propose
a public/private sequence classifier, `PUBLIC' (Public Universal Binary
Likelihood Inference Classifier), based on the generation probability, which
performs very well even for small cohort sizes
Karst aquifer discharge response to rainfall interpreted as anomalous transport
The discharge measured in karst springs is known to exhibit distinctive long tails during recession times following distinct short-duration discharge peaks. The long-tailed behavior is generally attributed to the occurrence of tortuous, ramified flow paths that develop in the underground structure of karst systems. Modeling the discharge behavior poses unique difficulties because of the poorly delineated flow path geometry and generally scarce information on the hydraulic properties of catchment-scale systems. In a different context, modeling of long-tailed behavior has been addressed in studies of chemical transport. Here, an adaptation of a continuous time random walkâparticle tracking (CTRW-PT) framework for anomalous transport is proposed, which offers a robust means to quantify long-tailed breakthrough curves that often arise during the transport of chemical species under various flow scenarios. A theoretical analogy is first established between partially water-saturated karst flow, characterized by temporally varying water storage, and chemical transport involving the accumulation and release of a chemical tracer. This analogy is then used to develop and implement a CTRW-PT model. Application of this numerical model to the examination of 3 years of summer rainfall and discharge data from a karst aquifer system â the Disnergschroef high-alpine site in the Austrian Alps â is shown to yield robust fits between modeled and measured discharge values. In particular, the analysis underscores the predominance of slow diffusive flow over rapid conduit flow. The study affirms the analogy between partially saturated karst flow and chemical transport, exemplifying the compatibility of the CTRW-PT model for this purpose. Within the specific context of the Disnergschroef karst system, these findings highlight the predominance of slow diffusive flow over rapid conduit flow. The agreement between measured and simulated data supports the proposed analogy between partially saturated karst flow and chemical transport; it also highlights the potential ability of the anomalous transport framework to further enhance modeling of flow and transport in karst systems
Inferring processes underlying B-cell repertoire diversity
We quantify the VDJ recombination and somatic hypermutation processes in
human B-cells using probabilistic inference methods on high-throughput DNA
sequence repertoires of human B-cell receptor heavy chains. Our analysis
captures the statistical properties of the naive repertoire, first after its
initial generation via VDJ recombination and then after selection for
functionality. We also infer statistical properties of the somatic
hypermutation machinery (exclusive of subsequent effects of selection). Our
main results are the following: the B-cell repertoire is substantially more
diverse than T-cell repertoires, due to longer junctional insertions; sequences
that pass initial selection are distinguished by having a higher probability of
being generated in a VDJ recombination event; somatic hypermutations have a
non-uniform distribution along the V gene that is well explained by an
independent site model for the sequence context around the hypermutation site.Comment: acknowledgement adde
OLGA: fast computation of generation probabilities of B- and T-cell receptor amino acid sequences and motifs
Motivation: High-throughput sequencing of large immune repertoires has
enabled the development of methods to predict the probability of generation by
V(D)J recombination of T- and B-cell receptors of any specific nucleotide
sequence. These generation probabilities are very non-homogeneous, ranging over
20 orders of magnitude in real repertoires. Since the function of a receptor
really depends on its protein sequence, it is important to be able to predict
this probability of generation at the amino acid level. However, brute-force
summation over all the nucleotide sequences with the correct amino acid
translation is computationally intractable. The purpose of this paper is to
present a solution to this problem.
Results: We use dynamic programming to construct an efficient and flexible
algorithm, called OLGA (Optimized Likelihood estimate of immunoGlobulin
Amino-acid sequences), for calculating the probability of generating a given
CDR3 amino acid sequence or motif, with or without V/J restriction, as a result
of V(D)J recombination in B or T cells. We apply it to databases of
epitope-specific T-cell receptors to evaluate the probability that a typical
human subject will possess T cells responsive to specific disease-associated
epitopes. The model prediction shows an excellent agreement with published
data. We suggest that OLGA may be a useful tool to guide vaccine design.
Availability: Source code is available at https://github.com/zsethna/OLG
Population variability in the generation and thymic selection of T-cell repertoires
The diversity of T-cell receptor (TCR) repertoires is achieved by a
combination of two intrinsically stochastic steps: random receptor generation
by VDJ recombination, and selection based on the recognition of random
self-peptides presented on the major histocompatibility complex. These
processes lead to a large receptor variability within and between individuals.
However, the characterization of the variability is hampered by the limited
size of the sampled repertoires. We introduce a new software tool SONIA to
facilitate inference of individual-specific computational models for the
generation and selection of the TCR beta chain (TRB) from sequenced repertoires
of 651 individuals, separating and quantifying the variability of the two
processes of generation and selection in the population. We find not only that
most of the variability is driven by the VDJ generation process, but there is a
large degree of consistency between individuals with the inter-individual
variance of repertoires being about 2% of the intra-individual variance. Known
viral-specific TCRs follow the same generation and selection statistics as all
TCRs.Comment: 13 pages, 7 figure, 2 table
On generative models of T-cell receptor sequences
T-cell receptors (TCR) are key proteins of the adaptive immune system,
generated randomly in each individual, whose diversity underlies our ability to
recognize infections and malignancies. Modeling the distribution of TCR
sequences is of key importance for immunology and medical applications. Here,
we compare two inference methods trained on high-throughput sequencing data: a
knowledge-guided approach, which accounts for the details of sequence
generation, supplemented by a physics-inspired model of selection; and a
knowledge-free Variational Auto-Encoder based on deep artificial neural
networks. We show that the knowledge-guided model outperforms the deep network
approach at predicting TCR probabilities, while being more interpretable, at a
lower computational cost
Computational strategies for dissecting the high-dimensional complexity of adaptive immune repertoires
The adaptive immune system recognizes antigens via an immense array of
antigen-binding antibodies and T-cell receptors, the immune repertoire. The
interrogation of immune repertoires is of high relevance for understanding the
adaptive immune response in disease and infection (e.g., autoimmunity, cancer,
HIV). Adaptive immune receptor repertoire sequencing (AIRR-seq) has driven the
quantitative and molecular-level profiling of immune repertoires thereby
revealing the high-dimensional complexity of the immune receptor sequence
landscape. Several methods for the computational and statistical analysis of
large-scale AIRR-seq data have been developed to resolve immune repertoire
complexity in order to understand the dynamics of adaptive immunity. Here, we
review the current research on (i) diversity, (ii) clustering and network,
(iii) phylogenetic and (iv) machine learning methods applied to dissect,
quantify and compare the architecture, evolution, and specificity of immune
repertoires. We summarize outstanding questions in computational immunology and
propose future directions for systems immunology towards coupling AIRR-seq with
the computational discovery of immunotherapeutics, vaccines, and
immunodiagnostics.Comment: 27 pages, 2 figure
Restoration of energy homeostasis by SIRT6 extends healthy lifespan
Aging leads to a gradual decline in physical activity and disrupted energy homeostasis. The NAD+-dependent SIRT6 deacylase regulates aging and metabolism through mechanisms that largely remain unknown. Here, we show that SIRT6 overexpression leads to a reduction in frailty and lifespan extension in both male and female B6 mice. A combination of physiological assays, in vivo multi-omics analyses and 13C lactate tracing identified an age-dependent decline in glucose homeostasis and hepatic glucose output in wild type mice. In contrast, aged SIRT6-transgenic mice preserve hepatic glucose output and glucose homeostasis through an improvement in the utilization of two major gluconeogenic precursors, lactate and glycerol. To mediate these changes, mechanistically, SIRT6 increases hepatic gluconeogenic gene expression, de novo NAD+ synthesis, and systemically enhances glycerol release from adipose tissue. These findings show that SIRT6 optimizes energy homeostasis in old age to delay frailty and preserve healthy aging
- âŠ