145 research outputs found

    Quantifying selection in immune receptor repertoires

    Full text link
    The efficient recognition of pathogens by the adaptive immune system relies on the diversity of receptors displayed at the surface of immune cells. T-cell receptor diversity results from an initial random DNA editing process, called VDJ recombination, followed by functional selection of cells according to the interaction of their surface receptors with self and foreign antigenic peptides. To quantify the effect of selection on the highly variable elements of the receptor, we apply a probabilistic maximum likelihood approach to the analysis of high-throughput sequence data from the ÎČ\beta-chain of human T-cell receptors. We quantify selection factors for V and J gene choice, and for the length and amino-acid composition of the variable region. Our approach is necessary to disentangle the effects of selection from biases inherent in the recombination process. Inferred selection factors differ little between donors, or between naive and memory repertoires. The number of sequences shared between donors is well-predicted by the model, indicating a purely stochastic origin of such "public" sequences. We find a significant correlation between biases induced by VDJ recombination and our inferred selection factors, together with a reduction of diversity during selection. Both effects suggest that natural selection acting on the recombination process has anticipated the selection pressures experienced during somatic evolution

    Predicting the spectrum of TCR repertoire sharing with a data-driven model of recombination

    Full text link
    Despite the extreme diversity of T cell repertoires, many identical T-cell receptor (TCR) sequences are found in a large number of individual mice and humans. These widely-shared sequences, often referred to as `public', have been suggested to be over-represented due to their potential immune functionality or their ease of generation by V(D)J recombination. Here we show that even for large cohorts the observed degree of sharing of TCR sequences between individuals is well predicted by a model accounting for by the known quantitative statistical biases in the generation process, together with a simple model of thymic selection. Whether a sequence is shared by many individuals is predicted to depend on the number of queried individuals and the sampling depth, as well as on the sequence itself, in agreement with the data. We introduce the degree of publicness conditional on the queried cohort size and the size of the sampled repertoires. Based on these observations we propose a public/private sequence classifier, `PUBLIC' (Public Universal Binary Likelihood Inference Classifier), based on the generation probability, which performs very well even for small cohort sizes

    Karst aquifer discharge response to rainfall interpreted as anomalous transport

    Get PDF
    The discharge measured in karst springs is known to exhibit distinctive long tails during recession times following distinct short-duration discharge peaks. The long-tailed behavior is generally attributed to the occurrence of tortuous, ramified flow paths that develop in the underground structure of karst systems. Modeling the discharge behavior poses unique difficulties because of the poorly delineated flow path geometry and generally scarce information on the hydraulic properties of catchment-scale systems. In a different context, modeling of long-tailed behavior has been addressed in studies of chemical transport. Here, an adaptation of a continuous time random walk–particle tracking (CTRW-PT) framework for anomalous transport is proposed, which offers a robust means to quantify long-tailed breakthrough curves that often arise during the transport of chemical species under various flow scenarios. A theoretical analogy is first established between partially water-saturated karst flow, characterized by temporally varying water storage, and chemical transport involving the accumulation and release of a chemical tracer. This analogy is then used to develop and implement a CTRW-PT model. Application of this numerical model to the examination of 3 years of summer rainfall and discharge data from a karst aquifer system – the Disnergschroef high-alpine site in the Austrian Alps – is shown to yield robust fits between modeled and measured discharge values. In particular, the analysis underscores the predominance of slow diffusive flow over rapid conduit flow. The study affirms the analogy between partially saturated karst flow and chemical transport, exemplifying the compatibility of the CTRW-PT model for this purpose. Within the specific context of the Disnergschroef karst system, these findings highlight the predominance of slow diffusive flow over rapid conduit flow. The agreement between measured and simulated data supports the proposed analogy between partially saturated karst flow and chemical transport; it also highlights the potential ability of the anomalous transport framework to further enhance modeling of flow and transport in karst systems

    Inferring processes underlying B-cell repertoire diversity

    Full text link
    We quantify the VDJ recombination and somatic hypermutation processes in human B-cells using probabilistic inference methods on high-throughput DNA sequence repertoires of human B-cell receptor heavy chains. Our analysis captures the statistical properties of the naive repertoire, first after its initial generation via VDJ recombination and then after selection for functionality. We also infer statistical properties of the somatic hypermutation machinery (exclusive of subsequent effects of selection). Our main results are the following: the B-cell repertoire is substantially more diverse than T-cell repertoires, due to longer junctional insertions; sequences that pass initial selection are distinguished by having a higher probability of being generated in a VDJ recombination event; somatic hypermutations have a non-uniform distribution along the V gene that is well explained by an independent site model for the sequence context around the hypermutation site.Comment: acknowledgement adde

    OLGA: fast computation of generation probabilities of B- and T-cell receptor amino acid sequences and motifs

    Full text link
    Motivation: High-throughput sequencing of large immune repertoires has enabled the development of methods to predict the probability of generation by V(D)J recombination of T- and B-cell receptors of any specific nucleotide sequence. These generation probabilities are very non-homogeneous, ranging over 20 orders of magnitude in real repertoires. Since the function of a receptor really depends on its protein sequence, it is important to be able to predict this probability of generation at the amino acid level. However, brute-force summation over all the nucleotide sequences with the correct amino acid translation is computationally intractable. The purpose of this paper is to present a solution to this problem. Results: We use dynamic programming to construct an efficient and flexible algorithm, called OLGA (Optimized Likelihood estimate of immunoGlobulin Amino-acid sequences), for calculating the probability of generating a given CDR3 amino acid sequence or motif, with or without V/J restriction, as a result of V(D)J recombination in B or T cells. We apply it to databases of epitope-specific T-cell receptors to evaluate the probability that a typical human subject will possess T cells responsive to specific disease-associated epitopes. The model prediction shows an excellent agreement with published data. We suggest that OLGA may be a useful tool to guide vaccine design. Availability: Source code is available at https://github.com/zsethna/OLG

    Population variability in the generation and thymic selection of T-cell repertoires

    Full text link
    The diversity of T-cell receptor (TCR) repertoires is achieved by a combination of two intrinsically stochastic steps: random receptor generation by VDJ recombination, and selection based on the recognition of random self-peptides presented on the major histocompatibility complex. These processes lead to a large receptor variability within and between individuals. However, the characterization of the variability is hampered by the limited size of the sampled repertoires. We introduce a new software tool SONIA to facilitate inference of individual-specific computational models for the generation and selection of the TCR beta chain (TRB) from sequenced repertoires of 651 individuals, separating and quantifying the variability of the two processes of generation and selection in the population. We find not only that most of the variability is driven by the VDJ generation process, but there is a large degree of consistency between individuals with the inter-individual variance of repertoires being about 2% of the intra-individual variance. Known viral-specific TCRs follow the same generation and selection statistics as all TCRs.Comment: 13 pages, 7 figure, 2 table

    On generative models of T-cell receptor sequences

    Full text link
    T-cell receptors (TCR) are key proteins of the adaptive immune system, generated randomly in each individual, whose diversity underlies our ability to recognize infections and malignancies. Modeling the distribution of TCR sequences is of key importance for immunology and medical applications. Here, we compare two inference methods trained on high-throughput sequencing data: a knowledge-guided approach, which accounts for the details of sequence generation, supplemented by a physics-inspired model of selection; and a knowledge-free Variational Auto-Encoder based on deep artificial neural networks. We show that the knowledge-guided model outperforms the deep network approach at predicting TCR probabilities, while being more interpretable, at a lower computational cost

    Computational strategies for dissecting the high-dimensional complexity of adaptive immune repertoires

    Full text link
    The adaptive immune system recognizes antigens via an immense array of antigen-binding antibodies and T-cell receptors, the immune repertoire. The interrogation of immune repertoires is of high relevance for understanding the adaptive immune response in disease and infection (e.g., autoimmunity, cancer, HIV). Adaptive immune receptor repertoire sequencing (AIRR-seq) has driven the quantitative and molecular-level profiling of immune repertoires thereby revealing the high-dimensional complexity of the immune receptor sequence landscape. Several methods for the computational and statistical analysis of large-scale AIRR-seq data have been developed to resolve immune repertoire complexity in order to understand the dynamics of adaptive immunity. Here, we review the current research on (i) diversity, (ii) clustering and network, (iii) phylogenetic and (iv) machine learning methods applied to dissect, quantify and compare the architecture, evolution, and specificity of immune repertoires. We summarize outstanding questions in computational immunology and propose future directions for systems immunology towards coupling AIRR-seq with the computational discovery of immunotherapeutics, vaccines, and immunodiagnostics.Comment: 27 pages, 2 figure

    Restoration of energy homeostasis by SIRT6 extends healthy lifespan

    Get PDF
    Aging leads to a gradual decline in physical activity and disrupted energy homeostasis. The NAD+-dependent SIRT6 deacylase regulates aging and metabolism through mechanisms that largely remain unknown. Here, we show that SIRT6 overexpression leads to a reduction in frailty and lifespan extension in both male and female B6 mice. A combination of physiological assays, in vivo multi-omics analyses and 13C lactate tracing identified an age-dependent decline in glucose homeostasis and hepatic glucose output in wild type mice. In contrast, aged SIRT6-transgenic mice preserve hepatic glucose output and glucose homeostasis through an improvement in the utilization of two major gluconeogenic precursors, lactate and glycerol. To mediate these changes, mechanistically, SIRT6 increases hepatic gluconeogenic gene expression, de novo NAD+ synthesis, and systemically enhances glycerol release from adipose tissue. These findings show that SIRT6 optimizes energy homeostasis in old age to delay frailty and preserve healthy aging
    • 

    corecore