1,597 research outputs found

    Bayesian correlated clustering to integrate multiple datasets

    Get PDF
    Motivation: The integration of multiple datasets remains a key challenge in systems biology and genomic medicine. Modern high-throughput technologies generate a broad array of different data types, providing distinct – but often complementary – information. We present a Bayesian method for the unsupervised integrative modelling of multiple datasets, which we refer to as MDI (Multiple Dataset Integration). MDI can integrate information from a wide range of different datasets and data types simultaneously (including the ability to model time series data explicitly using Gaussian processes). Each dataset is modelled using a Dirichlet-multinomial allocation (DMA) mixture model, with dependencies between these models captured via parameters that describe the agreement among the datasets. Results: Using a set of 6 artificially constructed time series datasets, we show that MDI is able to integrate a significant number of datasets simultaneously, and that it successfully captures the underlying structural similarity between the datasets. We also analyse a variety of real S. cerevisiae datasets. In the 2-dataset case, we show that MDI’s performance is comparable to the present state of the art. We then move beyond the capabilities of current approaches and integrate gene expression, ChIP-chip and protein-protein interaction data, to identify a set of protein complexes for which genes are co-regulated during the cell cycle. Comparisons to other unsupervised data integration techniques – as well as to non-integrative approaches – demonstrate that MDI is very competitive, while also providing information that would be difficult or impossible to extract using other methods

    Infinite factorization of multiple non-parametric views

    Get PDF
    Combined analysis of multiple data sources has increasing application interest, in particular for distinguishing shared and source-specific aspects. We extend this rationale of classical canonical correlation analysis into a flexible, generative and non-parametric clustering setting, by introducing a novel non-parametric hierarchical mixture model. The lower level of the model describes each source with a flexible non-parametric mixture, and the top level combines these to describe commonalities of the sources. The lower-level clusters arise from hierarchical Dirichlet Processes, inducing an infinite-dimensional contingency table between the views. The commonalities between the sources are modeled by an infinite block model of the contingency table, interpretable as non-negative factorization of infinite matrices, or as a prior for infinite contingency tables. With Gaussian mixture components plugged in for continuous measurements, the model is applied to two views of genes, mRNA expression and abundance of the produced proteins, to expose groups of genes that are co-regulated in either or both of the views. Cluster analysis of co-expression is a standard simple way of screening for co-regulation, and the two-view analysis extends the approach to distinguishing between pre- and post-translational regulation

    Single-cell analysis resolves the cell state transition and signaling dynamics associated with melanoma drug-induced resistance

    Get PDF
    Continuous BRAF inhibition of BRAF mutant melanomas triggers a series of cell state changes that lead to therapy resistance and escape from immune control before establishing acquired resistance genetically. We used genome-wide transcriptomics and single-cell phenotyping to explore the response kinetics to BRAF inhibition for a panel of patient-derived BRAF^(V600)-mutant melanoma cell lines. A subset of plastic cell lines, which followed a trajectory covering multiple known cell state transitions, provided models for more detailed biophysical investigations. Markov modeling revealed that the cell state transitions were reversible and mediated by both Lamarckian induction and nongenetic Darwinian selection of drug-tolerant states. Single-cell functional proteomics revealed activation of certain signaling networks shortly after BRAF inhibition, and before the appearance of drug-resistant phenotypes. Drug targeting those networks, in combination with BRAF inhibition, halted the adaptive transition and led to prolonged growth inhibition in multiple patient-derived cell lines

    Dissecting gene expression at the blood-brain barrier

    Get PDF
    The availability of genome-wide expression data for the blood-brain barrier is an invaluable resource that has recently enabled the discovery of several genes and pathways involved in the development and maintenance of the blood-brain barrier, particularly in rodent models. The broad distribution of published datasets represents a viable starting point for the molecular dissection of the blood-brain barrier and will further direct the discovery of novel mechanisms of blood-brain barrier formation and function. Technical advances in purifying brain endothelial cells, the key cell that forms the critical barrier, have allowed for greater specificity in gene expression comparisons with other central nervous system cell types, and more systematic characterizations of the molecular composition of the blood-brain barrier. Nevertheless, our understanding of how the blood-brain barrier changes during aging and disease is underrepresented. Blood-brain barrier datasets from a wider range of experimental paradigms and species, including invertebrates and primates, would be invaluable for investigating the function and evolution of the blood-brain barrier. Newer technologies in gene expression profiling, such as RNA-sequencing, now allow for finer resolution of transcriptomic changes, including isoform specificity and RNA-editing. As our field continues to utilize more advanced expression profiling in its ongoing efforts to elucidate the blood-brain barrier, including in disease and drug delivery, we will continue to see rapid advances in our understanding of the molecular mediators of barrier biology. We predict that the recently published datasets, combined with forthcoming genomic and proteomic blood-brain barrier datasets, will continue to fuel the molecular genetic revolution of blood-brain barrier biology

    Life during Dormancy: Genetic Regulation in Fission Yeast Spores and in Killifish Diapause

    Get PDF
    Dormant stages allow organisms to survive in life-threatening environments for extended periods of time. Dormancy is characterised by a reversible arrest of cell replication and increased stress resistance. In addition, dormancy involves reprogramming of gene expression and energy metabolism from a mode of proliferation and development to a mode of suspended growth, possibly including suspended ageing. Here I study the spores of fission yeast and diapause embryos of turquoise killifish to reveal similarities in transcriptome and proteome changes during dormancy. Moreover, I uncover some conservation in the genetic regulation of dormancy and halted ageing across different organisms and dormant stages, including yeast quiescence and worm dauer stages. In particular, I find ribosomal proteins and autophagy play critical roles in supporting dormancy in yeast and killifish. Supporting this result, functional analysis using Barcode sequencing of the genome-wide deletion library for fission yeast identifies ribosomal proteins and autophagy as important factors for survival during dormancy. Furthermore, while traditionally it has been assumed that spores in yeast and diapause in killifish equate to suspension biological activity, I find that both dormant states can respond to environmental or physiological triggers by altering their gene-expression programmes. Specifically, this response includes proteomic and transcriptomic changes to heat stress as well as changes with the chronological passing of time following their formation. While some of these genetic changes mimic non-dormant yeast stress responses, they differ from expression signatures observed during ageing. This finding is consistent with the idea that dormant yeast and killifish cells are not ageing in the same manner as non- dormant cells, or that ageing is even suspended during dormancy. Finally, as dormant stages are a state of suspended activity, events occurring during the dormant stage are not thought to affect the organism in post-dormant cells. Intriguingly, I find that the stress experienced during dormancy and the duration the organism stays in dormancy is ‘remembered’ and can affect post-dormancy recovery in both yeast and killifish. This phenomenon is evidenced by changes in gene expression profiles. I also observe subtle differences in stress resistance and chronological lifespan in germinates from stressed or old spores. This is exhibited by a type of hormesis where sub-lethal stress during dormancy might confer a slight lifespan extension in the post-dormant state. These new insights transform our understanding of “dormant states” and the implication of dormancy to post-dormancy stress survival and ageing

    Proteo-Transcriptomic Dynamics of Cellular Response to HIV-1 Infection.

    Get PDF
    Throughout the HIV-1 replication cycle, complex host-pathogen interactions take place in the infected cell, leading to the production of new virions. The virus modulates the host cellular machinery in order to support its life cycle, while counteracting intracellular defense mechanisms. We investigated the dynamic host response to HIV-1 infection by systematically measuring transcriptomic, proteomic, and phosphoproteomic expression changes in infected and uninfected SupT1 CD4+ T cells at five time points of the viral replication process. By means of a Gaussian mixed-effects model implemented in the new R/Bioconductor package TMixClust, we clustered host genes based on their temporal expression patterns. We identified a proteo-transcriptomic gene expression signature of 388 host genes specific for HIV-1 replication. Comprehensive functional analyses of these genes confirmed the previously described roles of some of the genes and revealed novel key virus-host interactions affecting multiple molecular processes within the host cell, including signal transduction, metabolism, cell cycle, and immune system. The results of our analysis are accessible through a freely available, dedicated and user-friendly R/Shiny application, called PEACHi2.0. This resource constitutes a catalogue of dynamic host responses to HIV-1 infection that provides a basis for a more comprehensive understanding of virus-host interactions

    Proteomic and transcriptomic changes in hibernating grizzly bears reveal metabolic and signaling pathways that protect against muscle atrophy

    Get PDF
    Muscle atrophy is a physiological response to disuse and malnutrition, but hibernating bears are largely resistant to this phenomenon. Unlike other mammals, they efficiently reabsorb amino acids from urine, periodically activate muscle contraction, and their adipocytes differentially responds to insulin. The contribution of myocytes to the reduced atrophy remains largely unknown. Here we show how metabolism and atrophy signaling are regulated in skeletal muscle of hibernating grizzly bear. Metabolic modeling of proteomic changes suggests an autonomous increase of non-essential amino acids (NEAA) in muscle and treatment of differentiated myoblasts with NEAA is sufficient to induce hypertrophy. Our comparison of gene expression in hibernation versus muscle atrophy identified several genes differentially regulated during hibernation, including Pdk4 and Serpinf1. Their trophic effects extend to myoblasts from non-hibernating species (including C. elegans), as documented by a knockdown approach. Together, these changes reflect evolutionary favored adaptations that, once translated to the clinics, could help improve atrophy treatment
    corecore