59 research outputs found

    Of gene expression and cell division time: a mathematical framework for advanced differential gene expression and data analysis

    Get PDF
    Estimating fold changes of average mRNA and protein molecule counts per cell is the most common way to perform differential expression analysis. However, these gene expression data may be affected by cell division, an often-neglected phenomenon. Here, we develop a quantitative framework that links population-based mRNA and protein measurements to rates of gene expression in single cells undergoing cell division. The equations we derive are easy-to-use and widely robust against biological variability. They integrate multiple "omics" data into a coherent, quantitative description of single-cell gene expression and improve analysis when comparing systems or states with different cell division times. We explore these ideas in the context of resting versus activated B cells. Analyzing differences in protein synthesis rates enables to account for differences in cell division times. We demonstrate that this improves the resolution and hit rate of differential gene expression analysis when compared to analyzing population protein abundances alone

    Large-scale literature mining to assess the relation between anti-cancer drugs and cancer types

    Get PDF
    Background:There is a huge body of scientific literature describing the relation between tumor types and anti-cancer drugs. The vast amount of scientific literature makes it impossible for researchers and physicians to extract all relevant information manually.Methods:In order to cope with the large amount of literature we applied an automated text mining approach to assess the relations between 30 most frequent cancer types and 270 anti-cancer drugs. We applied two different approaches, a classical text mining based on named entity recognition and an AI-based approach employing word embeddings. The consistency of literature mining results was validated with 3 independent methods: first, using data from FDA approvals, second, using experimentally measured IC-50 cell line data and third, using clinical patient survival data.Results:We demonstrated that the automated text mining was able to successfully assess the relation between cancer types and anti-cancer drugs. All validation methods showed a good correspondence between the results from literature mining and independent confirmatory approaches. The relation between most frequent cancer types and drugs employed for their treatment were visualized in a large heatmap. All results are accessible in an interactive web-based knowledge base using the following link: https://knowledgebase.microdiscovery.de/heatmap.Conclusions:Our approach is able to assess the relations between compounds and cancer types in an automated manner. Both, cancer types and compounds could be grouped into different clusters. Researchers can use the inter-active knowledge base to inspect the presented results and follow their own research questions, for example the identification of novel indication areas for known drugs

    Differential binding studies applying functional protein microarrays and surface plasmon resonance

    Get PDF
    A variety of different in vivo and in vitro technologies provide comprehensive insights in protein-protein interaction networks. Here we demonstrate a novel approach to analyze, verify and quantify putative interactions between two members of the S100 protein family and 80 recombinant proteins derived from a proteome-wide protein expression library. Surface plasmon resonance (SPR) using Biacore technology and functional protein microarrays were used as two independent methods to study protein-protein interactions. With this combined approach we were able to detect nine calcium-dependent interactions between Arg-Gly-Ser-(RGS)-His6 tagged proteins derived from the library and GST-tagged S100B and S100A6, respectively. For the protein microarray affinity-purified proteins from the expression library were spotted onto modified glass slides and probed with the S100 proteins. SPR experiments were performed in the same setup and in a vice-versa approach reversing analytes and ligands to determine distinct association and dissociation patterns of each positive interaction. Besides already known interaction partners, several novel binders were found independently with both detection methods, albeit analogous immobilization strategies had to be applied in both assays

    High-throughput and single-cell imaging of NF-kappaB oscillations using monoclonal cell lines

    Get PDF
    BACKGROUND: The nuclear factor-κB (NF-κB) family of transcription factors plays a role in a wide range of cellular processes including the immune response and cellular growth. In addition, deregulation of the NF-κB system has been associated with a number of disease states, including cancer. Therefore, insight into the regulation of NF-κB activation has crucial medical relevance, holding promise for novel drug target discovery. Transcription of NF-κB-induced genes is regulated by differential dynamics of single NF-κB subunits, but only a few methods are currently being applied to study dynamics. In particular, while oscillations of NF-κB activation have been observed in response to the cytokine tumor necrosis factor α (TNFα), little is known about the occurrence of oscillations in response to bacterial infections. RESULTS: To quantitatively assess NF-κB dynamics we generated human and murine monoclonal cell lines that stably express the NF-κB subunit p65 fused to GFP. Furthermore, a high-throughput assay based on automated microscopy coupled to image analysis to quantify p65-nuclear translocation was established. Using this assay, we demonstrate a stimulus- and cell line-specific temporal control of p65 translocation, revealing, for the first time, oscillations of p65 translocation in response to bacterial infection. Oscillations were detected at the single-cell level using real-time microscopy as well as at the population level using high-throughput image analysis. In addition, mathematical modeling of NF-κB dynamics during bacterial infections predicted masking of oscillations on the population level in asynchronous activations, which was experimentally confirmed. CONCLUSIONS: Taken together, this simple and cost effective assay constitutes an integrated approach to infer the dynamics of NF-κB kinetics in single cells and cell populations. Using a single system, novel factors modulating NF-κB can be identified and analyzed, providing new possibilities for a wide range of applications from therapeutic discovery and understanding of disease to host-pathogen interactions

    Protein structure search and local structure characterization

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Structural similarities among proteins can provide valuable insight into their functional mechanisms and relationships. As the number of available three-dimensional (3D) protein structures increases, a greater variety of studies can be conducted with increasing efficiency, among which is the design of protein structural alphabets. Structural alphabets allow us to characterize local structures of proteins and describe the global folding structure of a protein using a one-dimensional (1D) sequence. Thus, 1D sequences can be used to identify structural similarities among proteins using standard sequence alignment tools such as BLAST or FASTA.</p> <p>Results</p> <p>We used self-organizing maps in combination with a minimum spanning tree algorithm to determine the optimum size of a structural alphabet and applied the k-means algorithm to group protein fragnts into clusters. The centroids of these clusters defined the structural alphabet. We also developed a flexible matrix training system to build a substitution matrix (TRISUM-169) for our alphabet. Based on FASTA and using TRISUM-169 as the substitution matrix, we developed the SA-FAST alignment tool. We compared the performance of SA-FAST with that of various search tools in database-scale search tasks and found that SA-FAST was highly competitive in all tests conducted. Further, we evaluated the performance of our structural alphabet in recognizing specific structural domains of EGF and EGF-like proteins. Our method successfully recovered more EGF sub-domains using our structural alphabet than when using other structural alphabets. SA-FAST can be found at <url>http://140.113.166.178/safast/</url>.</p> <p>Conclusion</p> <p>The goal of this project was two-fold. First, we wanted to introduce a modular design pipeline to those who have been working with structural alphabets. Secondly, we wanted to open the door to researchers who have done substantial work in biological sequences but have yet to enter the field of protein structure research. Our experiments showed that by transforming the structural representations from 3D to 1D, several 1D-based tools can be applied to structural analysis, including similarity searches and structural motif finding.</p

    Global quantification of mammalian gene expression control

    Get PDF
    Gene expression is a multistep process that involves the transcription, translation and turnover of messenger RNAs and proteins. Although it is one of the most fundamental processes of life, the entire cascade has never been quantified on a genome-wide scale. Here we simultaneously measured absolute mRNA and protein abundance and turnover by parallel metabolic pulse labelling for more than 5,000 genes in mammalian cells. Whereas mRNA and protein levels correlated better than previously thought, corresponding half-lives showed no correlation. Using a quantitative model we have obtained the first genome-scale prediction of synthesis rates of mRNAs and proteins. We find that the cellular abundance of proteins is predominantly controlled at the level of translation. Genes with similar combinations of mRNA and protein stability shared functional properties, indicating that half-lives evolved under energetic and dynamic constraints. Quantitative information about all stages of gene expression provides a rich resource and helps to provide a greater understanding of the underlying design principles

    A “Crossomics” Study Analysing Variability of Different Components in Peripheral Blood of Healthy Caucasoid Individuals

    Get PDF
    Background: Different immunotherapy approaches for the treatment of cancer and autoimmune diseases are being developed and tested in clinical studies worldwide. Their resulting complex experimental data should be properly evaluated, therefore reliable normal healthy control baseline values are indispensable. Methodology/Principal Findings: To assess intra- and inter-individual variability of various biomarkers, peripheral blood of 16 age and gender equilibrated healthy volunteers was sampled on 3 different days within a period of one month. Complex "crossomics'' analyses of plasma metabolite profiles, antibody concentrations and lymphocyte subset counts as well as whole genome expression profiling in CD4(+)T and NK cells were performed. Some of the observed age, gender and BMI dependences are in agreement with the existing knowledge, like negative correlation between sex hormone levels and age or BMI related increase in lipids and soluble sugars. Thus we can assume that the distribution of all 39.743 analysed markers is well representing the normal Caucasoid population. All lymphocyte subsets, 20% of metabolites and less than 10% of genes, were identified as highly variable in our dataset. Conclusions/Significance: Our study shows that the intra- individual variability was at least two-fold lower compared to the inter-individual one at all investigated levels, showing the importance of personalised medicine approach from yet another perspective
    corecore