52 research outputs found

    Visualizing dimensionality reduction of systems biology data

    Full text link
    One of the challenges in analyzing high-dimensional expression data is the detection of important biological signals. A common approach is to apply a dimension reduction method, such as principal component analysis. Typically, after application of such a method the data is projected and visualized in the new coordinate system, using scatter plots or profile plots. These methods provide good results if the data have certain properties which become visible in the new coordinate system and which were hard to detect in the original coordinate system. Often however, the application of only one method does not suffice to capture all important signals. Therefore several methods addressing different aspects of the data need to be applied. We have developed a framework for linear and non-linear dimension reduction methods within our visual analytics pipeline SpRay. This includes measures that assist the interpretation of the factorization result. Different visualizations of these measures can be combined with functional annotations that support the interpretation of the results. We show an application to high-resolution time series microarray data in the antibiotic-producing organism Streptomyces coelicolor as well as to microarray data measuring expression of cells with normal karyotype and cells with trisomies of human chromosomes 13 and 21

    Mayday SeaSight: Combined Analysis of Deep Sequencing and Microarray Data

    Get PDF
    Recently emerged deep sequencing technologies offer new high-throughput methods to quantify gene expression, epigenetic modifications and DNA-protein binding. From a computational point of view, the data is very different from that produced by the already established microarray technology, providing a new perspective on the samples under study and complementing microarray gene expression data. Software offering the integrated analysis of data from different technologies is of growing importance as new data emerge in systems biology studies. Mayday is an extensible platform for visual data exploration and interactive analysis and provides many methods for dissecting complex transcriptome datasets. We present Mayday SeaSight, an extension that allows to integrate data from different platforms such as deep sequencing and microarrays. It offers methods for computing expression values from mapped reads and raw microarray data, background correction and normalization and linking microarray probes to genomic coordinates. It is now possible to use Mayday's wealth of methods to analyze sequencing data and to combine data from different technologies in one analysis

    An eQTL biological data visualization challenge and approaches from the visualization community

    Get PDF
    In 2011, the IEEE VisWeek conferences inaugurated a symposium on Biological Data Visualization. Like other domain-oriented Vis symposia, this symposium's purpose was to explore the unique characteristics and requirements of visualization within the domain, and to enhance both the Visualization and Bio/Life-Sciences communities by pushing Biological data sets and domain understanding into the Visualization community, and well-informed Visualization solutions back to the Biological community. Amongst several other activities, the BioVis symposium created a data analysis and visualization contest. Unlike many contests in other venues, where the purpose is primarily to allow entrants to demonstrate tour-de-force programming skills on sample problems with known solutions, the BioVis contest was intended to whet the participants' appetites for a tremendously challenging biological domain, and simultaneously produce viable tools for a biological grand challenge domain with no extant solutions. For this purpose expression Quantitative Trait Locus (eQTL) data analysis was selected. In the BioVis 2011 contest, we provided contestants with a synthetic eQTL data set containing real biological variation, as well as a spiked-in gene expression interaction network influenced by single nucleotide polymorphism (SNP) DNA variation and a hypothetical disease model. Contestants were asked to elucidate the pattern of SNPs and interactions that predicted an individual's disease state. 9 teams competed in the contest using a mixture of methods, some analytical and others through visual exploratory methods. Independent panels of visualization and biological experts judged entries. Awards were given for each panel's favorite entry, and an overall best entry agreed upon by both panels. Three special mention awards were given for particularly innovative and useful aspects of those entries. And further recognition was given to entries that correctly answered a bonus question about how a proposed "gene therapy" change to a SNP might change an individual's disease status, which served as a calibration for each approaches' applicability to a typical domain question. In the future, BioVis will continue the data analysis and visualization contest, maintaining the philosophy of providing new challenging questions in open-ended and dramatically underserved Bio/Life Sciences domains

    nocoRNAc: Characterization of non-coding RNAs in prokaryotes

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The interest in non-coding RNAs (ncRNAs) constantly rose during the past few years because of the wide spectrum of biological processes in which they are involved. This led to the discovery of numerous ncRNA genes across many species. However, for most organisms the non-coding transcriptome still remains unexplored to a great extent. Various experimental techniques for the identification of ncRNA transcripts are available, but as these methods are costly and time-consuming, there is a need for computational methods that allow the detection of functional RNAs in complete genomes in order to suggest elements for further experiments. Several programs for the genome-wide prediction of functional RNAs have been developed but most of them predict a genomic locus with no indication whether the element is transcribed or not.</p> <p>Results</p> <p>We present <smcaps>NOCO</smcaps>RNAc, a program for the genome-wide prediction of ncRNA transcripts in bacteria. <smcaps>NOCO</smcaps>RNAc incorporates various procedures for the detection of transcriptional features which are then integrated with functional ncRNA loci to determine the transcript coordinates. We applied RNAz and <smcaps>NOCO</smcaps>RNAc to the genome of <it>Streptomyces coelicolor </it>and detected more than 800 putative ncRNA transcripts most of them located antisense to protein-coding regions. Using a custom design microarray we profiled the expression of about 400 of these elements and found more than 300 to be transcribed, 38 of them are predicted novel ncRNA genes in intergenic regions. The expression patterns of many ncRNAs are similarly complex as those of the protein-coding genes, in particular many antisense ncRNAs show a high expression correlation with their protein-coding partner.</p> <p>Conclusions</p> <p>We have developed <smcaps>NOCO</smcaps>RNAc, a framework that facilitates the automated characterization of functional ncRNAs. <smcaps>NOCO</smcaps>RNAc increases the confidence of predicted ncRNA loci, especially if they contain transcribed ncRNAs. <smcaps>NOCO</smcaps>RNAc is not restricted to intergenic regions, but it is applicable to the prediction of ncRNA transcripts in whole microbial genomes. The software as well as a user guide and example data is available at <url>http://www.zbit.uni-tuebingen.de/pas/nocornac.htm</url>.</p

    CAXII Is a Sero-Diagnostic Marker for Lung Cancer

    Get PDF
    To develop sero-diagnostic markers for lung cancer, we generated monoclonal antibodies using pulmonary adenocarcinoma (AD)-derived A549 cells as antigens by employing the random immunization method. Hybridoma supernatants were immunohistochemically screened for antibodies with AMeX-fixed and paraffin-embedded A549 cell preparations. Positive clones were monocloned twice through limiting dilutions. From the obtained monoclonal antibodies, we selected an antibody designated as KU-Lu-5 which showed intense membrane staining of A549 cells. Based on immunoprecipitation and MADLI TOF/TOF-MS analysis, this antibody was recognized as carbonic anhydrase XII (CAXII). To evaluate the utility of this antibody as a sero-diagnostic marker for lung cancer, we performed dot blot analysis with a training set consisting of sera from 70 lung cancer patients and 30 healthy controls. The CAXII expression levels were significantly higher in lung cancer patients than in healthy controls in the training set (P<0.0001), and the area under the curve of ROC was 0.794, with 70.0% specificity and 82.9% sensitivity. In lung cancers, expression levels of CAXII were significantly higher in patients with squamous cell carcinoma (SCC) than with AD (P = 0.035). Furthermore, CAXII was significantly higher in well- and moderately differentiated SCCs than in poorly differentiated ones (P = 0.027). To further confirm the utility of serum CAXII levels as a sero-diagnostic marker, an additional set consisting of sera from 26 lung cancer patients and 30 healthy controls was also investigated by dot blot analysis as a validation study. Serum CAXII levels were also significantly higher in lung cancer patients than in healthy controls in the validation set (P = 0.030). Thus, the serum CAXII levels should be applicable markers discriminating lung cancer patients from healthy controls. To our knowledge, this is the first report providing evidence that CAXII may be a novel sero-diagnostic marker for lung cancer

    Two Lysines in the Forkhead Domain of Foxp3 Are Key to T Regulatory Cell Function

    Get PDF
    Background: The forkhead box transcription factor, Foxp3, is master regulator of the development and function of CD4+CD25+ T regulatory (Treg) cells that limit autoimmunity and maintain immune homeostasis. The carboxyl-terminal forkhead (FKH) domain is required for the nuclear localization and DNA binding of Foxp3. We assessed how individual FKH lysines contribute to the functions of Foxp3 in Treg cells. Methodology/Principal Findings: We found that mutation of FKH lysines at position 382 (K17) and at position 393 (K18) impaired Foxp3 DNA binding and inhibited Treg suppressive function in vivo and in vitro. These lysine mutations did not affect the level of expression of Foxp3 but inhibited IL-2 promoter remodeling and had important and differing effects on Treg-associated gene expression. Conclusions/Significance: These data point to complex effects of post-translational modifications at individual lysines within the Foxp3 FKH domain that affect Treg function. Modulation of these events using small molecule inhibitors ma

    Immune monitoring and TCR sequencing of CD4 T cells in a long term responsive patient with metastasized pancreatic ductal carcinoma treated with individualized, neoepitope-derived multipeptide vaccines : a case report

    Get PDF
    Abstract Background Cancer vaccines can effectively establish clinically relevant tumor immunity. Novel sequencing approaches rapidly identify the mutational fingerprint of tumors, thus allowing to generate personalized tumor vaccines within a few weeks from diagnosis. Here, we report the case of a 62-year-old patient receiving a four-peptide-vaccine targeting the two sole mutations of his pancreatic tumor, identified via exome sequencing. Methods Vaccination started during chemotherapy in second complete remission and continued monthly thereafter. We tracked IFN-γ+ T cell responses against vaccine peptides in peripheral blood after 12, 17 and 34 vaccinations by analyzing T-cell receptor (TCR) repertoire diversity and epitope-binding regions of peptide-reactive T-cell lines and clones. By restricting analysis to sorted IFN-γ-producing T cells we could assure epitope-specificity, functionality, and TH1 polarization. Results A peptide-specific T-cell response against three of the four vaccine peptides could be detected sequentially. Molecular TCR analysis revealed a broad vaccine-reactive TCR repertoire with clones of discernible specificity. Four identical or convergent TCR sequences could be identified at more than one time-point, indicating timely persistence of vaccine-reactive T cells. One dominant TCR expressing a dual TCRVα chain could be found in three T-cell clones. The observed T-cell responses possibly contributed to clinical outcome: The patient is alive 6 years after initial diagnosis and in complete remission for 4 years now. Conclusions Therapeutic vaccination with a neoantigen-derived four-peptide vaccine resulted in a diverse and long-lasting immune response against these targets which was associated with prolonged clinical remission. These data warrant confirmation in a larger proof-of concept clinical trial

    Characterization of the cork oak transcriptome dynamics during acorn development

    Get PDF
    Background: Cork oak (Quercus suber L.) has a natural distribution across western Mediterranean regions and is a keystone forest tree species in these ecosystems. The fruiting phase is especially critical for its regeneration but the molecular mechanisms underlying the biochemical and physiological changes during cork oak acorn development are poorly understood. In this study, the transcriptome of the cork oak acorn, including the seed, was characterized in five stages of development, from early development to acorn maturation, to identify the dominant processes in each stage and reveal transcripts with important functions in gene expression regulation and response to water. Results: A total of 80,357 expressed sequence tags (ESTs) were de novo assembled from RNA-Seq libraries representative of the several acorn developmental stages. Approximately 7.6 % of the total number of transcripts present in Q. suber transcriptome was identified as acorn specific. The analysis of expression profiles during development returned 2,285 differentially expressed (DE) transcripts, which were clustered into six groups. The stage of development corresponding to the mature acorn exhibited an expression profile markedly different from other stages. Approximately 22 % of the DE transcripts putatively code for transcription factors (TF) or transcriptional regulators, and were found almost equally distributed among the several expression profile clusters, highlighting their major roles in controlling the whole developmental process. On the other hand, carbohydrate metabolism, the biological pathway most represented during acorn development, was especially prevalent in mid to late stages as evidenced by enrichment analysis. We further show that genes related to response to water, water deprivation and transport were mostly represented during the early (S2) and the last stage (S8) of acorn development, when tolerance to water desiccation is possibly critical for acorn viability. Conclusions: To our knowledge this work represents the first report of acorn development transcriptomics in oaks. The obtained results provide novel insights into the developmental biology of cork oak acorns, highlighting transcripts putatively involved in the regulation of the gene expression program and in specific processes likely essential for adaptation. It is expected that this knowledge can be transferred to other oak species of great ecological value.Fundação para a CiĂȘncia e a Tecnologi

    Exosome removal as a therapeutic adjuvant in cancer

    Get PDF
    • 

    corecore