6 research outputs found

    GOrilla: a tool for discovery and visualization of enriched GO terms in ranked gene lists

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Since the inception of the GO annotation project, a variety of tools have been developed that support exploring and searching the GO database. In particular, a variety of tools that perform GO enrichment analysis are currently available. Most of these tools require as input a target set of genes and a background set and seek enrichment in the target set compared to the background set. A few tools also exist that support analyzing ranked lists. The latter typically rely on simulations or on union-bound correction for assigning statistical significance to the results.</p> <p>Results</p> <p><it>GOrilla </it>is a web-based application that identifies enriched GO terms in ranked lists of genes, without requiring the user to provide explicit target and background sets. This is particularly useful in many typical cases where genomic data may be naturally represented as a ranked list of genes (e.g. by level of expression or of differential expression). <it>GOrilla </it>employs a flexible threshold statistical approach to discover GO terms that are significantly enriched at the <it>top </it>of a ranked gene list. Building on a complete theoretical characterization of the underlying distribution, called mHG, <it>GOrilla </it>computes an exact p-value for the observed enrichment, taking threshold multiple testing into account without the need for simulations. This enables rigorous statistical analysis of thousand of genes and thousands of GO terms in order of seconds. The output of the enrichment analysis is visualized as a hierarchical structure, providing a clear view of the relations between enriched GO terms.</p> <p>Conclusion</p> <p><it>GOrilla </it>is an efficient GO analysis tool with unique features that make a useful addition to the existing repertoire of GO enrichment tools. <it>GOrilla</it>'s unique features and advantages over other threshold free enrichment tools include rigorous statistics, fast running time and an effective graphical representation. <it>GOrilla </it>is publicly available at: <url>http://cbl-gorilla.cs.technion.ac.il</url></p

    Observational multi-centre, prospective study to characterize novel pathogen-and host-related factors in hospitalized patients with lower respiratory tract infections and/or sepsis - the "TAILORED-Treatment" study

    Get PDF
    Background: The emergence and spread of antibiotic resistant micro-organisms is a global concern, which is largely attributable to inaccurate prescribing of antibiotics to patients presenting with non-bacterial infections. The use of 'omics' technologies for discovery of novel infection related biomarkers combined with novel treatment algorithms offers possibilities for rapidly distinguishing between bacterial and viral infections. This distinction can be particularly important for patients suffering from lower respiratory tract infections (LRTI) and/or sepsis as they represent a significant burden to healthcare systems. Here we present the study details of the TAILORED-Treatment study, an observational, prospective, multi-centre study aiming to generate a multi-parametric model, combining host and pathogen data, for distinguishing between bacterial and viral aetiologies in children and adults with LRTI and/or sepsis. Methods: A total number of 1200 paediatric and adult patients aged 1month and older with LRTI and/or sepsis or a non-infectious disease are recruited from Emergency Departments and hospital wards of seven Dutch and Israeli medical centres. A panel of three experienced physicians adjudicate a reference standard diagnosis for all patients (i.e., bacterial or viral infection) using all available clinical and laboratory information, including a 28-day follow-up assessment. Nasal swabs and blood samples are collected for multi-omics investigations including host RNA and protein biomarkers, nasal microbiota profiling, host genomic profiling and bacterial proteomics. Simplified data is entered into a custom-built database in order to develop a multi-parametric model and diagnostic tools fo

    Divergence in transcriptional and regulatory responses to mating in male and female fruitflies

    Get PDF
    Mating induces extensive physiological, biochemical and behavioural changes in female animals of many taxa. In contrast, the overall phenotypic and transcriptomic consequences of mating for males, hence how they might differ from those of females, are poorly described. Post mating responses in each sex are rapidly initiated, predicting the existence of regulatory mechanisms in addition to transcriptional responses involving de novo gene expression. That post mating responses appear different for each sex also predicts that the genome-wide signatures of mating should show evidence of sex-specific specialisation. In this study, we used high resolution RNA sequencing to provide the first direct comparisons of the transcriptomic responses of male and female Drosophila to mating, and the first comparison of mating-responsive miRNAs in both sexes in any species. As predicted, the results revealed the existence of sex- and body part-specific mRNA and miRNA expression profiles. More genes were differentially expressed in the female head-thorax than the abdomen following mating, whereas the opposite was true in males. Indeed, the transcriptional profile of male head-thorax tissue was largely unaffected by mating, and no differentially expressed genes were detected at the most stringent significance threshold. A subset of ribosomal genes in females were differentially expressed in both body parts, but in opposite directions, consistent with the existence of body part-specific resource allocation switching. Novel, mating-responsive miRNAs in each sex were also identified, and a miRNA-mRNA interactions analysis revealed putative targets among mating-responsive genes. We show that the structure of genome-wide responses by each sex to mating is strongly divergent, and provide new insights into how shared genomes can achieve characteristic distinctiveness

    A Feature-Based Approach to Modeling Protein–DNA Interactions

    Get PDF
    Transcription factor (TF) binding to its DNA target site is a fundamental regulatory interaction. The most common model used to represent TF binding specificities is a position specific scoring matrix (PSSM), which assumes independence between binding positions. However, in many cases, this simplifying assumption does not hold. Here, we present feature motif models (FMMs), a novel probabilistic method for modeling TF–DNA interactions, based on log-linear models. Our approach uses sequence features to represent TF binding specificities, where each feature may span multiple positions. We develop the mathematical formulation of our model and devise an algorithm for learning its structural features from binding site data. We also developed a discriminative motif finder, which discovers de novo FMMs that are enriched in target sets of sequences compared to background sets. We evaluate our approach on synthetic data and on the widely used TF chromatin immunoprecipitation (ChIP) dataset of Harbison et al. We then apply our algorithm to high-throughput TF ChIP data from mouse and human, reveal sequence features that are present in the binding specificities of mouse and human TFs, and show that FMMs explain TF binding significantly better than PSSMs. Our FMM learning and motif finder software are available at http://genie.weizmann.ac.il/
    corecore