10 research outputs found

    Assessing the functional structure of genomic data

    Get PDF
    Motivation: The availability of genome-scale data has enabled an abundance of novel analysis techniques for investigating a variety of systems-level biological relationships. As thousands of such datasets become available, they provide an opportunity to study high-level associations between cellular pathways and processes. This also allows the exploration of shared functional enrichments between diverse biological datasets, and it serves to direct experimenters to areas of low data coverage or with high probability of new discoveries

    Systematic Planning of Genome-Scale Experiments in Poorly Studied Species

    Get PDF
    Genome-scale datasets have been used extensively in model organisms to screen for specific candidates or to predict functions for uncharacterized genes. However, despite the availability of extensive knowledge in model organisms, the planning of genome-scale experiments in poorly studied species is still based on the intuition of experts or heuristic trials. We propose that computational and systematic approaches can be applied to drive the experiment planning process in poorly studied species based on available data and knowledge in closely related model organisms. In this paper, we suggest a computational strategy for recommending genome-scale experiments based on their capability to interrogate diverse biological processes to enable protein function assignment. To this end, we use the data-rich functional genomics compendium of the model organism to quantify the accuracy of each dataset in predicting each specific biological process and the overlap in such coverage between different datasets. Our approach uses an optimized combination of these quantifications to recommend an ordered list of experiments for accurately annotating most proteins in the poorly studied related organisms to most biological processes, as well as a set of experiments that target each specific biological process. The effectiveness of this experiment- planning system is demonstrated for two related yeast species: the model organism Saccharomyces cerevisiae and the comparatively poorly studied Saccharomyces bayanus. Our system recommended a set of S. bayanus experiments based on an S. cerevisiae microarray data compendium. In silico evaluations estimate that less than 10% of the experiments could achieve similar functional coverage to the whole microarray compendium. This estimation was confirmed by performing the recommended experiments in S. bayanus, therefore significantly reducing the labor devoted to characterize the poorly studied genome. This experiment-planning framework could readily be adapted to the design of other types of large-scale experiments as well as other groups of organisms

    Minimal components of the RNA polymerase II transcription apparatus determine the consensus TATA box

    Get PDF
    In Saccharomyces cerevisiae, multiple approaches have arrived at a consensus TATA box sequence of TATA(T/A)A(A/T)(A/G). TATA-binding protein (TBP) affinity alone does not determine TATA box function. To discover how a minimal set of factors required for basal and activated transcription contributed to the sequence requirements for a functional TATA box, we performed transcription reactions using highly purified proteins and CYC1 promoter TATA box mutants. The TATA box consensus sequence is a good predictor of promoter activity. However, several nonconsensus sequences are almost fully functional, indicating that mechanistic requirements are not the only selective pressure on the TATA box. We also found that the effect of a mutation at a certain position is often dependent on other bases within a particular TATA box. Although activators and coactivators strongly influence TBP recruitment and stability at promoters, neither Mediator, the activator Gal4-V16, nor TFIID specifically compensate for the low transcription levels of the weak TATA boxes. The addition of Mediator to purified transcription reactions did, however, increase the functional selectivity for certain consensus TATA sequences. Transcription in whole-cell extracts or in vivo with these TATA box mutants indicated that factors, other than those in our purified system, may help initiate transcription from weak TATA boxes

    Inferring Condition-Specific Modulation of Transcription Factor Activity in Yeast through Regulon-Based Analysis of Genomewide Expression

    Get PDF
    Background: A key goal of systems biology is to understand how genomewide mRNA expression levels are controlled by transcription factors (TFs) in a condition-specific fashion. TF activity is frequently modulated at the post-translational level through ligand binding, covalent modification, or changes in sub-cellular localization. In this paper, we demonstrate how prior information about regulatory network connectivity can be exploited to infer condition-specific TF activity as a hidden variable from the genomewide mRNA expression pattern in the yeast Saccharomyces cerevisiae. Methodology/Principal Findings: We first validate experimentally that by scoring differential expression at the level of gene sets or "regulons" comprised of the putative targets of a TF, we can accurately predict modulation of TF activity at the post-translational level. Next, we create an interactive database of inferred activities for a large number of TFs across a large number of experimental conditions in S. cerevisiae. This allows us to perform TF-centric analysis of the yeast regulatory network. Conclusions/Significance: We analyze the degree to which the mRNA expression level of each TF is predictive of its regulatory activity. We also organize TFs into "co-modulation networks" based on their inferred activity profile across conditions, and find that this reveals functional and mechanistic relationships. Finally, we present evidence that the PAC and rRPE motifs antagonize TBP-dependent regulation, and function as core promoter elements governed by the transcription regulator NC2. Regulon-based monitoring of TF activity modulation is a powerful tool for analyzing regulatory network function that should be applicable in other organisms. Tools and results are available online at http://bussemakerlab.org/RegulonProfiler/

    Genome-Wide Transcriptional Dependence on Conserved Regions of Mot1 ▿

    No full text
    TATA binding protein (TBP) plays a central role in transcription complex assembly and is regulated by a variety of transcription factors, including Mot1. Mot1 is an essential protein in Saccharomyces cerevisiae that exerts both negative and positive effects on transcription via interactions with TBP. It contains two conserved regions important for TBP interactions, another conserved region that hydrolyzes ATP to remove TBP from DNA, and a fourth conserved region with unknown function. Whether these regions contribute equally to transcriptional regulation genome-wide is unknown. Here, we employ a transient-replacement assay using deletion derivatives in the conserved regions of Mot1 to investigate their contributions to gene regulation throughout the S. cerevisiae genome. These four regions of Mot1 are essential for growth and are generally required for all Mot1-regulated genes. Loss of the ATPase region, but not other conserved regions, caused TBP to redistribute away from a subset of Mot1-inhibited genes, leading to decreased expression of those genes. A corresponding increase in TBP occupancy and expression occurred at another set of genes that are normally Mot1 independent. The data suggest that Mot1 uses ATP hydrolysis to redistribute accessible TBP away from intrinsically preferred sites to other sites of intrinsically low preference

    Anti-Aging Medicine LiteratureWatch

    No full text
    corecore