11,592 research outputs found

    Scaling laws in bacterial genomes: A side-effect of selection of mutational robustness?

    Get PDF
    In the past few years, numerous research projects have focused on identifying and understanding scaling properties in the gene content of prokaryote genomes and the intricacy of their regulation networks. Yet, and despite the increasing amount of data available, the origins of these scalings remain an open question. The RAevol model, a digital genetics model, provides us with an insight into the mechanisms involved in an evolutionary process. The results we present here show that (i) our model reproduces qualitatively these scaling laws and that (ii) these laws are not due to differences in lifestyles but to differences in the spontaneous rates of mutations and rearrangements. We argue that this is due to an indirect selective pressure for robustness that constrains the genome size

    Biases in the Experimental Annotations of Protein Function and their Effect on Our Understanding of Protein Function Space

    Get PDF
    The ongoing functional annotation of proteins relies upon the work of curators to capture experimental findings from scientific literature and apply them to protein sequence and structure data. However, with the increasing use of high-throughput experimental assays, a small number of experimental studies dominate the functional protein annotations collected in databases. Here we investigate just how prevalent is the "few articles -- many proteins" phenomenon. We examine the experimentally validated annotation of proteins provided by several groups in the GO Consortium, and show that the distribution of proteins per published study is exponential, with 0.14% of articles providing the source of annotations for 25% of the proteins in the UniProt-GOA compilation. Since each of the dominant articles describes the use of an assay that can find only one function or a small group of functions, this leads to substantial biases in what we know about the function of many proteins. Mass-spectrometry, microscopy and RNAi experiments dominate high throughput experiments. Consequently, the functional information derived from these experiments is mostly of the subcellular location of proteins, and of the participation of proteins in embryonic developmental pathways. For some organisms, the information provided by different studies overlap by a large amount. We also show that the information provided by high throughput experiments is less specific than those provided by low throughput experiments. Given the experimental techniques available, certain biases in protein function annotation due to high-throughput experiments are unavoidable. Knowing that these biases exist and understanding their characteristics and extent is important for database curators, developers of function annotation programs, and anyone who uses protein function annotation data to plan experiments.Comment: Accepted to PLoS Computational Biology. Press embargo applies. v4: text corrected for style and supplementary material inserte

    Trisomy 21 alters DNA methylation in parent-of-origin-dependent and independent manners

    Get PDF
    The supernumerary chromosome 21 in Down syndrome differentially affects the methylation statuses at CpG dinucleotide sites and creates genome-wide transcriptional dysregulation of parental alleles, ultimately causing diverse pathologies. At present, it is unknown whether those effects are dependent or independent of the parental origin of the nondis-joined chromosome 21. Linkage analysis is a standard method for the determination of the parental origin of this aneuploidy, although it is inadequate in cases with deficiency of samples from the progenitors. Here, we assessed the reliability of the epigenetic 5(m)CpG imprints resulting in the maternally (oocyte)-derived allele methylation at a differentially methylated region (DMR) of the candidate imprinted WRB gene for asserting the parental origin of chromosome 21. We developed a methylation-sensitive restriction enzyme-specific PCR assay, based on the WRB DMR, across single nucleotide polymorphisms (SNPs) to examine the methylation statuses in the parental alleles. In genomic DNA from blood cells of either disomic or trisomic subjects, the maternal alleles were consistently methylated, while the paternal alleles were unmethylated. However, the supernumerary chromosome 21 did alter the methylation patterns at the RUNX1 (chromosome 21) and TMEM131 (chromosome 2) CpG sites in a parent-of-origin-independent manner. To evaluate the 5(m)CpG imprints, we conducted a computational comparative epigenomic analysis of transcriptome RNA sequencing (RNA-Seq) and histone modification expression patterns. We found allele fractions consistent with the transcriptional biallelic expression of WRB and ten neighboring genes, despite the similarities in the confluence of both a 17-histone modification activation backbone module and a 5-histone modification repressive module between the WRB DMR and the DMRs of six imprinted genes. We concluded that the maternally inherited 5(m)CpG imprints at the WRB DMR are uncoupled from the parental allele expression of WRB and ten neighboring genes in several tissues and that trisomy 21 alters DNA methylation in parent-of-origin-dependent and -independent manners

    Transcriptional Regulation: a Genomic Overview

    Get PDF
    The availability of the Arabidopsis thaliana genome sequence allows a comprehensive analysis of transcriptional regulation in plants using novel genomic approaches and methodologies. Such a genomic view of transcription first necessitates the compilation of lists of elements. Transcription factors are the most numerous of the different types of proteins involved in transcription in eukaryotes, and the Arabidopsis genome codes for more than 1,500 of them, or approximately 6% of its total number of genes. A genome-wide comparison of transcription factors across the three eukaryotic kingdoms reveals the evolutionary generation of diversity in the components of the regulatory machinery of transcription. However, as illustrated by Arabidopsis, transcription in plants follows similar basic principles and logic to those in animals and fungi. A global view and understanding of transcription at a cellular and organismal level requires the characterization of the Arabidopsis transcriptome and promoterome, as well as of the interactome, the localizome, and the phenome of the proteins involved in transcription

    Domain Model Explains Propagation Dynamics and Stability of Histone H3K27 and H3K36 Methylation Landscapes

    Get PDF
    Chromatin states must be maintained during cell proliferation to uphold cellular identity and genome integrity. Inheritance of histone modifications is central in this process. However, the histone modification landscape is challenged by incorporation of new unmodified histones during each cell cycle, and the principles governing heritability remain unclear. We take a quantitative computational modeling approach to describe propagation of histone H3K27 and H3K36 methylation states. We measure combinatorial H3K27 and H3K36 methylation patterns by quantitative mass spectrometry on subsequent generations of histones. Using model comparison, we reject active global demethylation and invoke the existence of domains defined by distinct methylation endpoints. We find that H3K27me3 on pre-existing histones stimulates the rate of de novo H3K27me3 establishment, supporting a read-write mechanism in timely chromatin restoration. Finally, we provide a detailed quantitative picture of the mutual antagonism between H3K27 and H3K36 methylation and propose that it stabilizes epigenetic states across cell division
    corecore