114 research outputs found

    ContDist: a tool for the analysis of quantitative gene and promoter properties

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The understanding of how promoter regions regulate gene expression is complicated and far from being fully understood. It is known that histones' regulation of DNA compactness, DNA methylation, transcription factor binding sites and CpG islands play a role in the transcriptional regulation of a gene. Many high-throughput techniques exist nowadays which permit the detection of epigenetic marks and regulatory elements in the promoter regions of thousands of genes. However, so far the subsequent analysis of such experiments (e.g. the resulting gene lists) have been hampered by the fact that currently no tool exists for a detailed analysis of the promoter regions.</p> <p>Results</p> <p>We present ContDist, a tool to statistically analyze quantitative gene and promoter properties. The software includes approximately 200 quantitative features of gene and promoter regions for 7 commonly studied species. In contrast to "traditionally" ontological analysis which only works on qualitative data, all the features in the underlying annotation database are quantitative gene and promoter properties.</p> <p>Utilizing the strong focus on the promoter region of this tool, we show its usefulness in two case studies; the first on differentially methylated promoters and the second on the fundamental differences between housekeeping and tissue specific genes. The two case studies allow both the confirmation of recent findings as well as revealing previously unreported biological relations.</p> <p>Conclusion</p> <p>ContDist is a new tool with two important properties: 1) it has a strong focus on the promoter region which is usually disregarded by virtually all ontology tools and 2) it uses quantitative (continuously distributed) features of the genes and its promoter regions which are not available in any other tool. ContDist is available from <url>http://web.bioinformatics.cicbiogune.es/CD/ContDistribution.php</url></p

    Tight associations between transcription promoter type and epigenetic variation in histone positioning and modification

    Get PDF
    Abstract Background Transcription promoters are fundamental genomic cis-elements controlling gene expression. They can be classified into two types by the degree of imprecision of their transcription start sites: peak promoters, which initiate transcription from a narrow genomic region; and broad promoters, which initiate transcription from a wide-ranging region. Eukaryotic transcription initiation is suggested to be associated with the genomic positions and modifications of nucleosomes. For instance, it has been recently shown that histone with H3K9 acetylation (H3K9ac) is more likely to be distributed around broad promoters rather than peak promoters; it can thus be inferred that there is an association between histone H3K9 and promoter architecture. Results Here, we performed a systematic analysis of transcription promoters and gene expression, as well as of epigenetic histone behaviors, including genomic position, stability within the chromatin, and several modifications. We found that, in humans, broad promoters, but not peak promoters, generally had significant associations with nucleosome positioning and modification. Specifically, around broad promoters histones were highly distributed and aligned in an orderly fashion. This feature was more evident with histones that were methylated or acetylated; moreover, the nucleosome positions around the broad promoters were more stable than those around the peak ones. More strikingly, the overall expression levels of genes associated with broad promoters (but not peak promoters) with modified histones were significantly higher than the levels of genes associated with broad promoters with unmodified histones. Conclusion These results shed light on how epigenetic regulatory networks of histone modifications are associated with promoter architecture

    β€œCandidatus Competibacter”-lineage genomes retrieved from metagenomes reveal functional metabolic diversity

    Get PDF
    The glycogen-accumulating organism (GAO) β€˜Candidatus Competibacter’ (Competibacter) uses aerobically stored glycogen to enable anaerobic carbon uptake, which is subsequently stored as polyhydroxyalkanoates (PHAs). This biphasic metabolism is key for the Competibacter to survive under the cyclic anaerobic-β€˜feast’: aerobic-β€˜famine’ regime of enhanced biological phosphorus removal (EBPR) wastewater treatment systems. As they do not contribute to phosphorus (P) removal, but compete for resources with the polyphosphate-accumulating organisms (PAO), thought responsible for P removal, their proliferation theoretically reduces the EBPR capacity. In this study, two complete genomes from Competibacter were obtained from laboratory-scale enrichment reactors through metagenomics. Phylogenetic analysis identified the two genomes, β€˜Candidatus Competibacter denitrificans’ and β€˜Candidatus Contendobacter odensis’, as being affiliated with Competibacter-lineage subgroups 1 and 5, respectively. Both have genes for glycogen and PHA cycling and for the metabolism of volatile fatty acids. Marked differences were found in their potential for the Embden–Meyerhof–Parnas and Entner–Doudoroff glycolytic pathways, as well as for denitrification, nitrogen fixation, fermentation, trehalose synthesis and utilisation of glucose and lactate. Genetic comparison of P metabolism pathways with sequenced PAOs revealed the absence of the Pit phosphate transporter in the Competibacter-lineage genomesβ€”identifying a key metabolic difference with the PAO physiology. These genomes are the first from any GAO organism and provide new insights into the complex interaction and niche competition between PAOs and GAOs in EBPR systems

    Chromatin Organization in Sperm May Be the Major Functional Consequence of Base Composition Variation in the Human Genome

    Get PDF
    Chromatin in sperm is different from that in other cells, with most of the genome packaged by protamines not nucleosomes. Nucleosomes are, however, retained at some genomic sites, where they have the potential to transmit paternal epigenetic information. It is not understood how this retention is specified. Here we show that base composition is the major determinant of nucleosome retention in human sperm, predicting retention very well in both genic and non-genic regions of the genome. The retention of nucleosomes at GC-rich sequences with high intrinsic nucleosome affinity accounts for the previously reported retention at transcription start sites and at genes that regulate development. It also means that nucleosomes are retained at the start sites of most housekeeping genes. We also report a striking link between the retention of nucleosomes in sperm and the establishment of DNA methylation-free regions in the early embryo. Taken together, this suggests that paternal nucleosome transmission may facilitate robust gene regulation in the early embryo. We propose that chromatin organization in the male germline, rather than in somatic cells, is the major functional consequence of fine-scale base composition variation in the human genome. The selective pressure driving base composition evolution in mammals could, therefore, be the need to transmit paternal epigenetic information to the zygote

    BayesPI - a new model to study protein-DNA interactions: a case study of condition-specific protein binding parameters for Yeast transcription factors

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>We have incorporated Bayesian model regularization with biophysical modeling of protein-DNA interactions, and of genome-wide nucleosome positioning to study protein-DNA interactions, using a high-throughput dataset. The newly developed method (BayesPI) includes the estimation of a transcription factor (TF) binding energy matrices, the computation of binding affinity of a TF target site and the corresponding chemical potential.</p> <p>Results</p> <p>The method was successfully tested on synthetic ChIP-chip datasets, real yeast ChIP-chip experiments. Subsequently, it was used to estimate condition-specific and species-specific protein-DNA interaction for several yeast TFs.</p> <p>Conclusion</p> <p>The results revealed that the modification of the protein binding parameters and the variation of the individual nucleotide affinity in either recognition or flanking sequences occurred under different stresses and in different species. The findings suggest that such modifications may be adaptive and play roles in the formation of the environment-specific binding patterns of yeast TFs and in the divergence of TF binding sites across the related yeast species.</p

    Mechanics of the IL2RA Gene Activation Revealed by Modeling and Atomic Force Microscopy

    Get PDF
    Transcription implies recruitment of RNA polymerase II and transcription factors (TFs) by DNA melting near transcription start site (TSS). Combining atomic force microscopy and computer modeling, we investigate the structural and dynamical properties of the IL2RA promoter and identify an intrinsically negative supercoil in the PRRII region (containing Elf-1 and HMGA1 binding sites), located upstream of a curved DNA region encompassing TSS. Conformational changes, evidenced by time-lapse studies, result in the progressive positioning of curvature apex towards the TSS, likely facilitating local DNA melting. In vitro assays confirm specific binding of the General Transcription Factors (GTFs) TBP and TFIIB over TATA-TSS position, where an inhibitory nucleosome prevented preinitiation complex (PIC) formation and uncontrolled DNA melting. These findings represent a substantial advance showing, first, that the structural properties of the IL2RA promoter are encoded in the DNA sequence and second, that during the initiation process DNA conformation is dynamic and not static

    Genetic Diversity among Enterococcus faecalis

    Get PDF
    Enterococcus faecalis, a ubiquitous member of mammalian gastrointestinal flora, is a leading cause of nosocomial infections and a growing public health concern. The enterococci responsible for these infections are often resistant to multiple antibiotics and have become notorious for their ability to acquire and disseminate antibiotic resistances. In the current study, we examined genetic relationships among 106 strains of E. faecalis isolated over the past 100 years, including strains identified for their diversity and used historically for serotyping, strains that have been adapted for laboratory use, and isolates from previously described E. faecalis infection outbreaks. This collection also includes isolates first characterized as having novel plasmids, virulence traits, antibiotic resistances, and pathogenicity island (PAI) components. We evaluated variation in factors contributing to pathogenicity, including toxin production, antibiotic resistance, polymorphism in the capsule (cps) operon, pathogenicity island (PAI) gene content, and other accessory factors. This information was correlated with multi-locus sequence typing (MLST) data, which was used to define genetic lineages. Our findings show that virulence and antibiotic resistance traits can be found within many diverse lineages of E. faecalis. However, lineages have emerged that have caused infection outbreaks globally, in which several new antibiotic resistances have entered the species, and in which virulence traits have converged. Comparing genomic hybridization profiles, using a microarray, of strains identified by MLST as spanning the diversity of the species, allowed us to identify the core E. faecalis genome as consisting of an estimated 2057 unique genes

    Varieties of living things: Life at the intersection of lineage and metabolism

    Get PDF
    publication-status: Publishedtypes: Articl
    • …
    corecore