    CSMET: Comparative Genomic Motif Detection via Multi-Resolution Phylogenetic Shadowing

    Functional turnover of transcription factor binding sites (TFBSs), such as whole-motif loss or gain, is a common event during genome evolution. Conventional probabilistic phylogenetic shadowing methods model the evolution of genomes only at the nucleotide level and lack the ability to capture the evolutionary dynamics of functional turnover of aligned sequence entities. As a result, comparative genomic search of non-conserved motifs across evolutionarily related taxa remains a difficult challenge, especially in higher eukaryotes, where the cis-regulatory regions containing motifs can be long and divergent; existing methods rely heavily on specialized pattern-driven heuristic search or sampling algorithms, which can be difficult to generalize and hard to interpret based on phylogenetic principles. We propose a new method, Conditional Shadowing via Multi-resolution Evolutionary Trees (CSMET), which uses a context-dependent probabilistic graphical model that allows aligned sites from different taxa in a multiple alignment to be modeled by either a background or an appropriate motif phylogeny, conditioned on the functional specifications of each taxon. The functional specifications themselves are the output of a phylogeny which models the evolution not of individual nucleotides, but of the overall functionality (e.g., functional retention or loss) of the aligned sequence segments over lineages. Combining this method with a hidden Markov model that autocorrelates evolutionary rates on successive sites in the genome, CSMET offers a principled way to take into consideration lineage-specific evolution of TFBSs during motif detection, and a readily computable analytical form of the posterior distribution of motifs under TFBS turnover. On both simulated and real Drosophila cis-regulatory modules, CSMET outperforms other state-of-the-art comparative genomic motif finders.

    Contribution of Transcription Factor Binding Site Motif Variants to Condition-Specific Gene Expression Patterns in Budding Yeast

    It is now experimentally well established that variant sequences of a cis transcription factor binding site motif can contribute to differential regulation of genes. We characterize the relationship between motif variants and gene expression by analyzing expression microarray data and binding site predictions. To accomplish this, we statistically detect motif variants with effects that differ among environments. Such environmental specificity may be due either to affinity differences between variants or, more likely, to differential interactions of TFs bound to these variants with cofactors, together with differential presence of cofactors across environments. We examine conservation of functional variants across four Saccharomyces species, and find that about a third of transcription factors have target genes that are differentially expressed in a condition-specific manner that is correlated with the nucleotide at variant motif positions. We find good correspondence between our results and some cases in the experimental literature (Reb1, Sum1, Mcm1, and Rap1). These results and the growing consensus in the literature indicate that motif variants may often be functionally distinct, that this may be observed in genomic data, and that variants play an important role in condition-specific gene regulation.

    Distinct Functional Constraints Partition Sequence Conservation in a cis-Regulatory Element

    Different functional constraints contribute to different evolutionary rates across genomes. To understand why some sequences evolve faster than others in a single cis-regulatory locus, we investigated the function and evolutionary dynamics of the promoter of the Caenorhabditis elegans unc-47 gene. We found that this promoter consists of two distinct domains. The proximal promoter is conserved and is largely sufficient to direct appropriate spatial expression. The distal promoter displays little if any conservation between several closely related nematodes. Despite this divergence, sequences from all species confer robustness of expression, arguing that this function does not require substantial sequence conservation. We showed that even unrelated sequences have the ability to promote robust expression. A prominent feature shared by all of these robustness-promoting sequences is an AT-enriched nucleotide composition consistent with nucleosome depletion. Because general sequence composition can be maintained despite sequence turnover, our results explain how different functional constraints can lead to vastly disparate rates of sequence divergence within a promoter.

    A Naturally Occurring Polymorphism at Drosophila melanogaster Lim3 Locus, a Homolog of Human LHX3/4, Affects Lim3 Transcription and Fly Lifespan

    Lim3 encodes an RNA polymerase II transcription factor with a key role in neuron specification. It was also identified as a candidate gene that affects lifespan. These pleiotropic effects indicate the fundamental significance of the potential interplay between neural development and lifespan control. The goal of this study was to analyze the causal relationships among Lim3 structural variation, gene expression, and lifespan changes, and to provide insights into regulatory pathways controlling lifespan. Fifty substitution lines containing second chromosomes from a Drosophila natural population were used to analyze the association between lifespan and sequence variation in the 5′-regulatory region and the first exon and intron of Lim3A, in which we discovered multiple transcription start sites (TSS). The core and proximal promoter organization for Lim3A and a previously unknown mRNA named Lim3C were described. A haplotype of two markers in the Lim3A regulatory region was significantly associated with variation in lifespan. We propose that polymorphisms in the regulatory region affect gene transcription, and consequently lifespan. Indeed, five polymorphic markers located within 380 to 680 bp of the Lim3A major TSS, including two markers associated with lifespan variation, were significantly associated with the level of Lim3A transcript, as evaluated by real-time RT-PCR in embryos, adult heads, and testes. A naturally occurring polymorphism caused a six-fold change in gene transcription and a 25% change in lifespan. Markers associated with long lifespan and intermediate Lim3A transcription were present in the population at high frequencies. We hypothesize that polymorphic markers associated with Lim3A expression are located within the binding sites for proteins that regulate gene function and provide general rather than tissue-specific regulation of transcription, and that intermediate levels of Lim3A expression confer a selective advantage and longer lifespan.

    Mathematics and biology: a Kantian view on the history of pattern formation theory

    Driesch’s statement, made around 1900, that the physics and chemistry of his day were unable to explain self-regulation during embryogenesis was correct and could be extended until the year 1972. The emergence of theories of self-organisation required progress in several areas including chemistry, physics, computing and cybernetics. Two parallel lines of development can be distinguished, both of which culminated in the early 1970s. Firstly, physicochemical theories of self-organisation arose from theoretical (Lotka 1910–1920) and experimental work (Bray 1920; Belousov 1951) on chemical oscillations. However, this research area gained broader acceptance only after thermodynamics was extended to systems far from equilibrium (1922–1967) and the mechanism of the prime example of a chemical oscillator, the Belousov–Zhabotinski reaction, was deciphered in the early 1970s. Secondly, biological theories of self-organisation were rooted in the intellectual environment of artificial intelligence and cybernetics. Turing wrote "The chemical basis of morphogenesis" (1952) after working on the construction of one of the first electronic computers. Likewise, Gierer and Meinhardt’s theory of local activation and lateral inhibition (1972) was influenced by ideas from cybernetics. The Gierer–Meinhardt theory provided, for the first time, an explanation of both the spontaneous formation of spatial order and self-regulation, and it proved to be extremely successful in elucidating a wide range of patterning processes. With the advent of developmental genetics in the 1980s, detailed molecular and functional data became available for complex developmental processes, allowing a new generation of data-driven theoretical approaches. Three examples of such approaches will be discussed. The successes and limitations of mathematical pattern formation theory throughout its history suggest a picture of the organism that is structurally similar to views of the organic world held by the philosopher Immanuel Kant at the end of the eighteenth century.

    Phylogeny Disambiguates the Evolution of Heat-Shock cis-Regulatory Elements in Drosophila

    Heat-shock genes have a well-studied control mechanism for their expression that is mediated through cis-regulatory motifs known as heat-shock elements (HSEs). The evolution of important features of this control mechanism has not been investigated in detail, however. Here we exploit the genome sequencing of multiple Drosophila species, combined with a wealth of available information on the structure and function of HSEs in D. melanogaster, to undertake this investigation. We find that in single-copy heat-shock genes, entire HSEs have evolved or disappeared 14 times, and the phylogenetic approach bounds the timing and direction of these evolutionary events in relation to speciation. In contrast, in the multi-copy gene Hsp70, the number of HSEs is nearly constant across species. HSEs evolve in size, position, and sequence within heat-shock promoters. In turn, functional significance of certain features is implicated by preservation despite this evolutionary change; these features include tail-to-tail arrangements of HSEs, gapped HSEs, and the presence or absence of entire HSEs. The variation among Drosophila species indicates that the cis-regulatory encoding of responsiveness to heat and other stresses is diverse. The broad dimensions of variation uncovered are particularly important as they suggest a substantial challenge for functional studies.

    Estrogen- and Progesterone (P4)-Mediated Epigenetic Modifications of Endometrial Stromal Cells (EnSCs) and/or Mesenchymal Stem/Stromal Cells (MSCs) in the Etiopathogenesis of Endometriosis

    Endometriosis is a common chronic inflammatory condition in which endometrial tissue appears outside the uterine cavity. Because ectopic endometriosis cells express both estrogen and progesterone (P4) receptors, they grow and undergo cyclic proliferation and breakdown similar to the endometrium. This debilitating gynecological disease affects up to 15% of reproductive-aged women. Despite many years of research, the etiopathogenesis of endometrial lesions remains unclear. Retrograde transport of viable menstrual endometrial cells with retained ability for attachment within the pelvic cavity, proliferation, differentiation and subsequent invasion into the surrounding tissue constitutes the rationale for the widely accepted implantation theory. Accordingly, the most abundant cells in the endometrium are endometrial stromal cells (EnSCs). These cells constitute a particular population with clonogenic activity that resembles properties of mesenchymal stem/stromal cells (MSCs). Thus, a significant role of stem cell-based dysfunction in formation of the initial endometrial lesions is suspected. There is increasing evidence that the role of epigenetic mechanisms and processes in endometriosis has been underestimated. Excess estrogen exposure and P4 resistance are crucial to the failure of epigenetic homeostasis in endometrial/endometriotic tissue. Epigenetic alterations regarding transcription factors of estrogen and P4 signaling pathways in MSCs are robust in endometriotic tissue. Thus, perspectives for the future may include MSCs and EnSCs as the targets of epigenetic therapies in the prevention and treatment of endometriosis. Here, we reviewed the currently known changes in the epigenetic background of EnSCs and MSCs due to estrogen/P4 imbalances in the context of etiopathogenesis of endometriosis.