
    Transcription Factor Map Alignment of Promoter Regions

    We address the problem of comparing and characterizing the promoter regions of genes with similar expression patterns. This remains a challenging problem in sequence analysis, because often the promoter regions of co-expressed genes do not show discernible sequence conservation. In our approach, thus, we have not directly compared the nucleotide sequence of promoters. Instead, we have obtained predictions of transcription factor binding sites, annotated the predicted sites with the labels of the corresponding binding factors, and aligned the resulting sequences of labels—to which we refer here as transcription factor maps (TF-maps). To obtain the global pairwise alignment of two TF-maps, we have adapted an algorithm initially developed to align restriction enzyme maps. We have optimized the parameters of the algorithm in a small, but well-curated, collection of human–mouse orthologous gene pairs. Results in this dataset, as well as in an independent much larger dataset from the CISRED database, indicate that TF-map alignments are able to uncover conserved regulatory elements, which cannot be detected by the typical sequence alignments
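To make the idea of aligning label sequences concrete, here is a minimal sketch. It is not the authors' algorithm (the abstract says theirs was adapted from a restriction enzyme map aligner and also uses site positions); it swaps in a plain Needleman-Wunsch global alignment over factor labels only. The function name `align_tf_maps`, the scoring values, and the example labels are all illustrative assumptions.

```python
def align_tf_maps(a, b, match=2, mismatch=-1, gap=-1):
    """Globally align two lists of TF labels (simplified: positions ignored).

    Returns (score, pairs), where pairs holds (label_a, label_b) tuples and
    None marks a gap. Scoring values are arbitrary illustrative integers.
    """
    n, m = len(a), len(b)

    def sub(x, y):
        # Substitution score: reward identical factor labels only.
        return match if x == y else mismatch

    # score[i][j] = best score aligning a[:i] with b[:j]
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            score[i][j] = max(
                score[i - 1][j - 1] + sub(a[i - 1], b[j - 1]),
                score[i - 1][j] + gap,
                score[i][j - 1] + gap,
            )

    # Trace back from the bottom-right corner to recover one optimal alignment.
    pairs = []
    i, j = n, m
    while i > 0 or j > 0:
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + sub(a[i - 1], b[j - 1]):
            pairs.append((a[i - 1], b[j - 1]))
            i, j = i - 1, j - 1
        elif i > 0 and (j == 0 or score[i][j] == score[i - 1][j] + gap):
            pairs.append((a[i - 1], None))
            i -= 1
        else:
            pairs.append((None, b[j - 1]))
            j -= 1
    pairs.reverse()
    return score[n][m], pairs


# Hypothetical predicted-site labels for two promoters, in sequence order.
human = ["SP1", "TBP", "CREB"]
mouse = ["SP1", "CREB"]
best_score, alignment = align_tf_maps(human, mouse)
```

With these inputs, the two shared labels pair up and the unmatched `TBP` site is aligned to a gap. A fuller treatment would also weight matches by the predicted site scores and penalize differences in spacing between consecutive sites.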

    Multiple non-collinear TF-map alignments of promoter regions

    Background: The analysis of the promoter sequence of genes with similar expression patterns is a basic tool to annotate common regulatory elements. Multiple sequence alignments are on the basis of most comparative approaches. The characterization of regulatory regions from co-expressed genes at the sequence level, however, does not yield satisfactory results in many occasions as promoter regions of genes sharing similar expression programs often do not show nucleotide sequence conservation. Results: In a recent approach to circumvent this limitation, we proposed to align the maps of predicted transcription factors (referred as TF-maps) instead of the nucleotide sequence of two related promoters, taking into account the label of the corresponding factor and the position in the primary sequence. We have now extended the basic algorithm to permit multiple promoter comparisons using the progressive alignment paradigm. In addition, non-collinear conservation blocks might now be identified in the resulting alignments. We have optimized the parameters of the algorithm in a small, but well-characterized collection of human-mouse-chicken-zebrafish orthologous gene promoters. Conclusion: Results in this dataset indicate that TF-map alignments are able to detect high-level regulatory conservation at the promoter and the 3'UTR gene regions, which cannot be detected by the typical sequence alignments. Three particular examples are introduced here to illustrate the power of the multiple TF-map alignments to characterize conserved regulatory elements in absence of sequence similarity. We consider this kind of approach can be extremely useful in the future to annotate potential transcription factor binding sites on sets of co-regulated genes from high-throughput expression experiments.

    Expression of the plasma prekallikrein gene: utilization of multiple transcription start sites and alternative promoter regions

    The plasma prekallikrein gene is expressed in many different human tissues at distinctly different levels and therefore tissue-specific control of the gene transcription is likely. In this study we demonstrate that transcription of the plasma prekallikrein gene can be initiated at multiple sites, for which at least four different promoters are utilized. A comparison of the genomic and mRNA sequences of mouse plasma prekallikrein revealed that the sequence segment that was formerly regarded as the first exon of the mouse plasma prekallikrein gene consists of three exons, with the first exon localized 14.2 kbp upstream of the translation start. For the rat and human plasma prekallikrein genes, in silico analysis suggested an analogous exon-intron organization. Determination of the transcription start sites showed that in both mouse and human, the proximal and distal regions could be utilized for transcription initiation; however, the proximal region is preferred. A deletion mutation analysis of the proximal promoter region using a 1.7-kbp segment revealed a strong activating region immediately upstream of the known mRNA, followed by both a modest repressor and an enhancer region

    Conserved noncoding sequences highlight shared components of regulatory networks in dicotyledonous plants

    Conserved noncoding sequences (CNSs) in DNA are reliable pointers to regulatory elements controlling gene expression. Using a comparative genomics approach with four dicotyledonous plant species (Arabidopsis thaliana, papaya [Carica papaya], poplar [Populus trichocarpa], and grape [Vitis vinifera]), we detected hundreds of CNSs upstream of Arabidopsis genes. Distinct positioning, length, and enrichment for transcription factor binding sites suggest these CNSs play a functional role in transcriptional regulation. The enrichment of transcription factors within the set of genes associated with CNS is consistent with the hypothesis that together they form part of a conserved transcriptional network whose function is to regulate other transcription factors and control development. We identified a set of promoters where regulatory mechanisms are likely to be shared between the model organism Arabidopsis and other dicots, providing areas of focus for further research

    The trans-activation domain of the sporulation response regulator Spo0A revealed by X-ray crystallography

    Sporulation in Bacillus involves the induction of scores of genes in a temporally and spatially co-ordinated programme of cell development. Its initiation is under the control of an expanded two-component signal transduction system termed a phosphorelay. The master control element in the decision to sporulate is the response regulator, Spo0A, which comprises a receiver or phosphoacceptor domain and an effector or transcription activation domain. The receiver domain of Spo0A shares sequence similarity with numerous response regulators, and its structure has been determined in phosphorylated and unphosphorylated forms. However, the effector domain (C-Spo0A) has no detectable sequence similarity to any other protein, and this lack of structural information is an obstacle to understanding how DNA binding and transcription activation are controlled by phosphorylation in Spo0A. Here, we report the crystal structure of C-Spo0A from Bacillus stearothermophilus revealing a single alpha -helical domain comprising six alpha -helices in an unprecedented fold. The structure contains a helix-turn-helix as part of a three alpha -helical bundle reminiscent of the catabolite gene activator protein (CAP), suggesting a mechanism for DNA binding. The residues implicated in forming the sigma (A)-activating region clearly cluster in a flexible segment of the polypeptide on the opposite side of the structure from that predicted to interact with DNA. The structural results are discussed in the context of the rich array of existing mutational data

    Quantitative test of the barrier nucleosome model for statistical positioning of nucleosomes up- and downstream of transcription start sites

    The positions of nucleosomes in eukaryotic genomes determine which parts of the DNA sequence are readily accessible for regulatory proteins and which are not. Genome-wide maps of nucleosome positions have revealed a salient pattern around transcription start sites, involving a nucleosome-free region (NFR) flanked by a pronounced periodic pattern in the average nucleosome density. While the periodic pattern clearly reflects well-positioned nucleosomes, the positioning mechanism is less clear. A recent experimental study by Mavrich et al. argued that the pattern observed in S. cerevisiae is qualitatively consistent with a `barrier nucleosome model', in which the oscillatory pattern is created by the statistical positioning mechanism of Kornberg and Stryer. On the other hand, there is clear evidence for intrinsic sequence preferences of nucleosomes, and it is unclear to what extent these sequence preferences affect the observed pattern. To test the barrier nucleosome model, we quantitatively analyze yeast nucleosome positioning data both up- and downstream from NFRs. Our analysis is based on the Tonks model of statistical physics which quantifies the interplay between the excluded-volume interaction of nucleosomes and their positional entropy. We find that although the typical patterns on the two sides of the NFR are different, they are both quantitatively described by the same physical model, with the same parameters, but different boundary conditions. The inferred boundary conditions suggest that the first nucleosome downstream from the NFR (the +1 nucleosome) is typically directly positioned while the first nucleosome upstream is statistically positioned via a nucleosome-repelling DNA region. These boundary conditions, which can be locally encoded into the genome sequence, significantly shape the statistical distribution of nucleosomes over a range of up to ~1000 bp to each side. Comment: includes supporting material

    12-h clock regulation of genetic information flow by XBP1s

    © The Author(s), 2020. This article is distributed under the terms of the Creative Commons Attribution License. The definitive version was published in Pan, Y., Ballance, H., Meng, H., Gonzalez, N., Kim, S., Abdurehman, L., York, B., Chen, X., Schnytzer, Y., Levy, O., Dacso, C. C., McClung, C. A., O'Malley, B. W., Liu, S., & Zhu, B. 12-h clock regulation of genetic information flow by XBP1s. Plos Biology, 18(1), (2020): e3000580, doi:10.1371/journal.pbio.3000580. Our group recently characterized a cell-autonomous mammalian 12-h clock independent from the circadian clock, but its function and mechanism of regulation remain poorly understood. Here, we show that in mouse liver, transcriptional regulation significantly contributes to the establishment of 12-h rhythms of mRNA expression in a manner dependent on Spliced Form of X-box Binding Protein 1 (XBP1s). Mechanistically, the motif stringency of XBP1s promoter binding sites dictates XBP1s’s ability to drive 12-h rhythms of nascent mRNA transcription at dawn and dusk, which are enriched for basal transcription regulation, mRNA processing and export, ribosome biogenesis, translation initiation, and protein processing/sorting in the Endoplasmic Reticulum (ER)-Golgi in a temporal order consistent with the progressive molecular processing sequence described by the central dogma information flow (CEDIF). We further identified GA-binding proteins (GABPs) as putative novel transcriptional regulators driving 12-h rhythms of gene expression with more diverse phases. These 12-h rhythms of gene expression are cell autonomous and evolutionarily conserved in marine animals possessing a circatidal clock. Our results demonstrate an evolutionarily conserved, intricate network of transcriptional control of the mammalian 12-h clock that mediates diverse biological pathways. We speculate that the 12-h clock is coopted to accommodate elevated gene expression and processing in mammals at the two rush hours, with the particular genes processed at each rush hour regulated by the circadian and/or tissue-specific pathways. This study was supported by the American Diabetes Association junior faculty development award 1-18-JDF-025 to B.Z., by funding from National Institute of Health HD07879 and 1P01DK113954 to B.W.O, by funding from National Science Foundation award 1703170 to C.C.D. and B.Z., and by funding from Brockman Foundation to C.C.D and B.W.O. This work was further supported by the UPMC Genome Center with funding from UPMC’s Immunotherapy and Transplant Center. This research was supported in part by the University of Pittsburgh Center for Research Computing through the resources provided. Research reported in this publication was further supported by the National Institute of Diabetes And Digestive And Kidney Diseases of the National Institutes of Health under award number P30DK120531 to Pittsburgh Liver Research Center, in which both S.L. and B.Z. are members. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript

    Comparative analyses of CTCF and BORIS occupancies uncover two distinct classes of CTCF binding genomic regions.

    Background: CTCF and BORIS (CTCFL), two paralogous mammalian proteins sharing nearly identical DNA binding domains, are thought to function in a mutually exclusive manner in DNA binding and transcriptional regulation. Results: Here we show that these two proteins co-occupy a specific subset of regulatory elements consisting of clustered CTCF binding motifs (termed 2xCTSes). BORIS occupancy at 2xCTSes is largely invariant in BORIS-positive cancer cells, with the genomic pattern recapitulating the germline-specific BORIS binding to chromatin. In contrast to the single-motif CTCF target sites (1xCTSes), the 2xCTS elements are preferentially found at active promoters and enhancers, both in cancer and germ cells. 2xCTSes are also enriched in genomic regions that escape histone to protamine replacement in human and mouse sperm. Depletion of the BORIS gene leads to altered transcription of a large number of genes and the differentiation of K562 cells, while the ectopic expression of this CTCF paralog leads to specific changes in transcription in MCF7 cells. Conclusions: We discover two functionally and structurally different classes of CTCF binding regions, 2xCTSes and 1xCTSes, revealed by their predisposition to bind BORIS. We propose that 2xCTSes play key roles in the transcriptional program of cancer and germ cells