17 research outputs found

    Transcriptional features of genomic regulatory blocks

    CAGE tag mapping of transcription start sites across different human tissues shows that genomic regulatory blocks have unique features that are the likely cause of their ability to respond to regulatory inputs from very long distances

    JASPAR, the open access database of transcription factor-binding profiles: new content and tools in the 2008 update

    JASPAR is a popular open-access database for matrix models describing DNA-binding preferences for transcription factors and other DNA patterns. With its third major release, JASPAR has been expanded and equipped with additional functions aimed at both casual and power users. The heart of the JASPAR database—the JASPAR CORE sub-database—has increased by 12% in size, and three new specialized sub-databases have been added. New functions include clustering of matrix models by similarity, generation of random matrices by sampling from selected sets of existing models and a language-independent Web Service applications programming interface for matrix retrieval. JASPAR is available at http://jaspar.genereg.net

    Genome-wide Map of Quantified Epigenetic Changes during In vitro Chondrogenic Differentiation of Primary Human Mesenchymal Stem Cells

    Background: For safe clinical application of engineered cartilage made from mesenchymal stem cells (MSCs), molecular mechanisms for chondrogenic differentiation must be known in detail. Changes in gene expression and extracellular matrix synthesis have been extensively studied, but the epigenomic modifications underlying these changes have not been described. To this end we performed whole-genome chromatin immunoprecipitation and deep sequencing to quantify six histone modifications, reduced representation bisulphite sequencing to quantify DNA methylation and mRNA microarrays to quantify gene expression before and after 7 days of chondrogenic differentiation of MSCs in an alginate scaffold. To add to the clinical relevance of our observations, the study is based on primary bone marrow-derived MSCs from four donors, allowing us to investigate inter-individual variations. Results: We see two levels of relationship between epigenetic marking and gene expression. First, a large number of genes ontogenetically linked to MSC properties and the musculoskeletal system are epigenetically prepatterned by moderate changes in H3K4me3 and H3K9ac near transcription start sites. Most of these genes remain transcriptionally unaltered. Second, transcriptionally upregulated genes, more closely associated with chondrogenesis, are marked by H3K36me3 in gene bodies, highly increased H3K4me3 and H3K9ac on promoters and the 5' end of genes, and increased H3K27ac and H3K4me1 marking in at least one enhancer region per upregulated gene. Within the 7-day time frame, changes in promoter DNA methylation do not correlate significantly with changes in gene expression. Inter-donor variability analysis shows a high level of similarity between the donors for this data set. Conclusions: Histone modifications, rather than DNA methylation, provide the primary epigenetic control of early differentiation of MSCs towards the chondrogenic lineage.

    The male germ cell gene regulator CTCFL is functionally different from CTCF and binds CTCF-like consensus sites in a nucleosome composition-dependent manner.

    RIGHTS: This article is licensed under the BioMed Central licence at http://www.biomedcentral.com/about/license which is similar to the 'Creative Commons Attribution Licence'. In brief you may: copy, distribute, and display the work; make derivative works; or make commercial use of the work - under the following conditions: the original author must be given credit; for any reuse or distribution, it must be made clear to others what the license terms of this work are. BACKGROUND: CTCF is a highly conserved and essential zinc finger protein expressed in virtually all cell types. In conjunction with cohesin, it organizes chromatin into loops, thereby regulating gene expression and epigenetic events. The function of CTCFL or BORIS, the testis-specific paralog of CTCF, is less clear. RESULTS: Using immunohistochemistry on testis sections and fluorescence-based microscopy on intact live seminiferous tubules, we show that CTCFL is only transiently present during spermatogenesis, prior to the onset of meiosis, when the protein co-localizes in nuclei with ubiquitously expressed CTCF. CTCFL distribution overlaps completely with that of Stra8, a retinoic acid-inducible protein essential for the propagation of meiosis. We find that absence of CTCFL in mice causes sub-fertility because of a partially penetrant testicular atrophy. CTCFL deficiency affects the expression of a number of testis-specific genes, including Gal3st1 and Prss50. Combined, these data indicate that CTCFL has a unique role in spermatogenesis. Genome-wide RNA expression studies in ES cells expressing a V5- and GFP-tagged form of CTCFL show that genes that are downregulated in CTCFL-deficient testis are upregulated in ES cells. These data indicate that CTCFL is a male germ cell gene regulator. Furthermore, genome-wide DNA-binding analysis shows that CTCFL binds a consensus sequence that is very similar to that of CTCF. However, only ~3,700 out of the ~5,700 CTCFL- and ~31,000 CTCF-binding sites overlap. CTCFL binds promoters with loosely assembled nucleosomes, whereas CTCF favors consensus sites surrounded by phased nucleosomes. Finally, an ES cell-based rescue assay shows that CTCFL is functionally different from CTCF. CONCLUSIONS: Our data suggest that nucleosome composition specifies the genome-wide binding of CTCFL and CTCF. We propose that the transient expression of CTCFL in spermatogonia and preleptotene spermatocytes serves to occupy a subset of promoters and maintain the expression of male germ cell genes

    The DBCLS BioHackathon: standardization and interoperability for bioinformatics web services and workflows. The DBCLS BioHackathon Consortium*

    Web services have become a key technology for bioinformatics, since life science databases are globally decentralized and the exponential increase in the amount of available data demands efficient systems that do not require transferring entire databases for every step of an analysis. However, various incompatibilities among database resources and analysis services make it difficult to connect and integrate these into interoperable workflows. To resolve this situation, we invited domain specialists from web service providers, client software developers, Open Bio* projects, the BioMoby project and researchers of emerging areas where a standard exchange data format is not well established, for an intensive collaboration entitled the BioHackathon 2008. The meeting was hosted by the Database Center for Life Science (DBCLS) and Computational Biology Research Center (CBRC) and was held in Tokyo from February 11th to 15th, 2008. In this report we highlight the work accomplished and the common issues that arose from this event, including the standardization of data exchange formats and services in the emerging fields of glycoinformatics, biological interaction networks, text mining, and phyloinformatics. In addition, common shared object development based on BioSQL, as well as technical challenges in large data management, asynchronous services, and security are discussed. Consequently, we improved interoperability of web services in several fields; however, further cooperation among major database centers and continued collaborative efforts between service providers and software developers are still necessary for an effective advance in bioinformatics web service technologies

    ELM: the status of the 2010 eukaryotic linear motif resource

    Linear motifs are short segments of multidomain proteins that provide regulatory functions independently of protein tertiary structure. Much of intracellular signalling passes through protein modifications at linear motifs. Many thousands of linear motif instances, most notably phosphorylation sites, have now been reported. Although clearly very abundant, linear motifs are difficult to predict de novo in protein sequences due to the difficulty of obtaining robust statistical assessments. The ELM resource at http://elm.eu.org/ provides an expanding knowledge base, currently covering 146 known motifs, with annotation that includes >1300 experimentally reported instances. ELM is also an exploratory tool for suggesting new candidates of known linear motifs in proteins of interest. Information about protein domains, protein structure and native disorder, cellular and taxonomic contexts is used to reduce or deprecate false positive matches. Results are graphically displayed in a ‘Bar Code’ format, which also displays known instances from homologous proteins through a novel ‘Instance Mapper’ protocol based on PHI-BLAST. ELM server output provides links to the ELM annotation as well as to a number of remote resources. Using the links, researchers can explore the motifs, proteins, complex structures and associated literature to evaluate whether candidate motifs might be worth experimental investigation

    The genome-wide dynamics of the binding of Ldb1 complexes during erythroid differentiation

    One of the complexes formed by the hematopoietic transcription factor Gata1 is a complex with the Ldb1 (LIM domain-binding protein 1) and Tal1 proteins. It is known to be important for the development and differentiation of the erythroid cell lineage and is thought to be implicated in long-range interactions. Here, the dynamics of the composition of the complex (in particular, the binding of the negative regulators Eto2 and Mtgr1) are studied in the context of their genome-wide targets. This shows that the complex acts almost exclusively as an activator, binding a very specific combination of sequences, with a positioning relative to the transcription start site that depends on the type of core promoter. The activation is accompanied by a net decrease in the relative binding of Eto2 and Mtgr1. A Chromosome Conformation Capture sequencing (3C-seq) assay also shows that the binding of the Ldb1 complex marks genomic interaction sites in vivo. This establishes the Ldb1 complex as a positive regulator of the final steps of erythroid differentiation that acts through the shedding of negative regulators and the active interaction between regulatory sequences

    Genome-wide, whole mount in situ analysis of transcriptional regulators in zebrafish embryos

    Transcription is the primary step in the retrieval of genetic information. A substantial proportion of the protein repertoire of each organism consists of transcriptional regulators (TRs). It is believed that the differential expression and combinatorial action of these TRs is essential for vertebrate development and body homeostasis. We mined the zebrafish genome exhaustively for genes encoding TRs and determined their expression in the zebrafish embryo by sequencing to saturation and in situ hybridisation. At the evolutionarily conserved phylotypic stage, 75% of the 3302 TR genes encoded in the genome are already expressed. The number of expressed TR genes increases only marginally in subsequent stages and is maintained during adulthood, suggesting important roles of the TR genes in body homeostasis. Fewer than half of the TR genes (45%, n=1711 genes) are expressed in a tissue-restricted manner in the embryo. Transcripts of 207 genes were detected in a single tissue in the 24-hour embryo, potentially acting as regulators of specific processes. Other TR genes were expressed in multiple tissues. However, with the exception of certain territories in the nervous system, we did not find significant synexpression, suggesting that most tissue-restricted TRs act in a freely combinatorial fashion. Our data indicate that elaboration of body pattern and function from the phylotypic stage onward relies mostly on redeployment of TRs and post-transcriptional processes

    Unscrambling the genomic chaos of osteosarcoma reveals extensive transcript fusion, recurrent rearrangements and frequent novel TP53 aberrations

    In contrast to many other sarcoma subtypes, the chaotic karyotypes of osteosarcoma have precluded the identification of pathognomonic translocations. We here report hundreds of genomic rearrangements in osteosarcoma cell lines, showing clear characteristics of microhomology-mediated break-induced replication (MMBIR) and end-joining repair (MMEJ) mechanisms. However, at the RNA level, the majority of the fused transcripts did not correspond to genomic rearrangements, suggesting the involvement of trans-splicing, which was further supported by typical trans-splicing characteristics. By combining genomic and transcriptomic analysis, certain recurrent rearrangements were identified and further validated in patient biopsies, including a PMP22-ELOVL5 gene fusion, genomic structural variations affecting RB1, MTAP/CDKN2A and MDM2, and, most frequently, rearrangements involving TP53. Most cell lines (7/11) and a large fraction of tumor samples (10/25) showed TP53 rearrangements, in addition to somatic point mutations (6 patient samples, 1 cell line) and MDM2 amplifications (2 patient samples, 2 cell lines). The resulting inactivation of p53 was demonstrated by a deficiency of the radiation-induced DNA damage response. Thus, TP53 rearrangements are the major mechanism of p53 inactivation in osteosarcoma. Together with active MMBIR and MMEJ, this inactivation probably contributes to the exceptional chromosomal instability in these tumors. Although rampant rearrangements appear to be a phenotype of osteosarcomas, we demonstrate that among the huge number of probable passenger rearrangements, specific recurrent, possibly oncogenic, events are present. This is the first time that the genomic chaos of osteosarcoma has been characterized so thoroughly; the characterization delivers new insights into mechanisms involved in osteosarcoma development and may contribute to new diagnostic and therapeutic strategies