169 research outputs found

    A GO catalogue of human DNA-binding transcription factors

    Get PDF
    To control gene transcription, DNA-binding transcription factors recognise specific sequence motifs in gene regulatory regions. A complete and reliable GO annotation of all DNA-binding transcription factors is key to investigating the delicate balance of gene regulation in response to environmental and developmental stimuli. The need for such information is demonstrated by the many lists of transcription factors that have been produced over the past decade. The COST Action Gene Regulation Ensemble Effort for the Knowledge Commons (GREEKC) Consortium brought together experts in the field of transcription with the aim of providing high quality and interoperable gene regulatory data. The Gene Ontology (GO) Consortium provides strict definitions for gene product function, including factors that regulate transcription. The collaboration between the GREEKC and GO Consortia has enabled the application of those definitions to produce a new curated catalogue of over 1400 human DNA-binding transcription factors, that can be accessed at https://www.ebi.ac.uk/QuickGO/targetset/dbTF. This catalogue has facilitated an improvement in the GO annotation of human DNA-binding transcription factors and led to the GO annotation of almost sixty thousand DNA-binding transcription factors in over a hundred species. Thus, this work will aid researchers investigating the regulation of transcription in both biomedical and basic science

    A GO catalogue of human DNA-binding transcription factors

    Get PDF
    DNA-binding transcription factors recognise genomic addresses, specific sequence motifs in gene regulatory regions, to control gene transcription. A complete and reliable catalogue of all DNA-binding transcription factors is key to investigating the delicate balance of gene regulation in response to environmental and developmental stimuli. The need for such a catalogue of proteins is demonstrated by the many lists of DNA-binding transcription factors that have been produced over the past decade. The COST Action Gene Regulation Ensemble Effort for the Knowledge Commons (GREEKC) Consortium brought together experts in the field of transcription with the aim of providing high quality and interoperable gene regulatory data. The Gene Ontology (GO) Consortium provides strict definitions for gene product function, including factors that regulate transcription. The collaboration between the GREEKC and GO Consortia has enabled the application of those definitions to produce a new curated catalogue of human DNA-binding transcription factors, that can be accessed at https://www.ebi.ac.uk/QuickGO/targetset/dbTF. In addition, this curation effort has led to the GO annotation of almost sixty thousand DNA-binding transcription factors in over a hundred species. Thus, this work will aid researchers investigating the regulation of transcription in both biomedical and basic science

    A Reporter Screen in a Human Haploid Cell Line Identifies CYLD as a Constitutive Inhibitor of NF-ÎșB

    Get PDF
    The development of forward genetic screens in human haploid cells has the potential to transform our understanding of the genetic basis of cellular processes unique to man. So far, this approach has been limited mostly to the identification of genes that mediate cell death in response to a lethal agent, likely due to the ease with which this phenotype can be observed. Here, we perform the first reporter screen in the near-haploid KBM7 cell line to identify constitutive inhibitors of NF-ÎșB. CYLD was the only currently known negative regulator of NF-ÎșB to be identified, thus uniquely distinguishing this gene. Also identified were three genes with no previous known connection to NF-ÎșB. Our results demonstrate that reporter screens in haploid human cells can be applied to investigate the many complex signaling pathways that converge upon transcription factors

    FOXM1 binds directly to non-consensus sequences in the human genome.

    Get PDF
    BACKGROUND: The Forkhead (FKH) transcription factor FOXM1 is a key regulator of the cell cycle and is overexpressed in most types of cancer. FOXM1, similar to other FKH factors, binds to a canonical FKH motif in vitro. However, genome-wide mapping studies in different cell lines have shown a lack of enrichment of the FKH motif, suggesting an alternative mode of chromatin recruitment. We have investigated the role of direct versus indirect DNA binding in FOXM1 recruitment by performing ChIP-seq with wild-type and DNA binding deficient FOXM1. RESULTS: An in vitro fluorescence polarization assay identified point mutations in the DNA binding domain of FOXM1 that inhibit binding to a FKH consensus sequence. Cell lines expressing either wild-type or DNA binding deficient GFP-tagged FOXM1 were used for genome-wide mapping studies comparing the distribution of the DNA binding deficient protein to the wild-type. This shows that interaction of the FOXM1 DNA binding domain with target DNA is essential for recruitment. Moreover, analysis of the protein interactome of wild-type versus DNA binding deficient FOXM1 shows that the reduced recruitment is not due to inhibition of protein-protein interactions. CONCLUSIONS: A functional DNA binding domain is essential for FOXM1 chromatin recruitment. Even in FOXM1 mutants with almost complete loss of binding, the protein-protein interactions and pattern of phosphorylation are largely unaffected. These results strongly support a model whereby FOXM1 is specifically recruited to chromatin through co-factor interactions by binding directly to non-canonical DNA sequences.We would like to acknowledge the Genomics and bioinformatics core at the CRUK Research Institute for the Illumina sequencing and the Proteomics core for the LC/MS-MS protein analysis for the RIME experiments. We acknowledge the support from The University of Cambridge and Cancer Research UK. The Balasubramanian Laboratory is supported by core funding from Cancer Research UK (C14303/A17197). SB is a Wellcome Trust Principle Investigator.This is the final version of the article. It first appeared from BioMed Central via http://dx.doi.org/10.1186/s13059-015-0696-

    A systematic, large-scale comparison of transcription factor binding site models

    Get PDF
    Background The modelling of gene regulation is a major challenge in biomedical research. This process is dominated by transcription factors (TFs) and mutations in their binding sites (TFBSs) may cause the misregulation of genes, eventually leading to disease. The consequences of DNA variants on TF binding are modelled in silico using binding matrices, but it remains unclear whether these are capable of accurately representing in vivo binding. In this study, we present a systematic comparison of binding models for 82 human TFs from three freely available sources: JASPAR matrices, HT-SELEX-generated models and matrices derived from protein binding microarrays (PBMs). We determined their ability to detect experimentally verified “real” in vivo TFBSs derived from ENCODE ChIP-seq data. As negative controls we chose random downstream exonic sequences, which are unlikely to harbour TFBS. All models were assessed by receiver operating characteristics (ROC) analysis. Results While the area- under-curve was low for most of the tested models with only 47 % reaching a score of 0.7 or higher, we noticed strong differences between the various position-specific scoring matrices with JASPAR and HT-SELEX models showing higher success rates than PBM-derived models. In addition, we found that while TFBS sequences showed a higher degree of conservation than randomly chosen sequences, there was a high variability between individual TFBSs. Conclusions Our results show that only few of the matrix-based models used to predict potential TFBS are able to reliably detect experimentally confirmed TFBS. We compiled our findings in a freely accessible web application called ePOSSUM (http:/mutationtaster.charite.de/ePOSSUM/) which uses a Bayes classifier to assess the impact of genetic alterations on TF binding in user-defined sequences. Additionally, ePOSSUM provides information on the reliability of the prediction using our test set of experimentally confirmed binding sites

    Daily rhythms of the sleep-wake cycle

    Get PDF
    The amount and timing of sleep and sleep architecture (sleep stages) are determined by several factors, important among which are the environment, circadian rhythms and time awake. Separating the roles played by these factors requires specific protocols, including the constant routine and altered sleep-wake schedules. Results from such protocols have led to the discovery of the factors that determine the amounts and distribution of slow wave and rapid eye movement sleep as well as to the development of models to determine the amount and timing of sleep. One successful model postulates two processes. The first is process S, which is due to sleep pressure (and increases with time awake) and is attributed to a 'sleep homeostat'. Process S reverses during slow wave sleep (when it is called process S'). The second is process C, which shows a daily rhythm that is parallel to the rhythm of core temperature. Processes S and C combine approximately additively to determine the times of sleep onset and waking. The model has proved useful in describing normal sleep in adults. Current work aims to identify the detailed nature of processes S and C. The model can also be applied to circumstances when the sleep-wake cycle is different from the norm in some way. These circumstances include: those who are poor sleepers or short sleepers; the role an individual's chronotype (a measure of how the timing of the individual's preferred sleep-wake cycle compares with the average for a population); and changes in the sleep-wake cycle with age, particularly in adolescence and aging, since individuals tend to prefer to go to sleep later during adolescence and earlier in old age. In all circumstances, the evidence that sleep times and architecture are altered and the possible causes of these changes (including altered S, S' and C processes) are examined

    Occupancy maps of 208 chromatin-associated proteins in one human cell type

    Get PDF
    Transcription factors are DNA-binding proteins that have key roles in gene regulation. Genome-wide occupancy maps of transcriptional regulators are important for understanding gene regulation and its effects on diverse biological processes. However, only a minority of the more than 1,600 transcription factors encoded in the human genome has been assayed. Here we present, as part of the ENCODE (Encyclopedia of DNA Elements) project, data and analyses from chromatin immunoprecipitation followed by high-throughput sequencing (ChIP–seq) experiments using the human HepG2 cell line for 208 chromatin-associated proteins (CAPs). These comprise 171 transcription factors and 37 transcriptional cofactors and chromatin regulator proteins, and represent nearly one-quarter of CAPs expressed in HepG2 cells. The binding profiles of these CAPs form major groups associated predominantly with promoters or enhancers, or with both. We confirm and expand the current catalogue of DNA sequence motifs for transcription factors, and describe motifs that correspond to other transcription factors that are co-enriched with the primary ChIP target. For example, FOX family motifs are enriched in ChIP–seq peaks of 37 other CAPs. We show that motif content and occupancy patterns can distinguish between promoters and enhancers. This catalogue reveals high-occupancy target regions at which many CAPs associate, although each contains motifs for only a minority of the numerous associated transcription factors. These analyses provide a more complete overview of the gene regulatory networks that define this cell type, and demonstrate the usefulness of the large-scale production efforts of the ENCODE Consortium
    • 

    corecore