92 research outputs found

    Comprehensive prediction in 78 human cell lines reveals rigidity and compactness of transcription factor dimers

    Get PDF
    The binding of transcription factors (TFs) to their specific motifs in genomic regulatory regions is commonly studied in isolation. However, in order to elucidate the mechanisms of transcriptional regulation, it is essential to determine which TFs bind DNA cooperatively as dimers and to infer the precise nature of these interactions. So far, only a small number of such dimeric complexes are known. Here, we present an algorithm for predicting cell-type-specific TF-TF dimerization on DNA on a large scale, using DNase I hypersensitivity data from 78 human cell lines. We represented the universe of possible TF complexes by their corresponding motif complexes, and analyzed their occurrence at cell-type-specific DNase I hypersensitive sites. Based on ~1.4 billion tests for motif complex enrichment, we predicted 603 highly significant celltype- specific TF dimers, the vast majority of which are novel. Our predictions included 76% (19/25) of the known dimeric complexes and showed significant overlap with an e xperimental database of protein-protein interactions. They were also independently supported by evolutionary conservation, as well as quantitative variation in DNase I digestion patterns. Notably, the known and predicted TF dimers were almost always highly compact and rigidly spaced, suggesting that TFs dimerize in close proximity to their partners, which results in strict constraints on the structure of the DNA-bound complex. Overall, our results indicate that chromatin openness profiles are highly predictive of cell-type-specific TF-TF interactions. Moreover, cooperative TF dimerization seems to be a widespread phenomenon, with multiple TF complexes predicted in most cell types. © 2013, Published by Cold Spring Harbor Laboratory Press.Link_to_subscribed_fulltex

    SOXE transcription factors form selective dimers on non-compact DNA motifs through multifaceted interactions between dimerization and high-mobility group domains

    Get PDF
    The SOXE transcription factors SOX8, SOX9 and SOX10 are master regulators of mammalian development directing sex determination, gliogenesis, pancreas specification and neural crest development. We identified a set of palindromic SOX binding sites specifically enriched in regulatory regions of melanoma cells. SOXE proteins homodimerize on these sequences with high cooperativity. In contrast to other transcription factor dimers, which are typically rigidly spaced, SOXE group proteins can bind cooperatively at a wide range of dimer spacings. Using truncated forms of SOXE proteins, we show that a single dimerization (DIM) domain, that precedes the DNA binding high mobility group (HMG) domain, is sufficient for dimer formation, suggesting that DIM:HMG rather than DIM:DIM interactions mediate the dimerization. All SOXE members can also heterodimerize in this fashion, whereas SOXE heterodimers with SOX2, SOX4, SOX6 and SOX18 are not supported. We propose a structural model where SOXE-specific intramolecular DIM:HMG interactions are allosterically communicated to the HMG of juxtaposed molecules. Collectively, SO XE factors evolved a unique mode to combinatorially regulate their target genes that relies on a multifaceted interplay between the HMG and DIM domains. This property potentially extends further the diversity of target genes and cell-specific functions that are regulated by SOXE proteins.Link_to_subscribed_fulltex

    Quantitative profiling of selective Sox/POU pairing on hundreds of sequences in parallel by Coop-seq

    Get PDF
    © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research. Cooperative binding of transcription factors is known to be important in the regulation of gene expression programs conferring cellular identities. However, current methods to measure cooperativity parameters have been laborious and therefore limited to studying only a few sequence variants at a time. We developed Coop-seq (cooperativity by sequencing) that is capable of efficiently and accurately determining the cooperativity parameters for hundreds of different DNA sequences in a single experiment. We apply Coop-seq to 12 dimer pairs from the Sox and POU families of transcription factors using 324 unique sequences with changed half-site orientation, altered spacing and discrete randomization within the binding elements. The study reveals specific dimerization profiles of different Sox factors with Oct4. By contrast, Oct4 and the three neural class III POU factors Brn2, Brn4 and Oct6 assemble with Sox2 in a surprisingly indistinguishable manner. Two novel half-site configurations can support functional Sox/Oct dimerization in addition to known composite motifs. Moreover, Coop-seq uncovers a nucleotide switch within the POU half-site when spacing is altered, which is mirrored in genomic loci bound by Sox2/Oct4 complexes.Link_to_subscribed_fulltex

    SOXE neofunctionalization and elaboration of the neural crest during chordate evolution

    Get PDF
    During chordate evolution, two genome-wide duplications facilitated acquisition of vertebrate traits, including emergence of neural crest cells (NCCs), in which neofunctionalization of the duplicated genes are thought to have facilitated development of craniofacial structures and the peripheral nervous system. How these duplicated genes evolve and acquire the ability to specify NC and their derivatives are largely unknown. Vertebrate SoxE paralogues, most notably Sox9/10, are essential for NC induction, delamination and lineage specification. In contrast, the basal chordate, amphioxus, has a single SoxE gene and lacks NC-like cells. Here, we test the hypothesis that duplication and divergence of an ancestral SoxE gene may have facilitated elaboration of NC lineages. By using an in vivo expression assay to compare effects of AmphiSoxE and vertebrate Sox9 on NC development, we demonstrate that all SOXE proteins possess similar DNA binding and homodimerization properties and can induce NCCs. However, AmphiSOXE is less efficient than SOX9 in transactivation activity and in the ability to preferentially promote glial over neuronal fate, a difference that lies within the combined properties of amino terminal and transactivation domains. We propose that acquisition of AmphiSoxE expression in the neural plate border led to NCC emergence while duplication and divergence produced advantageous mutations in vertebrate homologues, promoting elaboration of NC traits

    DNA-mediated cooperativity facilitates the co-selection of cryptic enhancer sequences by SOX2 and PAX6 transcription factors

    Get PDF
    © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research. Sox2 and Pax6 are transcription factors that direct cell fate decision during neurogenesis, yet the mechanism behind how they cooperate on enhancer DNA elements and regulate gene expression is unclear. By systematically interrogating Sox2 and Pax6 interaction on minimal enhancer elements, we found that cooperative DNA recognition relies on combinatorial nucleotide switches and precisely spaced, but cryptic composite DNA motifs. Surprisingly, all tested Sox and Pax paralogs have the capacity to cooperate on such enhancer elements. NMR and molecular modeling reveal very few direct protein-protein interactions between Sox2 and Pax6, suggesting that cooperative binding is mediated by allosteric interactions propagating through DNA structure. Furthermore, we detected and validated several novel sites in the human genome targeted cooperatively by Sox2 and Pax6. Collectively, we demonstrate that Sox- Pax partnerships have the potential to substantially alter DNA target specificities and likely enable the pleiotropic and context-specific action of these cell-lineage specifiers.Link_to_subscribed_fulltex

    Dissecting the role of distinct OCT4-SOX2 heterodimer configurations in pluripotency

    Get PDF
    The transcription factors OCT4 and SOX2 are required for generating induced pluripotent stem cells (iPSCs) and for maintaining embryonic stem cells (ESCs). OCT4 and SOX2 associate and bind to DNA in different configurations depending on the arrangement of their individual DNA binding elements. Here we have investigated the role of the different OCT4-SOX2-DNA assemblies in regulating and inducing pluripotency. To this end, we have generated SOX2 mutants that interfere with specific OCT4-SOX2 heterodimer configurations and assessed their ability to generate iPSCs and to rescue ESC self-renewal. Our results demonstrate that the OCT4-SOX2 configuration that dimerizes on a Hoxb1-like composite, a canonical element with juxtaposed individual binding sites, plays a more critical role in the induction and maintenance of pluripotency than any other OCT4-SOX2 configuration. Overall, the results of this study provide new insight into the protein interactions required to establish a de novo pluripotent network and to maintain a true pluripotent cell fate.Link_to_subscribed_fulltex

    Deciphering the Sox-Oct partner code by quantitative cooperativity measurements

    Get PDF
    Several Sox-Oct transcription factor (TF) combinations have been shown to cooperate on diverse enhancers to determine cell fates. Here, we developed a method to quantify biochemically the Sox-Oct cooperation and assessed the pairing of the high-mobility group (HMG) domains of 11 Sox TFs with Oct4 on a series of composite DNA elements. This way, we clustered Sox proteins according to their dimerization preferences illustrating that Sox HMG domains evolved different propensities to cooperate with Oct4. Sox2, Sox14, Sox21 and Sox15 strongly cooperate on the canonical element but compete with Oct4 on a recently discovered compressed element. Sry also cooperates on the canonical element but binds additively to the compressed element. In contrast, Sox17 and Sox4 cooperate more strongly on the compressed than on the canonical element. Sox5 and Sox18 show some cooperation on both elements, whereas Sox8 and Sox9 compete on both elements. Testing rationally mutated Sox proteins combined with structural modeling highlights critical amino acids for differential Sox-Oct4 partnerships and demonstrates that the cooperativity correlates with the efficiency in producing induced pluripotent stem cells. Our results suggest selective Sox-Oct partnerships in genome regulation and provide a toolset to study protein cooperation on DNA

    SOXE neofunctionalization and elaboration of the neural crest during chordate evolution

    Get PDF
    During chordate evolution, two genome-wide duplications facilitated acquisition of vertebrate traits, including emergence of neural crest cells (NCCs), in which neofunctionalization of the duplicated genes are thought to have facilitated development of craniofacial structures and the peripheral nervous system. How these duplicated genes evolve and acquire the ability to specify NC and their derivatives are largely unknown. Vertebrate SoxE paralogues, most notably Sox9/10, are essential for NC induction, delamination and lineage specification. In contrast, the basal chordate, amphioxus, has a single SoxE gene and lacks NC-like cells. Here, we test the hypothesis that duplication and divergence of an ancestral SoxE gene may have facilitated elaboration of NC lineages. By using an in vivo expression assay to compare effects of AmphiSoxE and vertebrate Sox9 on NC development, we demonstrate that all SOXE proteins possess similar DNA binding and homodimerization properties and can induce NCCs. However, AmphiSOXE is less efficient than SOX9 in transactivation activity and in the ability to preferentially promote glial over neuronal fate, a difference that lies within the combined properties of amino terminal and transactivation domains. We propose that acquisition of AmphiSoxE expression in the neural plate border led to NCC emergence while duplication and divergence produced advantageous mutations in vertebrate homologues, promoting elaboration of NC traits

    TherMos: Estimating protein-DNA binding energies from in vivo binding profiles

    Get PDF
    Accurately characterizing transcription factor (TF)-DNA affinity is a central goal of regulatory genomics. Although thermodynamics provides the most natural language for describing the continuous range of TF-DNA affinity, traditional motif discovery algorithms focus instead on classification paradigms that aim to discriminate 'bound' and 'unbound' sequences. Moreover, these algorithms do not directly model the distribution of tags in ChIP-seq data. Here, we present a new algorithm named Thermodynamic Modeling of ChIP-seq (TherMos), which directly estimates a positionspecific binding energy matrix (PSEM) from ChIPseq/exo tag profiles. In cross-validation tests on seven genome-wide TF-DNA binding profiles, one of which we generated via ChIP-seq on a complex developing tissue, TherMos predicted quantitative TF-DNA binding with greater accuracy than five well-known algorithms. We experimentally validated TherMos binding energy models for Klf4 and Esrrb, using a novel protocol to measure PSEMs in vitro. Strikingly, our measurements revealed strong nonadditivity at multiple positions within the two PSEMs. Among the algorithms tested, only TherMos was able to model the entire binding energy landscape of Klf4 and Esrrb. Our study reveals new insights into the energetics of TF-DNA binding in vivo and provides an accurate first-principles approach to binding energy inference from ChIP-seq and ChIP-exo data. © 2013 The Author(s).Link_to_subscribed_fulltex
    corecore