76 research outputs found

    A compendium of Caenorhabditis elegans regulatory transcription factors: a resource for mapping transcription regulatory networks

    Background: Transcription regulatory networks are composed of interactions between transcription factors and their target genes. Whereas unicellular networks have been studied extensively, metazoan transcription regulatory networks remain largely unexplored. Caenorhabditis elegans provides a powerful model to study such metazoan networks because its genome is completely sequenced and many functional genomic tools are available. While C. elegans gene predictions have undergone continuous refinement, this is not true for the annotation of functional transcription factors. The comprehensive identification of transcription factors is essential for the systematic mapping of transcription regulatory networks because it enables the creation of physical transcription factor resources that can be used in assays to map interactions between transcription factors and their target genes. Results: By computational searches and extensive manual curation, we have identified a compendium of 934 transcription factor genes (referred to as wTF2.0). We find that manual curation drastically reduces the number of both false positive and false negative transcription factor predictions. We discuss how transcription factor splice variants and dimer formation may affect the total number of functional transcription factors. In contrast to mouse transcription factor genes, we find that C. elegans transcription factor genes do not undergo significantly more splicing than other genes. This difference may contribute to differences in organism complexity. We identify candidate redundant worm transcription factor genes and orthologous worm and human transcription factor pairs. Finally, we discuss how wTF2.0 can be used together with physical transcription factor clone resources to facilitate the systematic mapping of C. elegans transcription regulatory networks.
Conclusion: wTF2.0 provides a starting point to decipher the transcription regulatory networks that control metazoan development and function.

    Multiple transcription factors directly regulate Hox gene lin-39 expression in ventral hypodermal cells of the C. elegans embryo and larva, including the hypodermal fate regulators LIN-26 and ELT-6

    BACKGROUND: Hox genes encode master regulators of regional fate specification during early metazoan development. Much is known about the initiation and regulation of Hox gene expression in Drosophila and vertebrates, but less is known in the non-arthropod invertebrate model system, C. elegans. The C. elegans Hox gene lin-39 is required for correct fate specification in the midbody region, including the Vulval Precursor Cells (VPCs). To better understand lin-39 regulation and function, we aimed to identify transcription factors necessary for lin-39 expression in the VPCs, and in particular sought factors that initiate lin-39 expression in the embryo. RESULTS: We used the yeast one-hybrid (Y1H) method to screen for factors that bound to 13 fragments from the lin-39 region: twelve fragments contained sequences conserved between C. elegans and two other nematode species, while one fragment was known to drive reporter gene expression in the early embryo in cells that generate the VPCs. Sixteen transcription factors that bind to eight lin-39 genomic fragments were identified in yeast, and we characterized several factors by verifying their physical interactions in vitro, and showing that reduction of their function leads to alterations in lin-39 levels and lin-39::GFP reporter expression in vivo. Three factors, the orphan nuclear hormone receptor NHR-43, the hypodermal fate regulator LIN-26, and the GATA factor ELT-6 positively regulate lin-39 expression in the embryonic precursors to the VPCs. In particular, ELT-6 interacts with an enhancer that drives GFP expression in the early embryo, and the ELT-6 site we identified is necessary for proper embryonic expression. These three factors, along with the factors ZTF-17, BED-3 and TBX-9, also positively regulate lin-39 expression in the larval VPCs. 
CONCLUSIONS: These results significantly expand the number of factors known to directly bind and regulate lin-39 expression, identify the first factors required for lin-39 expression in the embryo, and hint at a positive feedback mechanism involving GATA factors that maintains lin-39 expression in the vulval lineage. This work indicates that, as in other organisms, the regulation of Hox gene expression in C. elegans is complicated, redundant and robust.

    Transcription factor binding to Caenorhabditis elegans first introns reveals lack of redundancy with gene promoters

    Gene expression is controlled through the binding of transcription factors (TFs) to regulatory genomic regions. First introns are longer than other introns in multiple eukaryotic species and are under selective constraint. Here we explore the importance of first introns in TF binding in the nematode Caenorhabditis elegans by combining computational predictions and experimentally derived TF-DNA interaction data. We found that first introns of C. elegans genes, particularly those for families enriched in long first introns, are more conserved in length, have more conserved predicted TF interactions and are bound by more TFs than other introns. We detected a significant positive correlation between first intron size and the number of TF interactions obtained from chromatin immunoprecipitation assays or determined by yeast one-hybrid assays. TFs that bind first introns are largely different from those binding promoters, suggesting that the different interactions are complementary rather than redundant. By combining first intron and promoter interactions, we found that genes that share a large fraction of TF interactions are more likely to be co-expressed than when only TF interactions with promoters are considered. Altogether, our data suggest that C. elegans gene regulation may be additive through the combined effects of multiple regulatory regions.

    Mapping and analysis of Caenorhabditis elegans transcription factor sequence specificities

    Caenorhabditis elegans is a powerful model for studying gene regulation, as it has a compact genome and a wealth of genomic tools. However, identification of regulatory elements has been limited, as DNA-binding motifs are known for only 71 of the estimated 763 sequence-specific transcription factors (TFs). To address this problem, we performed protein binding microarray experiments on representatives of canonical TF families in C. elegans, obtaining motifs for 129 TFs. Additionally, we predict motifs for many TFs that have DNA-binding domains similar to those already characterized, increasing coverage of binding specificities to 292 C. elegans TFs (~40%). These data highlight the diversification of binding motifs for the nuclear hormone receptor and C2H2 zinc finger families, and reveal unexpected diversity of motifs for T-box and DM families. Motif enrichment in promoters of functionally related genes is consistent with known biology, and also identifies putative regulatory roles for unstudied TFs.

    Using a structural and logics systems approach to infer bHLH–DNA binding specificity determinants

    Numerous efforts are underway to determine gene regulatory networks that describe physical relationships between transcription factors (TFs) and their target DNA sequences. Members of paralogous TF families typically recognize similar DNA sequences. Knowledge of the molecular determinants of protein–DNA recognition by paralogous TFs is of central importance for understanding how small differences in DNA specificities can dictate target gene selection. Previously, we determined the in vitro DNA binding specificities of 19 Caenorhabditis elegans basic helix-loop-helix (bHLH) dimers using protein binding microarrays. These TFs bind E-box (CANNTG) and E-box-like sequences. Here, we combine these data with logics, bHLH–DNA co-crystal structures and computational modeling to infer which bHLH monomer can interact with which CAN E-box half-site and we identify a critical residue in the protein that dictates this specificity. Validation experiments using mutant bHLH proteins provide support for our inferences. Our study provides insights into the mechanisms of DNA recognition by bHLH dimers as well as a blueprint for system-level studies of the DNA binding determinants of other TF families in different model organisms and humans. Funding: National Institute of General Medical Sciences (U.S.) (DK068429); National Institute of General Medical Sciences (U.S.) (HG003985); European Union (PROSPECTS HEALTH-F4-2008-201648).

    A Widespread Distribution of Genomic CeMyoD Binding Sites Revealed and Cross Validated by ChIP-Chip and ChIP-Seq Techniques

    Identifying transcription factor binding sites genome-wide using chromatin immunoprecipitation (ChIP)-based technology is becoming an increasingly important tool in addressing developmental questions. However, technical problems associated with factor abundance and suitable ChIP reagents are common obstacles to these studies in many biological systems. We have used two completely different, widely applicable methods to determine by ChIP the genome-wide binding sites of the master myogenic regulatory transcription factor HLH-1 (CeMyoD) in C. elegans embryos. The two approaches, ChIP-seq and ChIP-chip, yield strongly overlapping results revealing that HLH-1 preferentially binds to promoter regions of genes enriched for E-box sequences (CANNTG), known binding sites for this well-studied class of transcription factors. HLH-1 binding sites were enriched upstream of genes known to be expressed in muscle, consistent with its role as a direct transcriptional regulator. HLH-1 binding was also detected at numerous sites unassociated with muscle gene expression, as has been previously described for its mouse homolog MyoD. These binding sites may reflect several additional functions for HLH-1, including its interactions with one or more co-factors to activate (or repress) gene expression or a role in chromatin organization distinct from direct transcriptional regulation of target genes. Our results also provide a comparison of ChIP methodologies that can overcome limitations commonly encountered in these types of studies while highlighting the complications of assigning in vivo functions to identified target sites.
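
The E-box consensus (CANNTG) named in the two abstracts above can be located in a sequence with a simple pattern scan. The following is a minimal illustrative sketch, not the method used in either study: the function names `find_eboxes` and `half_sites` and the example sequence are hypothetical. Because CANNTG is its own reverse complement at the consensus level, scanning one strand is enough to locate sites; the second CAN half-site is read from the opposite strand.

```python
import re

# E-box consensus CANNTG, where N is any base (A, C, G or T).
# The lookahead lets the scan report overlapping sites as well.
EBOX = re.compile(r"(?=(CA[ACGT]{2}TG))")

# Translation table used to complement a DNA string.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def revcomp(seq):
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def find_eboxes(seq):
    """Return (0-based start, site) for every CANNTG match in seq."""
    seq = seq.upper()
    return [(m.start(), m.group(1)) for m in EBOX.finditer(seq)]


def half_sites(site):
    """Split a 6-bp E-box into its two CAN half-sites.

    The second half is read on the opposite strand, so it is
    reverse-complemented to give it the same CAN orientation.
    """
    return site[:3], revcomp(site[3:])


if __name__ == "__main__":
    promoter = "ttgaCAGCTGaaCACGTGcc"  # made-up sequence, for illustration only
    for start, site in find_eboxes(promoter):
        print(start, site, half_sites(site))
```

A scan like this only flags candidate sites; as the abstract notes, sequence matches alone do not establish in vivo binding or function.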

    Cdx ParaHox genes acquired distinct developmental roles after gene duplication in vertebrate evolution

    BACKGROUND: The functional consequences of whole genome duplications in vertebrate evolution are not fully understood. It remains unclear, for instance, why paralogues were retained in some gene families but extensively lost in others. Cdx homeobox genes encode conserved transcription factors controlling posterior development across diverse bilaterians. These genes are part of the ParaHox gene cluster. Multiple Cdx copies were retained after genome duplication, raising questions about how functional divergence, overlap, and redundancy respectively contributed to their retention and evolutionary fate. RESULTS: We examined the degree of regulatory and functional overlap between the three vertebrate Cdx genes using single and triple morpholino knock-down in Xenopus tropicalis followed by RNA-seq. We found that one paralogue, Cdx4, has a much stronger effect on gene expression than the others, including a strong regulatory effect on FGF and Wnt genes. Functional annotation revealed distinct and overlapping roles and subtly different temporal windows of action for each gene. The data also reveal a colinear-like effect of Cdx genes on Hox genes, with repression of Hox paralogy groups 1 and 2, and activation increasing from Hox group 5 to 11. We also highlight cases in which duplicated genes regulate distinct paralogous targets revealing pathway elaboration after whole genome duplication. CONCLUSIONS: Despite shared core pathways, Cdx paralogues have acquired distinct regulatory roles during development. This implies that the degree of functional overlap between paralogues is relatively low and that gene expression pattern alone should be used with caution when investigating the functional evolution of duplicated genes. We therefore suggest that developmental programmes were extensively rewired after whole genome duplication in the early evolution of vertebrates.

    OrthoList: A Compendium of C. elegans Genes with Human Orthologs

    C. elegans is an important model for genetic studies relevant to human biology and disease. We sought to assess the orthology between C. elegans and human genes to better understand the relationship between their genomes and to generate a compelling list of candidates to streamline RNAi-based screens in this model. We performed a meta-analysis of results from four orthology prediction programs and generated a compendium, "OrthoList", containing 7,663 C. elegans protein-coding genes. Various assessments indicate that OrthoList has extensive coverage with low false-positive and false-negative rates. Part of this evaluation examined the conservation of components of the receptor tyrosine kinase, Notch, Wnt, TGF-β and insulin signaling pathways, and led us to update compendia of conserved C. elegans kinases, nuclear hormone receptors, F-box proteins, and transcription factors. Comparison with two published genome-wide RNAi screens indicated that virtually all of the conserved hits would have been obtained had just the OrthoList set (∼38% of the genome) been targeted. We compiled OrthoList by InterPro domains and Gene Ontology annotation, making it easy to identify C. elegans orthologs of human disease genes for potential functional analysis. We anticipate that OrthoList will be of considerable utility to C. elegans researchers for streamlining RNAi screens, by focusing on genes with apparent human orthologs, thus reducing screening effort by ∼60%. Moreover, we find that OrthoList provides a useful basis for annotating orthology and reveals more C. elegans orthologs of human genes in various functional groups, such as transcription factors, than previously described.

    The Homeobox Protein CEH-23 Mediates Prolonged Longevity in Response to Impaired Mitochondrial Electron Transport Chain in C. elegans

    Recent findings indicate that perturbations of the mitochondrial electron transport chain (METC) can cause extended longevity in evolutionarily diverse organisms. To uncover the molecular basis of how altered METC increases lifespan in C. elegans, we performed an RNAi screen and revealed that three predicted transcription factors are specifically required for the extended longevity of mitochondrial mutants. In particular, we demonstrated that the nuclear homeobox protein CEH-23 uniquely mediates the longevity but not the slow development, reduced brood size, or resistance to oxidative stress associated with mitochondrial mutations. Furthermore, we showed that ceh-23 expression levels are responsive to altered METC, and enforced overexpression of ceh-23 is sufficient to extend lifespan in a wild-type background. Our data point to mitochondria-to-nucleus communication as key to longevity determination and highlight CEH-23 as a novel longevity factor capable of responding to mitochondrial perturbations. These findings provide a new paradigm for how mitochondria impact aging and age-dependent diseases.