
    The ELT-2 GATA-factor and the global regulation of transcription in the C. elegans intestine

    Abstract: A SAGE library was prepared from hand-dissected intestines from adult Caenorhabditis elegans, allowing the identification of >4000 intestinally-expressed genes; this gene inventory provides fundamental information for understanding intestine function, structure and development. Intestinally-expressed genes fall into two broad classes: widely-expressed “housekeeping” genes and genes that are either intestine-specific or significantly intestine-enriched. Within this latter class of genes, we identified a subset of highly-expressed highly-validated genes that are expressed either exclusively or primarily in the intestine. Over half of the encoded proteins are candidates for secretion into the intestinal lumen to hydrolyze the bacterial food (e.g. lysozymes, amoebapores, lipases and especially proteases). The promoters of this subset of intestine-specific/intestine-enriched genes were analyzed computationally, using both a word-counting method (RSAT oligo-analysis) and a method based on Gibbs sampling (MotifSampler). Both methods returned the same over-represented site, namely an extended GATA-related sequence of the general form AHTGATAARR, which agrees with experimentally determined cis-acting control sequences found in intestine genes over the past 20 years. All promoters in the subset contain such a site, compared to <5% for control promoters; moreover, our analysis suggests that the majority (perhaps all) of genes expressed exclusively or primarily in the worm intestine are likely to contain such a site in their promoters. There are three zinc-finger GATA-type factors that are candidates to bind this extended GATA site in the differentiating C. elegans intestine: ELT-2, ELT-4 and ELT-7. All evidence points to ELT-2 being the most important of the three. 
We show that worms in which both the elt-4 and the elt-7 genes have been deleted from the genome are essentially wildtype, demonstrating that ELT-2 provides all essential GATA-factor functions in the intestine. The SAGE analysis also identifies more than a hundred other transcription factors in the adult intestine but few show an RNAi-induced loss-of-function phenotype and none (other than ELT-2) show a phenotype primarily in the intestine. We thus propose a simple model in which the ELT-2 GATA factor directly participates in the transcription of all intestine-specific/intestine-enriched genes, from the early embryo through to the dying adult. Other intestinal transcription factors would thus modulate the action of ELT-2, depending on the worm's nutritional and physiological needs.
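
The consensus AHTGATAARR reported in this abstract is written in IUPAC nucleotide codes (H = A, C or T; R = A or G). As a rough illustration of what "a promoter contains such a site" could mean in practice, the sketch below scans a sequence on both strands for exact matches to that consensus. It is not the RSAT oligo-analysis or MotifSampler procedure the authors used; the function names and the example sequences are hypothetical.

```python
import re

# Standard IUPAC nucleotide codes mapped to regex character classes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "[AG]", "Y": "[CT]", "S": "[CG]", "W": "[AT]",
    "K": "[GT]", "M": "[AC]", "B": "[CGT]", "D": "[AGT]",
    "H": "[ACT]", "V": "[ACG]", "N": "[ACGT]",
}
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an upper-case A/C/G/T string."""
    return seq.translate(_COMPLEMENT)[::-1]


def iupac_to_regex(consensus):
    """Compile an IUPAC consensus string into a regular expression."""
    return re.compile("".join(IUPAC[base] for base in consensus.upper()))


def find_sites(promoter, consensus="AHTGATAARR"):
    """List (start, strand, window) for every window matching the consensus.

    The start is always the 0-based offset on the given (forward) strand;
    strand "-" means the reverse complement of the window matched.
    """
    promoter = promoter.upper()
    pattern = iupac_to_regex(consensus)
    width = len(consensus)
    hits = []
    for start in range(len(promoter) - width + 1):
        window = promoter[start:start + width]
        if pattern.fullmatch(window):
            hits.append((start, "+", window))
        elif pattern.fullmatch(reverse_complement(window)):
            hits.append((start, "-", window))
    return hits


# Hypothetical toy sequences, not real C. elegans promoters.
print(find_sites("CCCACTGATAAGGTTT"))
print(find_sites("CCTTATCAGT"))
```

Exact consensus matching is the simplest possible choice here; a position weight matrix would capture graded preferences at each position, but the abstract only reports the consensus string.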

    An Integrated Strategy to Study Muscle Development and Myofilament Structure in Caenorhabditis elegans

    A crucial step in the development of muscle cells in all metazoan animals is the assembly and anchorage of the sarcomere, the essential repeat unit responsible for muscle contraction. In Caenorhabditis elegans, many of the critical proteins involved in this process have been uncovered through mutational screens focusing on uncoordinated movement and embryonic arrest phenotypes. We propose that additional sarcomeric proteins exist for which there is a less severe, or entirely different, mutant phenotype produced in their absence. We have used Serial Analysis of Gene Expression (SAGE) to generate a comprehensive profile of late embryonic muscle gene expression. We generated two replicate long SAGE libraries for sorted embryonic muscle cells, identifying 7,974 protein-coding genes. A refined list of 3,577 genes expressed in muscle cells was compiled from the overlap between our SAGE data and available microarray data. Using the genes in our refined list, we have performed two separate RNA interference (RNAi) screens to identify novel genes that play a role in sarcomere assembly and/or maintenance in either embryonic or adult muscle. To identify muscle defects in embryos, we screened specifically for the Pat embryonic arrest phenotype. To visualize muscle defects in adult animals, we fed dsRNA to worms producing a GFP-tagged myosin protein, thus allowing us to analyze their myofilament organization under gene knockdown conditions using fluorescence microscopy. By eliminating or severely reducing the expression of 3,300 genes using RNAi, we identified 122 genes necessary for proper myofilament organization, 108 of which are genes without a previously characterized role in muscle. Many of the genes affecting sarcomere integrity have human homologs about which little or nothing is known.

    Cloning and annotation of novel transcripts from human embryonic stem cells

    Both cDNA tag-based and DNA chip hybridization assays have revealed widespread transcriptional activity across mammalian genomes, providing a rich source of novel protein-coding and non-coding transcripts. Annotation and functional evaluation of this undefined transcriptome space represents a major step towards the comprehensive definition of biomolecules regulating the properties of living cells, including embryonic stem cells (ESCs) and their derivatives. In this study I analysed 87 rare mRNA transcripts from human ESCs that mapped uniquely to the human genome, in regions lacking evidence for known genes or transcripts. In addition, the transcripts appeared enriched in the hESC transcriptome as enumerated by serial analysis of gene expression (SAGE). Full-length transcripts corresponding to twelve novel LongSAGE tags were recovered and evaluated with respect to gene structure, protein-coding potential, and gene regulatory features. In addition, transcript abundance was compared between RNA isolated from undifferentiated hESCs and differentiated cells. Analysis of full-length transcripts revealed that the novel ORFs did not exceed a size of 129 amino acids and no matches were observed to well characterized protein domains. Interesting protein level predictions included small disulfide-bonded proteins, known members of which are important in a variety of biological processes. Transcripts evaluated for differential expression by real-time RT-qPCR (Reverse Transcription followed by real-time quantitative Polymerase Chain Reaction) were found to be variably expressed (0.2- to 4.5-fold) in Day-2 or Day-4 retinoic acid-induced differentiation cultures compared to undifferentiated hESCs. Relative quantitation using a universal reference RNA (derived from pooled adult tissues) showed large differences in novel transcript levels (0.002- to 35-fold) compared to hESCs. 
Collectively, these results provide a detailed analysis of a set of novel hESC transcripts and their abundance in early and adult differentiated cell types, both of which may advance our understanding of the transcriptional events governing stem cell behavior. (Medicine, Faculty of; Medical Genetics, Department of; Graduate)

    Charting clonal heterogeneity in breast cancers: from bulk tumor genomes to single-cell genotypes

    Traditional classifications and treatment of human cancers have operated with limitations surrounding tumor homogeneity and mutational stasis. Clinical metrics of malignant tumors focused on descriptive and behavioral properties such as tissue of origin, cellular morphologic features and extent of spread. Missing has been an understanding of the dynamics of cellular subpopulations that underpin divergent functional properties in space and time. This dissertation is focused on the development and application of methods, including next generation DNA sequencing, computational modeling, and single-cell genotyping protocols to elucidate breast tumor heterogeneity and clonal evolution at single nucleotide and single-cell resolution. First, I present advances in our knowledge of the mutational spectrum that may occur and evolve in an individual epithelial cancer, namely a lobular breast cancer metastasis and matched primary tumor separated by a nine-year interval. This seminal study demonstrated clonal evolution in a patient’s breast cancer and the successful application of targeted deep sequencing for determining digital allelic prevalences and clonal genotypes in bulk tumors. Second, I describe the diversity of genomic sequence and clonal heterogeneity in tumors of the triple-negative breast cancer subtype. The study uncovered wide clonal diversity in these primary tumors at first diagnosis. Third, I demonstrate, via genotyping of single tumor cells, that computational inferences of tumor clonal architecture can be made reliably from bulk tissue-derived data sets. This was performed using both somatic point mutations and loss of heterozygosity loci as clonal marks. And fourth, I applied single-cell analysis to study the clonal evolution in breast tumor murine xenografts following engraftment and serial passaging. This research uncovered a range of outcomes in tumor clonal composition upon initial engraftment and serial passaging. 
The same clonal groups were found to arise independently in separate xenografts derived from the same primary tumor, suggesting selection of functionally significant genotypes. Comprehensive capabilities in the measurement and analysis of clonal structure in cancers offer improved classification and combinatorial treatments of subpopulations in heterogeneous tumors and better use of murine xenograft models. Functionally relevant subpopulations of tumor cells, irrespective of numerical abundance or spatiotemporal persistence, can thereby be targeted using clonally informative genomic profiles. (Medicine, Faculty of; Pathology and Laboratory Medicine, Department of; Graduate)

    Systematic Recovery and Analysis of Full-ORF Human cDNA Clones

    The Mammalian Gene Collection (MGC) consortium (http://mgc.nci.nih.gov) seeks to establish publicly available collections of full-ORF cDNAs for several organisms of significance to biomedical research, including human. To date over 15,200 human cDNA clones containing full-length open reading frames (ORFs) have been identified via systematic expressed sequence tag (EST) analysis of a diverse set of cDNA libraries; however, further systematic EST analysis is no longer an efficient method for identifying new cDNAs. As part of our involvement in the MGC program, we have developed a scalable method for targeted recovery of cDNA clones to facilitate recovery of genes absent from the MGC collection. First, cDNA is synthesized from various RNAs, followed by polymerase chain reaction (PCR) amplification of transcripts in 96-well plates using gene-specific primer pairs flanking the ORFs. Amplicons are cloned into a sequencing vector, and full-length sequences are obtained. Sequences are processed and assembled using Phred and Phrap, and analyzed using Consed and a number of bioinformatics methods we have developed. Sequences are compared with the Reference Sequence (RefSeq) database, and validation of sequence discrepancies is attempted using other sequence databases including dbEST and dbSNP. Clones with identical sequence to RefSeq or containing only validated changes will become part of the MGC human gene collection. Clones containing novel splice variants or polymorphisms have also been identified. Our approach to clone recovery, applied at large scale, has the potential to recover many and possibly most of the genes absent from the MGC collection.

    Identification of ciliary and ciliopathy genes in Caenorhabditis elegans through comparative genomics

    Background: The recent availability of genome sequences of multiple related Caenorhabditis species has made it possible to identify, using comparative genomics, similarly transcribed genes in Caenorhabditis elegans and its sister species. Taking this approach, we have identified numerous novel ciliary genes in C. elegans, some of which may be orthologs of unidentified human ciliopathy genes. Results: By screening for genes possessing canonical X-box sequences in promoters of three Caenorhabditis species, namely C. elegans, C. briggsae and C. remanei, we identified 93 genes (including known X-box regulated genes) that encode putative components of ciliated neurons in C. elegans and are subject to the same regulatory control. For many of these genes, restricted anatomical expression in ciliated cells was confirmed, and control of transcription by the ciliogenic DAF-19 RFX transcription factor was demonstrated by comparative transcriptional profiling of different tissue types and of daf-19(+) and daf-19(-) animals. Finally, we demonstrate that the dye-filling defect of dyf-5(mn400) animals, which is indicative of compromised exposure of cilia to the environment, is caused by a nonsense mutation in the serine/threonine protein kinase gene M04C9.5. Conclusion: Our comparative genomics-based predictions may be useful for identifying genes involved in human ciliopathies, including Bardet-Biedl Syndrome (BBS), since the C. elegans orthologs of known human BBS genes contain X-box motifs and are required for normal dye filling in C. elegans ciliated neurons. (Science, Faculty of; Zoology, Department of; Non UBC; Reviewed; Faculty)

    Functional Genomics of the Cilium, a Sensory Organelle

    Summary: Cilia and flagella play important roles in many physiological processes, including cell and fluid movement, sensory perception, and development [1]. The biogenesis and maintenance of cilia depend on intraflagellar transport (IFT), a motility process that operates bidirectionally along the ciliary axoneme [1, 2]. Disruption in IFT and cilia function causes several human disorders, including polycystic kidneys, retinal dystrophy, neurosensory impairment, and Bardet-Biedl syndrome (BBS) [3–5]. To uncover new ciliary components, including IFT proteins, we compared C. elegans ciliated neuronal and nonciliated cells through serial analysis of gene expression (SAGE) and screened for genes potentially regulated by the ciliogenic transcription factor, DAF-19 [6]. Using these complementary approaches, we identified numerous candidate ciliary genes and confirmed the ciliated-cell-specific expression of 14 novel genes. One of these, C27H5.7a, encodes a ciliary protein that undergoes IFT. As with other IFT proteins, its ciliary localization and transport are disrupted by mutations in IFT and bbs genes. Furthermore, we demonstrate that the ciliary structural defect of C. elegans dyf-13(mn396) mutants is caused by a mutation in C27H5.7a. Together, our findings help define a ciliary transcriptome and suggest that DYF-13, an evolutionarily conserved protein, is a novel core IFT component required for cilia function.