70 research outputs found

    Differential Gene Expression in the Siphonophore Nanomia bijuga (Cnidaria) Assessed with Multiple Next-Generation Sequencing Workflows

    Get PDF
    We investigated differential gene expression between functionally specialized feeding polyps and swimming medusae in the siphonophore Nanomia bijuga (Cnidaria) with a hybrid long-read/short-read sequencing strategy. We assembled a set of partial gene reference sequences from long-read data (Roche 454), and generated short-read sequences from replicated tissue samples that were mapped to the references to quantify expression. We collected and compared expression data with three short-read expression workflows that differ in sample preparation, sequencing technology, and mapping tools. These workflows were Illumina mRNA-Seq, which generates sequence reads from random locations along each transcript, and two tag-based approaches, SOLiD SAGE and Helicos DGE, which generate reads from particular tag sites. Differences in expression results across workflows were mostly due to the differential impact of missing data in the partial reference sequences. When all 454-derived gene reference sequences were considered, Illumina mRNA-Seq detected more than twice as many differentially expressed (DE) reference sequences as the tag-based workflows. This discrepancy was largely due to missing tag sites in the partial reference that led to false negatives in the tag-based workflows. When only the subset of reference sequences that unambiguously have tag sites was considered, we found broad congruence across workflows, and they all identified a similar set of DE sequences. Our results are promising in several regards for gene expression studies in non-model organisms. First, we demonstrate that a hybrid long-read/short-read sequencing strategy is an effective way to collect gene expression data when an annotated genome sequence is not available. Second, our replicated sampling indicates that expression profiles are highly consistent across field-collected animals in this case. Third, the impacts of partial reference sequences on the ability to detect DE can be mitigated through workflow choice and deeper reference sequencing

    The maternal and early embryonic transcriptome of the milkweed bug Oncopeltus fasciatus

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Most evolutionary developmental biology ("evo-devo") studies of emerging model organisms focus on small numbers of candidate genes cloned individually using degenerate PCR. However, newly available sequencing technologies such as 454 pyrosequencing have recently begun to allow for massive gene discovery in animals without sequenced genomes. Within insects, although large volumes of sequence data are available for holometabolous insects, developmental studies of basally branching hemimetabolous insects typically suffer from low rates of gene discovery.</p> <p>Results</p> <p>We used 454 pyrosequencing to sequence over 500 million bases of cDNA from the ovaries and embryos of the milkweed bug <it>Oncopeltus fasciatus</it>, which lacks a sequenced genome. This indirectly developing insect occupies an important phylogenetic position, branching basal to Diptera (including fruit flies) and Hymenoptera (including honeybees), and is an experimentally tractable model for short-germ development. 2,087,410 reads from both normalized and non-normalized cDNA assembled into 21,097 sequences (isotigs) and 112,531 singletons. The assembled sequences fell into 16,617 unique gene models, and included predictions of splicing isoforms, which we examined experimentally. Discovery of new genes plateaued after assembly of ~1.5 million reads, suggesting that we have sequenced nearly all transcripts present in the cDNA sampled. Many transcripts have been assembled at close to full length, and there is a net gain of sequence data for over half of the pre-existing <it>O. fasciatus </it>accessions for developmental genes in GenBank. We identified 10,775 unique genes, including members of all major conserved metazoan signaling pathways and genes involved in several major categories of early developmental processes. We also specifically address the effects of cDNA normalization on gene discovery in <it>de novo </it>transcriptome analyses.</p> <p>Conclusions</p> <p>Our sequencing, assembly and annotation framework provide a simple and effective way to achieve high-throughput gene discovery for organisms lacking a sequenced genome. These data will have applications to the study of the evolution of arthropod genes and genetic pathways, and to the wider evolution, development and genomics communities working with emerging model organisms.</p> <p>[The sequence data from this study have been submitted to GenBank under study accession number SRP002610 (<url>http://www.ncbi.nlm.nih.gov/sra?term=SRP002610</url>). Custom scripts generated are available at <url>http://www.extavourlab.com/protocols/index.html</url>. Seven Additional files are available.]</p

    De novo assembly and characterization of a maternal and developmental transcriptome for the emerging model crustacean Parhyale hawaiensis

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Arthropods are the most diverse animal phylum, but their genomic resources are relatively few. While the genome of the branchiopod <it>Daphnia pulex </it>is now available, no other large-scale crustacean genomic resources are available for comparison. In particular, genomic resources are lacking for the most tractable laboratory model of crustacean development, the amphipod <it>Parhyale hawaiensis</it>. Insight into shared and divergent characters of crustacean genomes will facilitate interpretation of future developmental, biomedical, and ecological research using crustacean models.</p> <p>Results</p> <p>To generate a transcriptome enriched for maternally provided and zygotically transcribed developmental genes, we created cDNA from ovaries and embryos of <it>P. hawaiensis</it>. Using 454 pyrosequencing, we sequenced over 1.1 billion bases of this cDNA, and assembled them <it>de novo </it>to create, to our knowledge, the second largest crustacean genomic resource to date. We found an unusually high proportion of C2H2 zinc finger-containing transcripts, as has also been reported for the genome of the pea aphid <it>Acyrthosiphon pisum</it>. Consistent with previous reports, we detected trans-spliced transcripts, but found that they did not noticeably impact transcriptome assembly. Our assembly products yielded 19,067 unique BLAST hits against <b>nr </b>(E-value cutoff e-10). These included over 400 predicted transcripts with significant similarity to <it>D. pulex </it>sequences but not to sequences of any other animal. Annotation of several hundred genes revealed <it>P. hawaiensis </it>homologues of genes involved in development, gametogenesis, and a majority of the members of six major conserved metazoan signaling pathways.</p> <p>Conclusions</p> <p>The amphipod <it>P. hawaiensis </it>has higher transcript complexity than known insect transcriptomes, and trans-splicing does not appear to be a major contributor to this complexity. We discuss the importance of a reliable comparative genomic framework within which to consider findings from new crustacean models such as <it>D. pulex </it>and <it>P. hawaiensis</it>, as well as the need for development of further substantial crustacean genomic resources.</p

    High through-put sequencing of the Parhyale hawaiensis mRNAs and microRNAs to aid comparative developmental studies

    Get PDF
    Understanding the genetic and evolutionary basis of animal morphological diversity will require comparative developmental studies that use new model organisms. This necessitates development of tools for the study of genetics and also the generation of sequence information of the organism to be studied. The development of next generation sequencing technology has enabled quick and cost effective generation of sequence information. Parhyale hawaiensis has emerged as a model organism of choice due to the development of advanced molecular tools, thus P. hawaiensis genetic information will help drive functional studies in this organism. Here we present a transcriptome and miRNA collection generated using next generation sequencing platforms. We generated approximately 1.7 million reads from a P. hawaiensis cDNA library constructed from embryos up to the germ band stage. These reads were assembled into a dataset comprising 163,501 transcripts. Using the combined annotation of Annot8r and pfam2go, Gene Ontology classifications was assigned to 20,597 transcripts. Annot8r was used to provide KEGG orthology to our transcript dataset. A total of 25,292 KEGG pathway assignments were defined and further confirmed with reciprocal blast against the NCBI nr protein database. This has identified many P. hawaiensis gene orthologs of key conserved signalling pathways involved in development. We also generated small RNA sequences from P. hawaiensis, identifying 55 conserved miRNAs. Sequenced small RNAs that were not annotated by stringent comparison to mirBase were used to search the Daphnia pulex for possible novel miRNAs. Using a conservative approach, we have identified 51 possible miRNA candidates conserved in the Daphnia pulex genome, which could be potential crustacean/arthropod specific miRNAs. Our study presents gene and miRNA discovery in a new model organism that does not have a sequenced genome. The data provided by our work will be valuable for the P. hawaiensis community as well as the wider evolutionary developmental biology community

    High-Throughput Sequencing to Reveal Genes Involved in Reproduction and Development in Bactrocera dorsalis (Diptera: Tephritidae)

    Get PDF
    BACKGROUND: Tephritid fruit flies in the genus Bactrocera are of major economic significance in agriculture causing considerable loss to the fruit and vegetable industry. Currently, there is no ideal control program. Molecular means is an effective method for pest control at present, but genomic or transcriptomic data for members of this genus remains limited. To facilitate molecular research into reproduction and development mechanisms, and finally effective control on these pests, an extensive transcriptome for the oriental fruit fly Bactrocera dorsalis was produced using the Roche 454-FLX platform. RESULTS: We obtained over 350 million bases of cDNA derived from the whole body of B. dorsalis at different developmental stages. In a single run, 747,206 sequencing reads with a mean read length of 382 bp were obtained. These reads were assembled into 28,782 contigs and 169,966 singletons. The mean contig size was 750 bp and many nearly full-length transcripts were assembled. Additionally, we identified a great number of genes that are involved in reproduction and development as well as genes that represent nearly all major conserved metazoan signal transduction pathways, such as insulin signal transduction. Furthermore, transcriptome changes during development were analyzed. A total of 2,977 differentially expressed genes (DEGs) were detected between larvae and pupae libraries, while there were 1,621 DEGs between adults and larvae, and 2,002 between adults and pupae. These DEGs were functionally annotated with KEGG pathway annotation and 9 genes were validated by qRT-PCR. CONCLUSION: Our data represent the extensive sequence resources available for B. dorsalis and provide for the first time access to the genetic architecture of reproduction and development as well as major signal transduction pathways in the Tephritid fruit fly pests, allowing us to elucidate the molecular mechanisms underlying courtship, ovipositing, development and detailed analyses of the signal transduction pathways

    Transcriptome Analysis of the Oriental Fruit Fly (Bactrocera dorsalis)

    Get PDF
    The oriental fruit fly, Bactrocera dorsalis (Hendel), is one of the most economically important pests in the world, causing serious damage to fruit production. However, lack of genetic information on this organism is an obstacle to understanding the mechanisms behind its development and its ability to resist insecticides. Analysis of the B. dorsalis transcriptome and its expression profile data is essential to extending the genetic information resources on this species, providing a shortcut that will support studies on B. dorsalis.We performed de novo assembly of a transcriptome using short read sequencing technology (Illumina). The results generated 484,628 contigs, 70,640 scaffolds, and 49,804 unigenes. Of those unigenes, 27,455 (55.13%) matched known proteins in the NCBI database, as determined by BLAST search. Clusters of orthologous groups (COG), gene orthology (GO), and the Kyoto Encyclopedia of Genes and Genomes (KEGG) annotations were performed to better understand the functions of these unigenes. Genes related to insecticide resistance were analyzed in additional detail. Digital gene expression (DGE) libraries showed differences in gene expression profiles at different developmental stages (eggs, third-instar larvae, pupae, and adults). To confirm the DGE results, the expression profiles of six randomly selected genes were analyzed.This transcriptome greatly improves our genetic understanding of B. dorsalis and makes a huge number of gene sequences available for further study, including both genes of known importance and genes of unknown function. The DGE data provide comprehensive insight into gene expression profiles at different developmental stages. This facilitates the study of the role of each gene in the developmental process and in insecticide resistance

    Global Transcriptome Profiling of the Pine Shoot Beetle, Tomicus yunnanensis (Coleoptera: Scolytinae)

    Get PDF
    Background: The pine shoot beetle Tomicus yunnanensis (Coleoptera: Scolytinae) is an economically important pest of Pinus yunnanensis in southwestern China. Developed resistance to insecticides due to chemical pesticides being used for a long time is a factor involved in its serious damage, which poses a challenge for management. In addition, highly efficient adaptation to divergent environmental ecologies results in this pest posing great potential threat to pine forests. However, the molecular mechanisms remain unknown as only limited nucleotide sequence data for this species is available. Methodology/Principal Findings: In this study, we applied next generation sequencing (Illumina sequencing) to sequence the adult transcriptome of T. yunnanensis. A total of 51,822,230 reads were obtained. They were assembled into 140,702 scaffolds, and 60,031 unigenes. The unigenes were further functionally annotated with gene descriptions, Gene Ontology (GO), Clusters of Orthologous Groups (COG), and Kyoto Encyclopedia of Genes and Genome (KEGG). In total, 80,932 unigenes were classified into GO, 13,599 unigenes were assigned to COG, and 33,875 unigenes were found in KO categories. A biochemical pathway database containing 219 predicted pathways was also created based on the annotations. In depth analysis of the data revealed a large number of genes related to insecticides resistance and heat shock protein genes associated with environmental stress. Conclusions/Significance: The results facilitate the investigations of molecular resistance mechanisms to insecticides an

    Generation of a reference transcriptome for evaluating rainbow trout responses to various stressors

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Fish under intensive culture conditions are exposed to a variety of acute and chronic stressors, including high rearing densities, sub-optimal water quality, and severe thermal fluctuations. Such stressors are inherent in aquaculture production and can induce physiological responses with adverse effects on traits important to producers and consumers, including those associated with growth, nutrition, reproduction, immune response, and fillet quality. Understanding and monitoring the biological mechanisms underlying stress responses will facilitate alleviating their negative effects through selective breeding and changes in management practices, resulting in improved animal welfare and production efficiency.</p> <p>Results</p> <p>Physiological responses to five treatments associated with stress were characterized by measuring plasma lysozyme activity, glucose, lactate, chloride, and cortisol concentrations, in addition to stress-associated transcripts by quantitative PCR. Results indicate that the fish had significant stressor-specific changes in their physiological conditions. Sequencing of a pooled normalized transcriptome library created from gill, brain, liver, spleen, kidney and muscle RNA of control and stressed fish produced 3,160,306 expressed sequence tags which were assembled and annotated. SNP discovery resulted in identification of ~58,000 putative single nucleotide polymorphisms including 24,479 which were predicted to fall within exons. Of these, 4907 were predicted to occupy the first position of a codon and 4110 the second, increasing the probability to impact amino acid sequence variation and potentially gene function.</p> <p>Conclusion</p> <p>We have generated and characterized a reference transcriptome for rainbow trout that represents multiple tissues responding to multiple stressors common to aquaculture production environments. This resource compliments existing public transcriptome data and will facilitate approaches aiming to evaluate gene expression associated with stress in this species.</p
    corecore