51,279 research outputs found
Recommended from our members
Improving nanopore read accuracy with the R2C2 method enables the sequencing of highly multiplexed full-length single-cell cDNA.
High-throughput short-read sequencing has revolutionized how transcriptomes are quantified and annotated. However, while Illumina short-read sequencers can be used to analyze entire transcriptomes down to the level of individual splicing events with great accuracy, they fall short of analyzing how these individual events are combined into complete RNA transcript isoforms. Because of this shortfall, long-distance information is required to complement short-read sequencing to analyze transcriptomes on the level of full-length RNA transcript isoforms. While long-read sequencing technology can provide this long-distance information, there are issues with both Pacific Biosciences (PacBio) and Oxford Nanopore Technologies (ONT) long-read sequencing technologies that prevent their widespread adoption. Briefly, PacBio sequencers produce low numbers of reads with high accuracy, while ONT sequencers produce higher numbers of reads with lower accuracy. Here, we introduce and validate a long-read ONT-based sequencing method. At the same cost, our Rolling Circle Amplification to Concatemeric Consensus (R2C2) method generates more accurate reads of full-length RNA transcript isoforms than any other available long-read sequencing method. These reads can then be used to generate isoform-level transcriptomes for both genome annotation and differential expression analysis in bulk or single-cell samples
Transcriptome pathways unique to dehydration tolerant relatives of modern wheat
Among abiotic stressors, drought is a major factor responsible for dramatic yield loss in agriculture. In order to reveal differences in global expression profiles of drought tolerant and sensitive wild emmer wheat genotypes, a previously deployed shock-like dehydration process was utilized to compare transcriptomes at two time points in root and leaf tissues using the Affymetrix GeneChip(R) Wheat Genome Array hybridization. The comparison of transcriptomes reveal several unique genes or expression patterns such as differential usage of IP(3)-dependent signal transduction pathways, ethylene- and abscisic acid (ABA)-dependent signaling, and preferential or faster induction of ABA-dependent transcription factors by the tolerant genotype that distinguish contrasting genotypes indicative of distinctive stress response pathways. The data also show that wild emmer wheat is capable of engaging known drought stress responsive mechanisms. The global comparison of transcriptomes in the absence of and after dehydration underlined the gene networks especially in root tissues that may have been lost in the selection processes generating modern bread wheats
Evaluation of diversity, specialization, and gene specificity in transcriptomes
The transcriptome is a set of genes transcribed in a given tissue under specific conditions and can be characterized by a list of genes with their corresponding frequencies of transcription. Transcriptome changes can be measured by counting gene tags from mRNA libraries or by measuring light signals in DNA microarrays. Recently we proposed an approach to define and estimate the diversity and specialization of transcriptomes and gene specificity. This approach can be useful for the determination and measure of transcriptional networks. We defined transcriptome diversity as the Shannon entropy of its frequency distribution. Gene specificity is defined as the mutual information between the tissues and the corresponding transcript, allowing detection of either housekeeping or highly specific genes and clarifying the meaning of these concepts in the literature. Tissue specialization is measured by average gene specificity. Visualization of the positions of transcriptomes in a system of diversity and specialization coordinates makes it possible to understand at a glance their interrelations, summarizing in a powerful way which transcriptomes are richer in diversity of expressed genes, or which are relatively more specialized. This enlightens the relation among transcriptomes, allowing a better understanding of their changes through the development of the organism or in response to environmental stimuli. We present statistical tools based on resampling procedures to obtain confidence intervals for the parameters as well as perform statistical test. These approaches are illustrated with a human dataset
European sea bass (Dicentrarchus labrax) skin and scale transcriptomes
Fish skin and their appendages, the mineralized scales, are important organs for protection and homeostasis, but little is known about their specific transcript or protein repertoire. This study used RNA-seq to generate transcriptome data for skin and scales in the European sea bass (Dicentrarchus labrax), an important species for fisheries and aquaculture. RNA was extracted from the pectoral skin and from scales collected above the midline of immature one-year old sea bass. More than 20 x 10(6) reads were obtained for each tissue, using RNA-seq Illumina technology. De novo assembly resulted in 31,842 transcripts (of 500 base pairs or greater) for skin and 20,423 transcripts for scale. This dataset provides a useful resource for both aquaculture and fish conservation studies and for research into the physiology and molecular biology of fish skin and scales. (C) 2017 Elsevier B.V. All rights reserved.Foundation for Science and Technology of Portugal (FCT) [PTDC/AAG-GLO/4003/2012, CCMAR/Multi/04326/2013, SFRH/BPD/84033/2012
Orthology guided transcriptome assembly of Italian ryegrass and meadow fescue for single-nucleotide polymorphism discovery
Single-nucleotide polymorphisms (SNPs) represent natural DNA sequence variation. They can be used for various applications including the construction of high-density genetic maps, analysis of genetic variability, genome-wide association studies, and mapbased cloning. Here we report on transcriptome sequencing in the two forage grasses, meadow fescue (Festuca pratensis Huds.) and Italian ryegrass (Lolium multiflorum Lam.), and identification of various classes of SNPs. Using the Orthology Guided Assembly (OGA) strategy, we assembled and annotated a total of 18,952 and 19,036 transcripts for Italian ryegrass and meadow fescue, respectively. In addition, we used transcriptome sequence data of perennial ryegrass (L. perenne L.) from a previous study to identify 16,613 transcripts shared across all three species. Large numbers of intraspecific SNPs were identified in all three species: 248,000 in meadow fescue, 715,000 in Italian ryegrass, and 529,000 in perennial ryegrass. Moreover, we identified almost 25,000 interspecific SNPs located in 5343 genes that can distinguish meadow fescue from Italian ryegrass and 15,000 SNPs located in 3976 genes that discriminate meadow fescue from both Lolium species. All identified SNPs were positioned in silico on the seven linkage groups (LGs) of L. perenne using the GenomeZipper approach. With the identification and positioning of interspecific SNPs, our study provides a valuable resource for the grass research and breeding community and will enable detailed characterization of genomic composition and gene expression analysis in prospective Festuca Lolium hybrids
- …