53 research outputs found
Comparing Three Approaches
Hybridization-based target enrichment protocols require relatively large
starting amounts of genomic DNA, which is not always available. Here, we
tested three approaches to pre-capture library preparation starting from 10 ng
of genomic DNA: (i and ii) whole-genome amplification of DNA samples with
REPLI-g (Qiagen) and GenomePlex (Sigma) kits followed by standard library
preparation, and (iii) library construction with a low input oriented ThruPLEX
kit (Rubicon Genomics). Exome capture with Agilent SureSelectXT2 Human AllExon
v4+UTRs capture probes, and HiSeq2000 sequencing were performed for test
libraries along with the control library prepared from 1 µg of starting DNA.
Tested protocols were characterized in terms of mapping efficiency, enrichment
ratio, coverage of the target region, and reliability of SNP genotyping.
REPLI-g- and ThruPLEX-FD-based protocols seem to be adequate solutions for
exome sequencing of low input sample
Influence of RNA extraction methods and library selection schemes on RNA-seq data
BACKGROUND: Gene expression analysis by RNA sequencing is now widely used in a number of applications surveying the whole transcriptomes of cells and tissues. The recent introduction of ribosomal RNA depletion protocols, such as RiboZero, has extended the view of the polyadenylated transcriptome to the poly(A)- fraction of the RNA. However, substantial amounts of intronic transcriptional activity has been reported in RiboZero protocols, raising issues regarding their potential nuclear origin and the impact on the actual sequence depth in exonic regions. RESULTS: Using HEK293 human cells as source material, we assessed here the impact of the two commonly used RNA extraction methods and of the library construction protocols (rRNA depletion versus mRNA) on 1) the relative abundance of intronic reads and 2) on the estimation of gene expression values. We benchmarked the rRNA depletion-based sequencing with a specific analysis of the cytoplasmic and nuclear transcriptome fractions, suggesting that the large majority of the intronic reads correspond to unprocessed nuclear transcripts rather than to independent transcriptional units. We show that Qiagen or TRIzol extraction methods retain differentially nuclear RNA species, and that consequently, rRNA depletion-based RNA sequencing protocols are particularly sensitive to the extraction methods. CONCLUSIONS: We could show that the combination of Trizol-based RNA extraction with rRNA depletion sequencing protocols led to the largest fraction of intronic reads, after the sequencing of the nuclear transcriptome. We discuss here the impact of the various strategies on gene expression and alternative splicing estimation measures. Further, we propose guidelines and a double selection strategy for minimizing the expression biases, without loss of information
Janus—a comprehensive tool investigating the two faces of transcription
Motivation: Protocols to generate strand-specific transcriptomes with next-generation sequencing platforms have been used by the scientific community roughly since 2008. Strand-specific reads allow for detection of antisense events and a higher resolution of expression profiles enabling extension of current transcript annotations. However, applications making use of this strandedness information are still scarce. Results: Here we present a tool (Janus), which focuses on the identification of transcriptional active regions in antisense orientation to known and novel transcribed elements of the genome. Janus can compare the antisense events of multiple samples and assigns scores to identify mutual expression of either transcript in a sense/antisense pair, which could hint to regulatory mechanisms. Janus is able to make use of single-nucleotide variant (SNV) and methylation data, if available, and reports the sense to antisense ratio of regions in the vicinity of the identified genetic and epigenetic variation. Janus interrogates positions of heterozygous SNVs to identify strand-specific allelic imbalance. Availability: Janus is written in C/C++ and freely available at http://www.ikmb.uni-kiel.de/janus/janus.html under terms of GNU General Public License, for both, Linux and Windows 64×. Although the binaries will work without additional downloads, the software depends on bamtools (https://github.com/pezmaster31/bamtools) for compilation. A detailed tutorial section is included in the first section of the supplemental material and included as brief readme.txt in the tutorial archive. Contact: [email protected] or [email protected] Supplementary information: Supplementary data are available at Bioinformatics onlin
Transcriptome analysis by strand-specific sequencing of complementary DNA
High-throughput complementary DNA sequencing (RNA-Seq) is a powerful tool for whole-transcriptome analysis, supplying information about a transcript's expression level and structure. However, it is difficult to determine the polarity of transcripts, and therefore identify which strand is transcribed. Here, we present a simple cDNA sequencing protocol that preserves information about a transcript's direction. Using Saccharomyces cerevisiae and mouse brain transcriptomes as models, we demonstrate that knowing the transcript's orientation allows more accurate determination of the structure and expression of genes. It also helps to identify new genes and enables studying promoter-associated and antisense transcription. The transcriptional landscapes we obtained are available online
Recommended from our members
Transcriptome and genome sequencing uncovers functional variation in humans
Summary Genome sequencing projects are discovering millions of genetic variants in humans, and interpretation of their functional effects is essential for understanding the genetic basis of variation in human traits. Here we report sequencing and deep analysis of mRNA and miRNA from lymphoblastoid cell lines of 462 individuals from the 1000 Genomes Project – the first uniformly processed RNA-seq data from multiple human populations with high-quality genome sequences. We discovered extremely widespread genetic variation affecting regulation of the majority of genes, with transcript structure and expression level variation being equally common but genetically largely independent. Our characterization of causal regulatory variation sheds light on cellular mechanisms of regulatory and loss-of-function variation, and allowed us to infer putative causal variants for dozens of disease-associated loci. Altogether, this study provides a deep understanding of the cellular mechanisms of transcriptome variation and of the landscape of functional variants in the human genome
Influence of RNA extraction methods and library selection schemes on RNA-seq data.
BACKGROUND:Gene expression analysis by RNA sequencing is now widely used in a number of applications surveying the whole transcriptomes of cells and tissues. The recent introduction of ribosomal RNA depletion protocols, such as RiboZero, has extended the view of the polyadenylated transcriptome to the poly(A)- fraction of the RNA. However, substantial amounts of intronic transcriptional activity has been reported in RiboZero protocols, raising issues regarding their potential nuclear origin and the impact on the actual sequence depth in exonic regions.
RESULTS:Using HEK293 human cells as source material, we assessed here the impact of the two commonly used RNA extraction methods and of the library construction protocols (rRNA depletion versus mRNA) on 1) the relative abundance of intronic reads and 2) on the estimation of gene expression values. We benchmarked the rRNA depletion-based sequencing with a specific analysis of the cytoplasmic and nuclear transcriptome fractions, suggesting that the large majority of the intronic reads correspond to unprocessed nuclear transcripts rather than to independent transcriptional units. We show that Qiagen or TRIzol extraction methods retain differentially nuclear RNA species, and that consequently, rRNA depletion-based RNA sequencing protocols are particularly sensitive to the extraction methods.
CONCLUSIONS:We could show that the combination of Trizol-based RNA extraction with rRNA depletion sequencing protocols led to the largest fraction of intronic reads, after the sequencing of the nuclear transcriptome. We discuss here the impact of the various strategies on gene expression and alternative splicing estimation measures. Further, we propose guidelines and a double selection strategy for minimizing the expression biases, without loss of information
The direction of cross affects [corrected] obesity after puberty in male but not female offspring
Background We investigated parent-of-origin and allele-specific expression effects on obesity and hepatic gene expression in reciprocal crosses between the Berlin Fat Mouse Inbred line (BFMI) and C57Bl/6NCrl (B6N). Results We found that F1-males with a BFMI mother developed 1.8 times more fat mass on a high fat diet at 10 weeks than F1-males of a BFMI father. The phenotype was detectable from six weeks on and was preserved after cross-fostering. RNA-seq data of liver provided evidence for higher biosynthesis and elongation of fatty acids (p = 0.00635) in obese male offspring of a BFMI mother versus lean offspring of a BFMI father. Furthermore, fatty acid degradation (p = 0.00198) and the peroxisome pathway were impaired (p = 0.00094). The circadian rhythm was affected as well (p = 0.00087). Among the highest up-regulated protein coding genes in obese males were Acot4 (1.82 fold, p = 0.022), Cyp4a10 (1.35 fold, p = 0.026) and Cyp4a14 (1.32 fold, p = 0.012), which hydroxylize fatty acids and which are known to be increased in liver steatosis. Obese males showed lower expression of the genetically imprinted and paternally expressed 3 (Peg3) gene (0.31 fold, p = 0.046) and higher expression of the androgen receptor (Ar) gene (2.38 fold, p = 0.068). Allelic imbalance was found for expression of ATP-binding cassette transporter gene Abca8b. Several of the differentially expressed genes contain estrogen response elements. Conclusions Parent-of-origin effects during gametogenesis and/or fetal development in an obese mother epigenetically modify the transcription of genes that lead to enhanced fatty acid synthesis and impair β-oxidation in the liver of male, but not female F1 offspring. Down-regulation of Peg3 could contribute to trigger this metabolic setting. At puberty, higher amounts of the androgen receptor and altered access to estrogen response elements in affected genes are likely responsible for male specific expression of genes that were epigenetically triggered. A suggestive lack of estrogen binding motifs was found for highly down-regulated genes in adult hepatocytes of obese F1 males (p = 0.074)
Exome Sequencing from Nanogram Amounts of Starting DNA: Comparing Three Approaches
<div><p>Hybridization-based target enrichment protocols require relatively large starting amounts of genomic DNA, which is not always available. Here, we tested three approaches to pre-capture library preparation starting from 10 ng of genomic DNA: (i and ii) whole-genome amplification of DNA samples with REPLI-g (Qiagen) and GenomePlex (Sigma) kits followed by standard library preparation, and (iii) library construction with a low input oriented ThruPLEX kit (Rubicon Genomics). Exome capture with Agilent SureSelect<i><sup>XT2</sup></i> Human AllExon v4+UTRs capture probes, and HiSeq2000 sequencing were performed for test libraries along with the control library prepared from 1 µg of starting DNA. Tested protocols were characterized in terms of mapping efficiency, enrichment ratio, coverage of the target region, and reliability of SNP genotyping. REPLI-g- and ThruPLEX-FD-based protocols seem to be adequate solutions for exome sequencing of low input samples.</p></div
- …