Differential analyses for RNA-seq: transcript-level estimates improve gene-level inferences

Love, Michael I.; Robinson, Mark D.; Soneson, Charlotte

Differential analyses for RNA-seq: transcript-level estimates improve gene-level inferences

Authors: Michael I. Love
Mark D. Robinson
Charlotte Soneson
Publication date: 1 February 2016
Publisher: 'F1000 Research Ltd'
Doi

Abstract

High-throughput sequencing of cDNA (RNA-seq) is used extensively to characterize the transcriptome of cells. Many transcriptomic studies aim at comparing either abundance levels or the transcriptome composition between given conditions, and as a first step, the sequencing reads must be used as the basis for abundance quantification of transcriptomic features of interest, such as genes or transcripts. Several different quantification approaches have been proposed, ranging from simple counting of reads that overlap given genomic regions to more complex estimation of underlying transcript abundances. In this paper, we show that gene-level abundance estimates and statistical inference offer advantages over transcript-level analyses, in terms of performance and interpretability. We also illustrate that while the presence of differential isoform usage can lead to inflated false discovery rates in differential expression analyses on simple count matrices and transcript-level abundance estimates improve the performance in simulated data, the difference is relatively minor in several real data sets. Finally, we provide an R package ( tximport) to help users integrate transcript-level abundance estimates from common quantification pipelines into count-based statistical inference engines

Similar works

Full text

Open in the Core reader

Download PDF

Available Versions

Directory of Open Access Journals

oai:doaj.org/article:a6b132db4...

Last time updated on 14/10/2017

Sustaining member

Harvard University - DASH

oai:dash.harvard.edu:1/2565848...

Last time updated on 17/04/2018