Search CORE

18 research outputs found

Methods to study splicing from high-throughput RNA Sequencing data

Author: A Ameur
A Bhasi
A Dobin
A Mortazavi
A Oshlack
A Roberts
A Roberts
AM Mezlini
AN Brooks
B Jackson
B Kakaradov
B Langmead
B Li
B Li
BJ Haas
BJ Haas
C Trapnell
C Trapnell
C Trapnell
D Hiller
D Singh
DL Wood
DW Bryant
E Eyras
E Lee
E Turro
ET Wang
F Birzele
F Bona De
F Denoeud
F Tang
G Robertson
G Xu
GA Sacomoto
GR Grant
GS Slater
H Bao
H Jiang
H Jiang
H Kim
H Richard
J Behr
J Du
J Feng
J Hu
J Lovén
J Martin
J Salzman
J Seok
J Seok
J Wu
J Wu
JE Allen
JJ Li
JP Venables
K Schneeberger
K Wang
KD Hansen
KF Au
KL Howe
KM Borgwardt
L Chen
L Chen
L Wang
L Wang
LY Chen
M Aschoff
M Fiume
M Garber
M Griffith
M Guttman
M Stanke
M Stanke
M Sultan
MC Ryan
MF Rogers
MG Grabherr
MH Schulz
MT Dimon
N Cloonan
N Cloonan
N Deng
N Leng
N Nicolae
N Philippe
N Vijay
NA Fonseca
O Stegle
P Drewe
P Glaus
PL Martelli
PP Labaj
Q Liu
Q Liu
Q Pan
QY Zhao
R Bohnert
R Guigó
R Li
S Anders
S Djebali
S Filichkin
S Heber
S Huang
S Lee
S Mangul
S Marco-Sola
S Shen
S Sonnenburg
S Srivastava
S Tang
S Zheng
SB Montgomery
SH Nagaraj
SK Lou
T Bonfert
TA Clark
TD Wu
TD Wu
W Li
W Li
W Wang
WJ Kent
Y Hu
Y Katz
Y Li
Y Liao
Y Surget-Groba
Y Xing
Y Xing
Y Zhang
Z Xia
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 30/07/2015
Field of study

The development of novel high-throughput sequencing (HTS) methods for RNA (RNA-Seq) has provided a very powerful mean to study splicing under multiple conditions at unprecedented depth. However, the complexity of the information to be analyzed has turned this into a challenging task. In the last few years, a plethora of tools have been developed, allowing researchers to process RNA-Seq data to study the expression of isoforms and splicing events, and their relative changes under different conditions. We provide an overview of the methods available to study splicing from short RNA-Seq data. We group the methods according to the different questions they address: 1) Assignment of the sequencing reads to their likely gene of origin. This is addressed by methods that map reads to the genome and/or to the available gene annotations. 2) Recovering the sequence of splicing events and isoforms. This is addressed by transcript reconstruction and de novo assembly methods. 3) Quantification of events and isoforms. Either after reconstructing transcripts or using an annotation, many methods estimate the expression level or the relative usage of isoforms and/or events. 4) Providing an isoform or event view of differential splicing or expression. These include methods that compare relative event/isoform abundance or isoform expression across two or more conditions. 5) Visualizing splicing regulation. Various tools facilitate the visualization of the RNA-Seq data in the context of alternative splicing. In this review, we do not describe the specific mathematical models behind each method. Our aim is rather to provide an overview that could serve as an entry point for users who need to decide on a suitable tool for a specific analysis. We also attempt to propose a classification of the tools according to the operations they do, to facilitate the comparison and choice of methods.Comment: 31 pages, 1 figure, 9 tables. Small corrections adde

arXiv.org e-Print Archive

Crossref

A computational method for estimating the PCR duplication rate in DNA and RNA-seq experiments

Author: A Adey
A Auton
A Mortazavi
AM Mezlini
CS Chilamakuri
D Aird
DR Bentley
EN Smith
H Li
I Kozarewa
IF Bronner
JA Casbon
KD Hansen
MA DePristo
MN Bainbridge
N Whiteford
PA ’t Hoen
S Islam
SR Head
T Daley
T Kivioja
T Lappalainen
Vikas Bansal
W Zhou
Y Chen
Y Kukita
Z Wang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/03/2017
Field of study

BACKGROUND: PCR amplification is an important step in the preparation of DNA sequencing libraries prior to high-throughput sequencing. PCR amplification introduces redundant reads in the sequence data and estimating the PCR duplication rate is important to assess the frequency of such reads. Existing computational methods do not distinguish PCR duplicates from “natural” read duplicates that represent independent DNA fragments and therefore, over-estimate the PCR duplication rate for DNA-seq and RNA-seq experiments. RESULTS: In this paper, we present a computational method to estimate the average PCR duplication rate of high-throughput sequence datasets that accounts for natural read duplicates by leveraging heterozygous variants in an individual genome. Analysis of simulated data and exome sequence data from the 1000 Genomes project demonstrated that our method can accurately estimate the PCR duplication rate on paired-end as well as single-end read datasets which contain a high proportion of natural read duplicates. Further, analysis of exome datasets prepared using the Nextera library preparation method indicated that 45–50% of read duplicates correspond to natural read duplicates likely due to fragmentation bias. Finally, analysis of RNA-seq datasets from individuals in the 1000 Genomes project demonstrated that 70–95% of read duplicates observed in such datasets correspond to natural duplicates sampled from genes with high expression and identified outlier samples with a 2-fold greater PCR duplication rate than other samples. CONCLUSIONS: The method described here is a useful tool for estimating the PCR duplication rate of high-throughput sequence datasets and for assessing the fraction of read duplicates that correspond to natural read duplicates. An implementation of the method is available at https://github.com/vibansal/PCRduplicates. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-017-1471-9) contains supplementary material, which is available to authorized users

Crossref

PubMed Central

eScholarship - University of California

Transcriptome assembly and quantification from Ion Torrent RNA-Seq data

Author: A Mortazavi
A Roberts
A Roberts
Adrian Caciula
AI Tomescu
Alex Zelikovsky
AM Mezlini
B Li
B Li
C Gregg
C Ponting
C Trapnell
C Trapnell
CJ McManus
Consortium MAQC
DR Bentley
Dumitru Brinza
E Wang
G Robertson
Ion Mӑndoiu
J Duitama
J Feng
JF Degner
JM Rothberg
KF Au
L Song
LH Reid
M Garber
M Grabherr
M Griffith
M Guttman
M Nicolae
PA Pevzner
R Tibshirani
RK Thomas
S Mangul
S Mangul
S Pal
Sahar Al Seesi
Serghei Mangul
TR Mercer
V Pandey
W Li
W Li
YY Lin
Z Wang
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Erratum: Crowdsourced assessment of common genetic contribution to predicting anti-TNF treatment response in rheumatoid arthritis

Author: Aguilar D
Aittokallio T
Ammad-ud-din M
Anton B
Azencott CA
Balagurusamy VSK
Bellon V
Boeva V
Bonet J
Bunte K
Cheng L
Chheda H
Corander J
Cui J
Dillenberger D
Dumontier M
Eksi R
Falcao AO
Fornes O
Garcia-Garcia J
Goldenberg A
Gopalacharyulu P
Guney E
Hajiloo M
Hidru D
Hoff B
Jaiswal A
Kaski S
Khalfaoui B
Khan SA
Kramer ER
Li HD
Marin MA
Marttinen P
Mezlini AM
Molparia B
Neto EC
Norman T
Pandey G
Panwar B
Pappas D
Pirinen M
Planas-Iglesias J
Poglayen D
Pratap A
Saarela J
Samwald M
Sieberts SK
Stahl E
Stoven V
Suver C
Tang H
Tang J
Torkamani A
Vert JP
Wang B
Wang T
Wennerberg K
Wineinger NE
Xiao GH
Xie Y
Yeung R
Zhan XW
Zhao C
Zhu F
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 10/10/2016
Field of study

UTUPub

Crowdsourced assessment of common genetic contribution to predicting anti-TNF treatment response in rheumatoid arthritis

Author: Aguilar D
Aittokallio T
Allaart CF
Ammad-ud-din M
Anton B
Azencott CA
Balagurusamy VSK
Barton A
Bellon V
Boeva V
Bonet J
Bridges SL
Bunte K
Cheng L
Chheda H
Coenen M
Corander J
Criswell L
Cui J
de Vries N
Dillenberger D
Dumontier M
Eksi R
Falcao AO
Fornes O
Friend S
Garcia-Garcia J
Gerlag D
Goldenberg A
Gopalacharyulu P
Greenberg J
Gregersen PK
Guan YF
Guney E
Hajiloo M
Hidru D
Hoff B
Huizinga TWJ
Jaiswal A
Kaski S
Khalfaoui B
Khan SA
Klareskog L
Kramer ER
Kremer J
Kurreeman F
Li HD
Mangravite LM
Mariette X
Marin MA
Marttinen P
Mezlini AM
Miceli C
Michaud K
Molparia B
Moreland L
Neto EC
Norman T
Oliva B
Padyukov L
Pandey G
Panwar B
Pappas D
Pirinen M
Planas-Iglesias J
Plenge R
Poglayen D
Pratap A
Saarela J
Saevarsdottir S
Samwald M
Shadick N
Sieberts SK
Stahl E
Stolovitzky G
Stoven V
Suver C
Tak PP
Tang H
Tang J
Torkamani A
Vert JP
Wang B
Wang T
Weinblatt M
Wennerberg K
Wineinger NE
Xiao GH
Xie Y
Yeung R
Zhan XW
Zhao C
Zhu F
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 28/10/2022
Field of study

Rheumatoid arthritis (RA) affects millions world-wide. While anti-TNF treatment is widely used to reduce disease progression, treatment fails in Bone-third of patients. No biomarker currently exists that identifies non-responders before treatment. A rigorous community-based assessment of the utility of SNP data for predicting anti-TNF treatment efficacy in RA patients was performed in the context of a DREAM Challenge (http://www.synapse.org/RA_Challenge). An open challenge framework enabled the comparative evaluation of predictions developed by 73 research groups using the most comprehensive available data and covering a wide range of state-of-the-art modelling methodologies. Despite a significant genetic heritability estimate of treatment non-response trait (h(2) = 0.18, P value = 0.02), no significant genetic contribution to prediction accuracy is observed. Results formally confirm the expectations of the rheumatology community that SNP information does not significantly improve predictive performance relative to standard clinical traits, thereby justifying a refocusing of future efforts on collection of other data

UTUPub