Search CORE

261 research outputs found

SAW: A Method to Identify Splicing Events from RNA-Seq Data Based on Splicing Fingerprints

Author: A Mortazavi
AJ Matlin
AJ Pinho
B Modrek
BJ Blencowe
BR Graveley
C Trapnell
CA Maher
D Brett
D Brett
Damian Fermin
DL Black
ET Wang
F De Bona
F Mo
JC Castle
Juan Valcarcel
Kang Ning
KS Elenitoba-Johnson
M Burrows
WJ Kent
Publication venue: Public Library of Science
Publication date: 01/01/2010
Field of study

Splicing event identification is one of the most important issues in the comprehensive analysis of transcription profile. Recent development of next-generation sequencing technology has generated an extensive profile of alternative splicing. However, while many of these splicing events are between exons that are relatively close on genome sequences, reads generated by RNA-Seq are not limited to alternative splicing between close exons but occur in virtually all splicing events. In this work, a novel method, SAW, was proposed for the identification of all splicing events based on short reads from RNA-Seq. It was observed that short reads not in known gene models are actually absent words from known gene sequences. An efficient method to filter and cluster these short reads by fingerprint fragments of splicing events without aligning short reads to genome sequences was developed. Additionally, the possible splicing sites were also determined without alignment against genome sequences. A consensus sequence was then generated for each short read cluster, which was then aligned to the genome sequences. Results demonstrated that this method could identify more than 90% of the known splicing events with a very low false discovery rate, as well as accurately identify, a number of novel splicing events between distant exons

CiteSeerX

Public Library of Science (PLOS)

Crossref

PubMed Central

Qingdao Institute of Bioenergy and Bioprocess Technology, Chinese Academy of Sciences

Transkingdom Networks: A Systems Biology Approach to Identify Causal Members of Host-Microbiota Interactions

Improvements in sequencing technologies and reduced experimental costs have resulted in a vast number of studies generating high-throughput data. Although the number of methods to analyze these "omics" data has also increased, computational complexity and lack of documentation hinder researchers from analyzing their high-throughput data to its true potential. In this chapter we detail our data-driven, transkingdom network (TransNet) analysis protocol to integrate and interrogate multi-omics data. This systems biology approach has allowed us to successfully identify important causal relationships between different taxonomic kingdoms (e.g. mammals and microbes) using diverse types of data

arXiv.org e-Print Archive

Crossref

Towards reliable isoform quantification using RNA-SEQ data

Author: A Mortazavi
APM Weber
B-B Wang
BJ Blencowe
Brian E Howard
C Clopper
D Swarbeck
DP Dixon
ET Wang
H Jiang
I Kozarewa
K Salehi-Ashtiani
Li
MS Cline
R Lister
S Gupta
SA Filichkin
SEV Linsern
Steffen Heber
The Gene Ontology Consortium
V Lacroix
X Zhou
Y Benjamini
Y Xing
Z Wang
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Crossref

Springer - Publisher Connector

PubMed Central

Methods to study splicing from high-throughput RNA Sequencing data

Author: A Ameur
A Bhasi
A Dobin
A Mortazavi
A Oshlack
A Roberts
A Roberts
AM Mezlini
AN Brooks
B Jackson
B Kakaradov
B Langmead
B Li
B Li
BJ Haas
BJ Haas
C Trapnell
C Trapnell
C Trapnell
D Hiller
D Singh
DL Wood
DW Bryant
E Eyras
E Lee
E Turro
ET Wang
F Birzele
F Bona De
F Denoeud
F Tang
G Robertson
G Xu
GA Sacomoto
GR Grant
GS Slater
H Bao
H Jiang
H Jiang
H Kim
H Richard
J Behr
J Du
J Feng
J Hu
J Lovén
J Martin
J Salzman
J Seok
J Seok
J Wu
J Wu
JE Allen
JJ Li
JP Venables
K Schneeberger
K Wang
KD Hansen
KF Au
KL Howe
KM Borgwardt
L Chen
L Chen
L Wang
L Wang
LY Chen
M Aschoff
M Fiume
M Garber
M Griffith
M Guttman
M Stanke
M Stanke
M Sultan
MC Ryan
MF Rogers
MG Grabherr
MH Schulz
MT Dimon
N Cloonan
N Cloonan
N Deng
N Leng
N Nicolae
N Philippe
N Vijay
NA Fonseca
O Stegle
P Drewe
P Glaus
PL Martelli
PP Labaj
Q Liu
Q Liu
Q Pan
QY Zhao
R Bohnert
R Guigó
R Li
S Anders
S Djebali
S Filichkin
S Heber
S Huang
S Lee
S Mangul
S Marco-Sola
S Shen
S Sonnenburg
S Srivastava
S Tang
S Zheng
SB Montgomery
SH Nagaraj
SK Lou
T Bonfert
TA Clark
TD Wu
TD Wu
W Li
W Li
W Wang
WJ Kent
Y Hu
Y Katz
Y Li
Y Liao
Y Surget-Groba
Y Xing
Y Xing
Y Zhang
Z Xia
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 30/07/2015
Field of study

The development of novel high-throughput sequencing (HTS) methods for RNA (RNA-Seq) has provided a very powerful mean to study splicing under multiple conditions at unprecedented depth. However, the complexity of the information to be analyzed has turned this into a challenging task. In the last few years, a plethora of tools have been developed, allowing researchers to process RNA-Seq data to study the expression of isoforms and splicing events, and their relative changes under different conditions. We provide an overview of the methods available to study splicing from short RNA-Seq data. We group the methods according to the different questions they address: 1) Assignment of the sequencing reads to their likely gene of origin. This is addressed by methods that map reads to the genome and/or to the available gene annotations. 2) Recovering the sequence of splicing events and isoforms. This is addressed by transcript reconstruction and de novo assembly methods. 3) Quantification of events and isoforms. Either after reconstructing transcripts or using an annotation, many methods estimate the expression level or the relative usage of isoforms and/or events. 4) Providing an isoform or event view of differential splicing or expression. These include methods that compare relative event/isoform abundance or isoform expression across two or more conditions. 5) Visualizing splicing regulation. Various tools facilitate the visualization of the RNA-Seq data in the context of alternative splicing. In this review, we do not describe the specific mathematical models behind each method. Our aim is rather to provide an overview that could serve as an entry point for users who need to decide on a suitable tool for a specific analysis. We also attempt to propose a classification of the tools according to the operations they do, to facilitate the comparison and choice of methods.Comment: 31 pages, 1 figure, 9 tables. Small corrections adde

arXiv.org e-Print Archive

Crossref

Towards the reconstruction of integrated genome-scale models of metabolism and gene expression

Author: A Brazma
A Mortazavi
A Santos-Zavaleta
AS Blazier
BJ Schmidt
C Colijn
C Willson
CJ Lloyd
D Eckweiler
D Lee
D Machado
DL Nelson
DS Johnson
DT Banos
E Motamedian
H Lodish
JD Orth
JJ Faith
JP Faria
JP Faria
JP Faria
L Marmiesse
M Moretto
MN Price
MW Covert
N Kolesnikov
N Sierro
NE Lewis
PA Jensen
PA Jensen
PS Novichkov
PS Novichkov
RA Young
RJP Berlo van
S Chandrasekaran
T Barrett
T Shlomi
U Nagalakshmi
VE Velculescu
VR Iyer
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2020
Field of study

The reconstruction of integrated genome-scale models of metabolism and gene expression has been a challenge for a while now. In fact, various methods that allow integrating reconstructions of Transcriptional Regulatory Networks, gene expression data or both into Genome-Scale Metabolic Models have been proposed. Several of these methods are surveyed in this article, which allowed identifying their strengths and weaknesses concerning the reconstruction of integrated models for multiple prokaryotic organisms. Additionally, the main resources of regulatory information were also surveyed, as the existence of novel sources of regulatory information and gene expression data may contribute for the improvement of methodologies referred herein.This study was supported by the Portuguese Foundation for Science andTechnology (FCT) under the scope of the strategic funding of UID/BIO/04469/2019 unit andBioTecNorte operation (NORTE-01-0145-FEDER-000004) funded by the European RegionalDevelopment Fund under the scope of Norte2020-Programa Operacional Regional do Norte. Fernando Cruz holds a doctoral fellowship (SFRH/BD/139198/2018) funded by the FCT. The authors thank project SHIKIFACTORY100 - Modular cell factories for the production of 100 compounds from the shikimate pathway (814408) funded by the European Commission.info:eu-repo/semantics/publishedVersio

Universidade do Minho: RepositoriUM

Crossref

Protocol Dependence of Sequencing-Based Gene Expression Measurements

Author: A Goren
A Mortazavi
A Oshlack
BJ Blencowe
C Hart
C Plessy
C Trapnell
CD Armour
D Lipson
Doron Lipson
E Klein
F Ozsolak
GA Heap
I Chepelev
JC Marioni
John F. Thompson
KD Sullivan
L Mamanova
L Shi
LL Baumbach
LT Sam
M Sultan
MJ Fullwood
Najib M. El-Sayed
O Morozova
P Carninci
P Kapranov
P Kapranov
PA 't Hoen
Patrice M. Milos
Philipp Kapranov
Q Pan
R Rosenkranz
RD Canales
S Djebali
S Marguerat
SP Mane
Stan Letovsky
T Nagaike
Tal Raz
TR Breitman
YW Asmann
Z Wang
ZJ Wu
Publication venue: Public Library of Science
Publication date: 01/01/2011
Field of study

RNA Seq provides unparalleled levels of information about the transcriptome including precise expression levels over a wide dynamic range. It is essential to understand how technical variation impacts the quality and interpretability of results, how potential errors could be introduced by the protocol, how the source of RNA affects transcript detection, and how all of these variations can impact the conclusions drawn. Multiple human RNA samples were used to assess RNA fragmentation, RNA fractionation, cDNA synthesis, and single versus multiple tag counting. Though protocols employing polyA RNA selection generate the highest number of non-ribosomal reads and the most precise measurements for coding transcripts, such protocols were found to detect only a fraction of the non-ribosomal RNA in human cells. PolyA RNA excludes thousands of annotated and even more unannotated transcripts, resulting in an incomplete view of the transcriptome. Ribosomal-depleted RNA provides a more cost-effective method for generating complete transcriptome coverage. Expression measurements using single tag counting provided advantages for assessing gene expression and for detecting short RNAs relative to multi-read protocols. Detection of short RNAs was also hampered by RNA fragmentation. Thus, this work will help researchers choose from among a range of options when analyzing gene expression, each with its own advantages and disadvantages

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Genomic sequencing in clinical trials

Author: A Mortazavi
A Zimprich
AA Bhinge
BJ O'Roak
C Allen
C Betancur
C Mele
C Vilarino-Guell
CS Ku
DA Rasko
DI Shalowitz
DS Johnson
E Hodges
EE Schadt
ER Mardis
ES Martens-Uzunova
ET Wang
EW Clayton
G Xu
GH Fernald
GJ Porreca
H Greulich
J Amberger
J Rios
JF Thompson
K Kannan
K Musunuru
Karen K Mestan
KJ Buckingham
Leonard Ilkhanoff
LG Biesecker
M Allison
M Kircher
MA Chapman
O Harismendy
P Aldhous
PJ Campbell
Q Zhao
R Drmanac
RM Toydemir
Samdeep Mouli
Simon Lin
SM Hawkins
SP Shah
ST Bennett
TJ Ley
W Lee
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Human genome sequencing is the process by which the exact order of nucleic acid base pairs in the 24 human chromosomes is determined. Since the completion of the Human Genome Project in 2003, genomic sequencing is rapidly becoming a major part of our translational research efforts to understand and improve human health and disease. This article reviews the current and future directions of clinical research with respect to genomic sequencing, a technology that is just beginning to find its way into clinical trials both nationally and worldwide. We highlight the currently available types of genomic sequencing platforms, outline the advantages and disadvantages of each, and compare first- and next-generation techniques with respect to capabilities, quality, and cost. We describe the current geographical distributions and types of disease conditions in which these technologies are used, and how next-generation sequencing is strategically being incorporated into new and existing studies. Lastly, recent major breakthroughs and the ongoing challenges of using genomic sequencing in clinical research are discussed

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Second-Generation Sequencing Supply an Effective Way to Screen RNAi Targets in Large Scale for Potential Application in Pest Insect Control

Author: A Fire
A Kahvejian
A Mortazavi
AS Morrissy
BE Tabashnik
BE Tabashnik
BJ Haas
BS Coates
C Iseli
C Zhang
CT Turner
D Hegedus
D Kuttenkeuler
D Martin
DP Walshe
DR Joanisse
DR Price
Frederic Marion-Poll
G Broehan
GJ Hannon
H Huvenne
Haichao Li
Hao Zhang
IB Dawid
JA Baum
JW Pridgeon
KJ Livak
L Timmons
L Timmons
M Chen
M Ote
MA Bautista
NS Mutti
O Pechanova
O Terenius
PA t Hoen
R Li
RK Gaur
RN Araujo
S Sivakumar
S Whyard
SR Gallagher
X Belles
Xuexia Miao
Y Dong
Y Liu
Y Tomoyasu
YB Mao
Yubing Wang
YW Asmann
Z Wang
Publication venue: Public Library of Science
Publication date: 11/04/2011
Field of study

The key of RNAi approach success for potential insect pest control is mainly dependent on careful target selection and a convenient delivery system. We adopted second-generation sequencing technology to screen RNAi targets. Illumina's RNA-seq and digital gene expression tag profile (DGE-tag) technologies were used to screen optimal RNAi targets from Ostrinia furnalalis. Total 14690 stage specific genes were obtained which can be considered as potential targets, and 47 were confirmed by qRT-PCR. Ten larval stage specific expression genes were selected for RNAi test. When 50 ng/µl dsRNAs of the genes DS10 and DS28 were directly sprayed on the newly hatched larvae which placed on the filter paper, the larval mortalities were around 40∼50%, while the dsRNAs of ten genes were sprayed on the larvae along with artificial diet, the mortalities reached 73% to 100% at 5 d after treatment. The qRT-PCR analysis verified the correlation between larval mortality and the down-regulation of the target gene expression. Topically applied fluorescent dsRNA confirmed that dsRNA did penetrate the body wall and circulate in the body cavity. It seems likely that the combination of DGE-tag with RNA-seq is a rapid, high-throughput, cost less and an easy way to select the candidate target genes for RNAi. More importantly, it demonstrated that dsRNAs are able to penetrate the integument and cause larval developmental stunt and/or death in a lepidopteron insect. This finding largely broadens the target selection for RNAi from just gut-specific genes to the targets in whole insects and may lead to new strategies for designing RNAi-based technology against insect damage

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Web-based bioinformatics workflows for end-to-end RNA-seq data computation and analysis in agricultural animal species

Author: A Dobin
A Mortazavi
AM Bolger
AR Quinlan
B Langmead
B Langmead
B Li
BJ Haas
C Trapnell
C Trapnell
C Trapnell
D Kim
ER Mardis
F De Bona
G Robertson
GR Grant
H Li
H Li
H Li
H Lin
J Goecks
JT Robinson
K Wang
L Wang
M Garber
M Guttman
MD Robinson
MG Grabherr
MH Schulz
MP Cox
P Flicek
PG Engstrom
Qiyun Zhu
R Li
R Li
R. Alexander Richter
Robert W. Li
S Anders
SF Altschul
SM Rumble
TD Wu
Weizhong Li
Y Katz
Yunsup Jung
Z Wang
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Transcriptomic landscape of breast cancers through mRNA sequencing

Author: A Jemal
A Morabito
A Mortazavi
A Naderi
A Seute
B Weigelt
B Weigelt
B Weigelt
BA Gusterson
BA Gusterson
BJ Haas
C Oakman
C Trapnell
C Trapnell
CM Perou
CW Elston
DB Williams
DC Sgroi
DD Licatalosi
EA Rakha
EA Rakha
F Andre
F Ozsolak
F Richard
FC Geyer
FC Geyer
G Ricolleau
G Watkins
G Watkins
GC Santos
H Buerger
H Buerger
H Li
I Soerjomataram
J Stingl
JD Brenton
JS Parker
JS Reis-Filho
KR Bauer
L Pusztai
LA Carey
LJ van 't Veer
LJ van 't Veer
M Garber
M Krzywinski
MJ van de Vijver
R Dent
S Anders
S Badve
SJ Dawson
T Sjoblom
T Sorlie
T Sorlie
T Sorlie
T Vargo-Gogola
TJ Finnegan
WD Foulkes
XX Cao
XX Cao
XX Cao
Y Wang
Z Hu
Publication venue: Nature Publishing Group
Publication date: 14/02/2012
Field of study

Breast cancer is a heterogeneous disease with a poorly defined genetic landscape, which poses a major challenge in diagnosis and treatment. By massively parallel mRNA sequencing, we obtained 1.2 billion reads from 17 individual human tissues belonging to TNBC, Non-TNBC, and HER2-positive breast cancers and defined their comprehensive digital transcriptome for the first time. Surprisingly, we identified a high number of novel and unannotated transcripts, revealing the global breast cancer transcriptomic adaptations. Comparative transcriptomic analyses elucidated differentially expressed transcripts between the three breast cancer groups, identifying several new modulators of breast cancer. Our study also identified common transcriptional regulatory elements, such as highly abundant primary transcripts, including osteonectin, RACK1, calnexin, calreticulin, FTL, and B2M, and “genomic hotspots” enriched in primary transcripts between the three groups. Thus, our study opens previously unexplored niches that could enable a better understanding of the disease and the development of potential intervention strategies

Crossref

PubMed Central

George Washington University: Health Sciences Research Commons (HSRC)