Sensitivity, specificity, and reproducibility of RNA-Seq differential expression calls

Author: A Dobin
C Trapnell
CW Law
D Kim
D Thierry-Mieg
David P. Kreil
JT Leek
L Shi
M David
MB Gerstein
MD Robinson
MD Robinson
ME Ritchie
MI Love
NL Bray
O Stegle
P Glaus
Paweł P. Łabaj
PP Labaj
R Patro
S Li
Y Liao
Y Sha
Publication venue: Springer Science and Business Media LLC

An analysis of single amino acid repeats as use case for application specific background models

Author: C Notredame
David P Kreil
DP Depledge
DP Kreil
E Birney
E Delot
EL Sonnhammer
EM Marcotte
G Gouridis
G Nuel
G Reinert
H Gerber
H Nielsen
H Nielsen
IB Kuznetsov
J Thompson
J Wootton
J Xie
JD Bendtsen
JM Hancock
JW Fondon
L Brown
L Zhang
M Hoebeke
M Mar Alba
M Thomas-Chollier
M Tipping
M Tipping
MA Huntley
O Weiss
OB Ptitsyn
P Siwach
P Siwach
Paweł P Łabaj
Peter Sykacek
PP Łabaj
R Lopez
R Lyne
RI Sadreyev
RS Hegde
S Caburet
S Hands
S Henikoff
S Karlin
S Karlin
SF Altschul
SF Altschul
SF Altschul
T Koestler
VJ Promponas
VR Chechetkin
VS Pande
WR Pearson
Y Kashi
Publication venue: BioMed Central
Publication date: 01/01/2011

Background: Sequence analysis aims to identify biologically relevant signals against a backdrop of functionally meaningless variation. Increasingly, it is recognized that the quality of the background model directly affects the performance of analyses. State-of-the-art approaches rely on classical sequence models that are adapted to the studied dataset. Although they perform well in the analysis of globular protein domains, these models break down in regions of stronger compositional bias or low complexity. While these regions are typically filtered out, there is increasing anecdotal evidence that they have functional roles. This motivates an exploration of more complex sequence models and application-specific approaches for the investigation of biased regions.
Results: Traditional Markov chains and application-specific regression models are compared using the example of predicting runs of single amino acids, a particularly simple class of biased regions. Cross-validation experiments reveal that the alternative regression models capture the multivariate trends well, despite their low dimensionality and in contrast even to higher-order Markov predictors. We show how the significance of unusual observations can be computed for such empirical models. The power of a dedicated model in the detection of biologically interesting signals is then demonstrated in an analysis identifying the unexpected enrichment of contiguous leucine repeats in signal peptides. Considering different reference sets, we show how the question examined actually defines what constitutes the 'background'. Results can thus be highly sensitive to the choice of appropriate model training sets. Conversely, the choice of reference data determines the questions that can be investigated in an analysis.
Conclusions: Using a specific case of studying biased regions as an example, we have demonstrated that the construction of application-specific background models is both necessary and feasible in a challenging sequence analysis situation.

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Publikationsserver der Universitätsbibliothek Bodenkultur Wien

Warwick Research Archives Portal Repository

Identification of co-expression gene networks, regulatory genes and pathways for obesity based on adipose tissue RNA Sequencing in a porcine model

Author: A Baranova
A Das
A Hoshino
A Joshi
A Joshi
A Martí
A Moncada-Pazos
A Oshlack
A Scuteri
A Tönjes
AC Nica
AG Smith
AM Pino
AW Ferrante
B Haas
B Li
B Zhang
C Clarke
CJ Rosen
CM Matter
CN Lumeng
Daria V Zhernakova
DK Slonim
E Bonnet
E Ortega Martinez de Victoria
E Segal
EE Kershaw
EK Speliotes
F Chan Yingguang
F Wang
FR Day
G Matarese
G-Y Hou
GA Bray
GJ Tranah
GK Smyth
H Jeong
H Morgan
Haja N Kadarmideen
HN Kadarmideen
HN Kadarmideen
HN Kadarmideen
HN Kadarmideen
HY Chuang
I Iatan
IA Ferreira
J Gomez-Ambrosi
J Sun
J-H Lee
JA Clowes
JM Gimble
JU Adams
K Heindl
K Jaworski
K Takemura
KE Wellen
KH Pietilainen
L Ginaldi
L Wang
LA Lynch
Lisette J A Kogelman
LJA Kogelman
LJA Kogelman
LJA Kogelman
Lude Franke
M Ahmadian
M Ahmadian
M Cattaneo
M Igarashi
M Keophiphath
M Vaittinen
M Young
M-L Kauts
ME Spurlock
Merete Fredholm
MJ Heller
MJ Robertson
MJ Sweet
MM Tondravi
N Shao
ND Cameron
O Fabre
O Osborn
OD Iancu
P Codoñer-Franch
P Langfelder
P Langfelder
PP Łabaj
PS Patel
R Edgar
R MacLaren
R Stienstra
R Tabassum
R Core Team
RDG Leslie
RO Alvim
S Anders
S Choi
S Haider
S Wuschke
SFA Grant
SL Ferrari
SP Weisberg
Susanna Cirera
T Fuller
T Johansen
T Mahdi
T Michoel
T Naka
TM Darlington
V Vermeirssen
W Zhao
Z Wang
Z Yuan
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/2014

Background: Obesity is a complex metabolic condition strongly associated with various diseases, such as type 2 diabetes, and has major public health and economic implications. Obesity is the result of environmental and genetic factors and their interactions, including genome-wide genetic interactions. Identification of co-expressed and regulatory genes in RNA extracted from relevant tissues representing lean and obese individuals provides an entry point for the identification of genes and pathways of importance to the development of obesity. The pig, an omnivorous animal, is an excellent model for human obesity, offering the possibility to study in depth the organ-level transcriptomic regulation of obesity, which is unfeasible in humans. Our aim was to reveal adipose tissue co-expression networks, pathways and transcriptional regulation of obesity using RNA Sequencing based systems biology approaches in a porcine model.
Methods: We selected 36 animals for RNA Sequencing from a previously created F2 pig population representing three extreme groups based on their predicted genetic risks for obesity. We applied Weighted Gene Co-expression Network Analysis (WGCNA) to detect clusters of highly co-expressed genes (modules). Additionally, regulator genes were detected using Lemon-Tree algorithms.
Results: WGCNA revealed five modules which were strongly correlated with at least one obesity-related phenotype (correlations ranging from -0.54 to 0.72, P < 0.001). Functional annotation identified pathways shedding light on the association between obesity and other diseases, such as osteoporosis (osteoclast differentiation, P = 1.4E-7) and immune-related complications (e.g. natural killer cell mediated cytotoxicity, P = 3.8E-5; B cell receptor signaling pathway, P = 7.2E-5). Lemon-Tree identified three potential regulator genes, with confidence scores, for the WGCNA module associated with osteoclast differentiation: CCR1, MSR1 and SI1 (probability scores 95.30, 62.28 and 34.58, respectively). Moreover, detection of differentially connected genes identified various genes previously associated with obesity in humans and rodents, e.g. CSF1R and MARC2.
Conclusions: To our knowledge, this is the first study to apply systems biology approaches using porcine adipose tissue RNA-Sequencing data in a genetically characterized porcine model for obesity. We revealed complex networks, pathways, and candidate and regulatory genes related to obesity, confirming the complexity of obesity and its association with immune-related disorders and osteoporosis.

Proceedings - University of Groningen

Crossref

University of Groningen

Springer - Publisher Connector

ARTS repository - University of Groningen

Copenhagen University Research Information System

PubMed Central

Dissertations of the University of Groningen

Managing and Optimizing Bioinformatics Workflows for Data Analysis in Clouds

Author: A Goderis
A Tiwari
B Koller
B Langmead
B Linke
B Rochwerger
BD Halligan
C Cantacessi
C Trapnell
D Hull
D Smedley
David P. Kreil
E Deelman
E Pennisi
GE Robinson
H Li
Ivona Brandic
J Goecks
JO Kephart
LD Stein
M Maurer
Michael Maurer
ML Massie
P Romano
Patrick Stern
Paweł P. Łabaj
PP Labaj
R Buyya
VC Emeakaroha
VC Emeakaroha
Vincent C. Emeakaroha
Publication venue: Springer Science and Business Media LLC

Crossref

Detecting and correcting systematic variation in large-scale RNA sequencing data

Author: A Dobin
A Goncalves
A Roberts
AK Tripathi
AR Quinlan
BE Bernstein
C Trapnell
C Trapnell
CA Ball
Charles Wang
Christopher E Mason
CW Law
D Aird
D Risso
D Thierry-Mieg
DA Casciano
Danielle Thierry-Mieg
David P Kreil
DS DeLuca
DW Barnett
H Dvinge
H Ji
H Li
H Li
H Wang
J Lonsdale
JA Gagnon-Bartsch
Jean Thierry-Mieg
JH Bullard
JK Pickrell
John Phan
JT Dudley
JT Leek
JT Leek
K Wang
KD Hansen
KD Hansen
L Pipes
L Shi
L Wang
Leming Shi
LM Bragg
M Lawrence
M Mooney
May Wang
MD Robinson
MD Robinson
MD Robinson
NJ Loman
O Stegle
P Baldi
PA 't Hoen
Paul Zumbo
Paweł P Łabaj
Peter Sykacek
Po-Yen Wu
PP Łabaj
RA Irizarry
S van Heesch
Sheng Li
SK Schulze
SM Purcell
TC Glenn
VG Cheung
Wei Shi
Y Benjamini
Publication venue: Springer Science and Business Media LLC
Publication date: 24/08/2014

We would like to thank the vendors of the SEQC for contributing many of the resources and reagents needed for completing these projects, including the sequencing and primary data analysis. The Weill Cornell Medical College Epigenomics Core Facility provided support for use of their sequencing machines and technical assistance during sequencing. P.P.Ł., P.S. and D.P.K. acknowledge support by the Vienna Scientific Cluster (VSC), the Vienna Science and Technology Fund (WWTF), Baxter AG, Austrian Research Centres (ARC) Seibersdorf and the Austrian Centre of Biopharmaceutical Technology (ACBT). S.L. would like to thank C. Zhang and T. Vincent for constructive discussions. This work was supported by funding from the National Institutes of Health (NIH), including R01HG006798, R01NS076465 and R01CA149566, as well as funds from the Irma T. Hirschl and Monique Weill-Caulier Charitable Trusts and the STARR Consortium (I7-A765).

Crossref

PubMed Central

Warwick Research Archives Portal Repository

University of Melbourne Institutional Repository

A comprehensive assessment of RNA-seq accuracy, reproducibility and information content by the Sequencing Quality Control Consortium

Author: A Agarwal
A Dobin
A Mortazavi
C Trapnell
CW Law
D Aird
D Kim
D Thierry-Mieg
ET Wang
F Rapaport
H VanGuilder
J Harrow
JC Marioni
JK Pickrell
JM Toung
JZ Levin
KD Pruitt
L Shi
LM McIntyre
M Dai
M David
M Fasold
M-A Dillies
MD Robinson
MD Robinson
N Raghavachari
P Glaus
P Sykacek
PP Łabaj
R Shippy
S Djebali
S Hochreiter
S Liu
S Liu
SC Baker
T Qing
U Mueckstein
W Huber
W Xu
Y Benjamini
Y Liao
Y Liao
Y Liu
Y Yu
Z Wu
Publication venue: Springer Science and Business Media LLC

Crossref

Gene co-expression network analysis identifies porcine genes associated with variation in metabolizing fenbendazole and flunixin meglumine in the liver

Author: A Conesa
A Kommadath
A Lindholm
A Oshlack
AS Hyde
B Zhang
CY Chow
D Villar
DR Hennessy
E Bendixen
E Ravasz
F Gutzler
IA Pikuleva
JJ Eloranta
JP Steibel
JT Howard
JT Howard
K Königsson
K Suga
KM Wasan
LH Smith
LJA Kogelman
LW Kissell
M Ashburner
M Okamura
M Sasaki
MAM Groenen
MB Petersen
MD Pairis-Garcia
MD Robinson
ME Ritchie
N Dogra
P Langfelder
P Langfelder
PP Łabaj
Q Ma
RE Baynes
S Durinck
S Durinck
S Tsutsumi
TD Porter
TF Landers
UM Zanger
WM Pandak
Z Lin
Z Lin
Publication venue: Springer Science and Business Media LLC

Crossref

The concordance between RNA-seq and microarray data depends on chemical treatment and transcript abundance

Author: A Mortazavi
A Sirbu
Andreas Scherer
B Ganter
BA Merrick
Binsheng Gong
C Li
C Trapnell
Cesare Furlanello
Charles Wang
D Bottomly
D Thierry-Mieg
Dalila Megherbi
Daniel L Svoboda
Danielle Thierry-Mieg
David P Kreil
Davide Albanese
E Wingender
E Wingender
Florian Caiment
FM Giorgi
Giuseppe Jurman
Haiqing Li
Hong Fang
Hui-Rong Qian
Huixiao Hong
I Kupershmidt
I Nookaew
J Lovén
J Lu
James C Fuscoe
JC Marioni
Jean Thierry-Mieg
JH Malone
Jian Wang
Jianying Li
Jie Shen
Joe Meehan
Joost van Delft
Jos Kleinjans
Joshua Xu
JR Bradford
Jui-Hua Hsieh
Ke K Zhang
L Guo
L Shi
L Shi
L Shi
Lee J Lancashire
Leming Shi
LM McIntyre
Lu Yang
M Chen
M Mooney
M Sultan
MA Hamburg
Marco Chierici
Marina Bessarabova
MD Robinson
Michele Filosi
N Raghavachari
Paweł P Łabaj
Pierre R Bushel
PP Łabaj
RA Irizarry
Richard S Paules
Roberto Visintainer
S Anders
S Subramaniam
Samantha Riccadonna
SC Baker
Scott S Auerbach
Stan Gaj
T Breslin
Viswanath Devanarayan
W Xu
Weida Tong
WM Liu
X Fan
Xiaojin Li
Y Katz
Y Xiong
Yong Yang
Youping Deng
Yuri Nikolsky
Z Su
Z Wu
Zhenqiang Su
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/2014

The concordance of RNA-sequencing (RNA-seq) with microarrays for genome-wide analysis of differential gene expression has not been rigorously assessed using a range of chemical treatment conditions. Here we use a comprehensive study design to generate Illumina RNA-seq and Affymetrix microarray data from the same liver samples of rats exposed in triplicate to varying degrees of perturbation by 27 chemicals representing multiple modes of action (MOAs). The cross-platform concordance in terms of differentially expressed genes (DEGs) or enriched pathways is linearly correlated with treatment effect size (R² ≈ 0.8). Furthermore, the concordance is also affected by transcript abundance and biological complexity of the MOA. RNA-seq outperforms microarray (93% versus 75%) in DEG verification as assessed by quantitative PCR, with the gain mainly due to its improved accuracy for low-abundance transcripts. Nonetheless, classifiers to predict MOAs perform similarly when developed using data from either platform. Therefore, the endpoint studied and its biological complexity, transcript abundance and the genomic application are important factors in transcriptomic research and for clinical and regulatory decision making.

Maastricht University Research Portal

Crossref

Archivio istituzionale della ricerca - Fondazione Edmund Mach

Archivio della ricerca - Fondazione Bruno Kessler

PubMed Central

Warwick Research Archives Portal Repository

Identification of Optimum Sequencing Depth Especially for De Novo Genome Assembly of Small Genomes Using Next Generation Sequencing Data

Author: Aarti Desai
Abhay Jere
Akshay Yadav
C Rödelsperger
CM Wade
DR Scannell
DR Zerbino
ES Lander
EW Myers
HQ Dinh
J Shendure
J Wang
JA Chapman
JL Wang
JM Rothberg
JR Miller
JT Simpson
Kishor Dhaygude
LW Hillier
M Chaki
M Kircher
M Margulies
MC Schatz
ML Metzker
MS Tantia
N Haiminen
O Harismendy
PA Pevzner
PP Łabaj
R Garg
R Li
RA Holt
RL Warren
S Boisvert
S Diguistini
S Gnerre
S Kurtz
Shu-Dong Zhang
SL Salzberg
SM Huse
Ujwala Bangar
V Costa
Veer Singh Marwah
Vineet Jha
Vivek Kulkarni
W Yu
W Zhang
WR Jeck
Y Lin
Y Peng
Publication venue: Public Library of Science (PLoS)

Crossref