Search CORE

Scholarly Commons@CWRU

Candidate gene prioritization by network analysis of differential expression using machine learning approaches

Author: A Subramanian
A Zanzoni
AJ Smola
AP Francisco
B Aranda
B Harr
Bart de Moor
C Saunders
C Stark
C von Mering
D Nitsch
D Zieker
Daniela Nitsch
F Chung
F Fouss
Fabian Ojeda
GC Cawley
GD Bader
H Yang
HY Chuang
J Chen
JA Hanley
Joana P Gonçalves
JW Park
K Lage
KR Brown
L Franke
L Gautier
L Salwinski
LC Tranchevent
M Liu
P Baldi
P Pagel
R Gupta
RA Irizarry
RI Kondor
RK Nibbe
S Aerts
S Köhler
S Mirkin
S Razick
S Vardhanabhuti
SE Choe
T Fawcett
WK Lim
Y Saad
Yves Moreau
Z Wu
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Abstract Background Discovering novel disease genes is still challenging for diseases for which no prior knowledge - such as known disease genes or disease-related pathways - is available. Performing genetic studies frequently results in large lists of candidate genes of which only few can be followed up for further investigation. We have recently developed a computational method for constitutional genetic disorders that identifies the most promising candidate genes by replacing prior knowledge by experimental data of differential gene expression between affected and healthy individuals. To improve the performance of our prioritization strategy, we have extended our previous work by applying different machine learning approaches that identify promising candidate genes by determining whether a gene is surrounded by highly differentially expressed genes in a functional association or protein-protein interaction network. Results We have proposed three strategies scoring disease candidate genes relying on network-based machine learning approaches, such as kernel ridge regression, heat kernel, and Arnoldi kernel approximation. For comparison purposes, a local measure based on the expression of the direct neighbors is also computed. We have benchmarked these strategies on 40 publicly available knockout experiments in mice, and performance was assessed against results obtained using a standard procedure in genetics that ranks candidate genes based solely on their differential expression levels (<it>Simple Expression Ranking</it>). Our results showed that our four strategies could outperform this standard procedure and that the best results were obtained using the <it>Heat Kernel Diffusion Ranking </it>leading to an average ranking position of 8 out of 100 genes, an AUC value of 92.3% and an error reduction of 52.8% relative to the standard procedure approach which ranked the knockout gene on average at position 17 with an AUC value of 83.7%. Conclusion In this study we could identify promising candidate genes using network based machine learning approaches even if no knowledge is available about the disease or phenotype.</p

Springer - Publisher Connector

Computational Mass Spectrometry–Based Proteomics

Author: A Bell
A Bertsch
A Keller
A Subramanian
A Thompson
AC Gavin
AHP America
AI Nesvizhskii
AI Nesvizhskii
AK Yocum
AL Boulesteix
AL Oberg
AR Joyce
AW Liew
B Domon
B MacLean
B Schwanhäusser
C Ansong
C H
C Kumar
CH Ahrens
D Huang
DF Ransohoff
DH Lundgren
DL Tabb
DW Huang
EW Deutsch
F Emmert-Streib
Fran Lewitter
H Choi
H Lam
J Cox
J Cox
J Hu
JA Cham Mead
JD Venable
JV Olsen
JV Olsen
K Jeong
L Käll
L Nie
L Reiter
L Reiter
L Valledor
LMF de Godoy
LN Mueller
Lukas Käll
M Ackermann
M Beck
M Gstaiger
M Mann
M Sturm
M Uhlen
MW Duncan
MYK Brusniak
N Bandeira
N Castellana
N Gehlenborg
N Gupta
N Gupta
N Rifai
NL Anderson
NM Griffin
NR Kitteringham
Olga Vitek
P Mallick
P Picotti
P Picotti
PL Ross
R Aebersold
R Clarke
R de Sousa Abreu
R Moore
R Sharan
R Wu
RA Irizarry
RK Nibbe
S Abbatiello
S Carr
S Dasari
S Hanash
S Pan
SE Ong
SJ Callister
SS Huang
T Aittokallio
T Clough
T Geiger
T Maier
T Nilsson
TA Addona
TC Walther
TH Corzett
V Granholm
V Lange
WX Schulze
Y Karpievitch
YF Li
Publication venue: Public Library of Science
Publication date: 01/12/2011
Field of study

Public Library of Science (PLOS)

Google Goes Cancer: Improving Outcome Prediction for Cancer Patients by Network-Based Ranking of Marker Genes

Author: A Jemal
A Murat
A Rosenwald
AL Boulesteix
AR Venkitaraman
B Gudjonsson
B Schölkopf
BE Boser
Beatrix Jahnke
C Plake
Christian Pilarsky
Christof Winter
CQ Zhu
D Bogunovic
Daniela Aust
DD Stocken
DG Beer
Donna K. Slonim
EJ Yeoh
F Li
Felix Rückert
G Chen
G Garcea
G Kristiansen
G Lenz
Glen Kristiansen
GR Mishra
H Yu
H Zhao
Hans J. Schlitt
Hans-Detlev Saeger
Helmut Friess
HG Beger
HY Chuang
I Guyon
IS Lossos
J Ferlay
Janine Roy
JC Yao
JE Darnell
JE Korkola
JJ Smith
JK Stratford
JL Morrison
JS Lee
K Shedden
L Bullinger
L Ein-Dor
L Lin
L Page
L Page
L Royer
LJ van't Veer
M Buyse
M Johannes
M Kanehisa
M Raponi
M West
MA Shipp
Marco Niedergethmann
Marcus Bahra
Markus Büchler
Michael Schroeder
MJ van de Vijver
NY Jiang
P Chaturvedi
PC Boutros
Petra Rümmele
R Eferl
RA Irizarry
RJ Tibshirani
RK Nibbe
Robert Grützmann
S Michiels
SC Mok
SL Pomeroy
SS Dave
Stephan Kersting
T Obayashi
T Sørlie
TF Ørntoft
Thomas Knösel
TR Golub
Utz Settmacher
V Matys
Vera Hentrich
VG Tusher
VN Vapnik
Wilko Weichert
Y Wang
Publication venue: Public Library of Science
Publication date: 01/01/2012
Field of study

Predicting the clinical outcome of cancer patients based on the expression of marker genes in their tumors has received increasing interest in the past decade. Accurate predictors of outcome and response to therapy could be used to personalize and thereby improve therapy. However, state of the art methods used so far often found marker genes with limited prediction accuracy, limited reproducibility, and unclear biological relevance. To address this problem, we developed a novel computational approach to identify genes prognostic for outcome that couples gene expression measurements from primary tumor samples with a network of known relationships between the genes. Our approach ranks genes according to their prognostic relevance using both expression and network information in a manner similar to Google's PageRank. We applied this method to gene expression profiles which we obtained from 30 patients with pancreatic cancer, and identified seven candidate marker genes prognostic for outcome. Compared to genes found with state of the art methods, such as Pearson correlation of gene expression with survival time, we improve the prediction accuracy by up to 7%. Accuracies were assessed using support vector machine classifiers and Monte Carlo cross-validation. We then validated the prognostic value of our seven candidate markers using immunohistochemistry on an independent set of 412 pancreatic cancer samples. Notably, signatures derived from our candidate markers were independently predictive of outcome and superior to established clinical prognostic factors such as grade, tumor size, and nodal status. As the amount of genomic data of individual tumors grows rapidly, our algorithm meets the need for powerful computational approaches that are key to exploit these data for personalized cancer therapies in clinical practice

FigShare

Cell cycle and aging, morphogenesis, and response to stimuli genes are individualized biomarkers of glioblastoma progression and survival

Author: A Ganguly
A Martin
A Takeno
B Gyorffy
B Kwabi-Addo
B Salhia
BC Christensen
Bruce R Southey
C Brennan
C Chen
C Dai
C Houillier
C Prapinjumrune
C Welch
Cancer Genome Atlas Research Network
CE Pelloski
CI Dumur
CL Nutt
D Cigognini
D Krex
D Maucort-Boulch
D Michael
D Wang
DF Schaeffer
DN Martin
DR Cox
E Blaveri
E Razis
EU Sim
F Al-Shahrour
F Gao
FV Jacinto
G Minniti
G Sala
G Thomas
G Wang
H Ohgaki
HP Li
HS Phillips
I Nindl
IP Trougakos
J Madoz-Gurpide
J Novakova
J Rohozinski
J Soulier
J van den Boom
J Zhang
JA Doherty
JD Carpten
JG Hodgson
JH Kim
JM Campbell
JM Dreyfuss
JM Nigro
JN Rich
JN Rich
Jonathan E Beever
K Graham
KC Wei
KH Vousden
KK Lagerstedt
KL Gorringe
KR Delfino
Kristin R Delfino
L Frederick
L Wang
LP Fernandez
LY Chuang
LY Chuang
M Ashburner
M Bredel
M Ferletta
M Grade
M Kanehisa
M Lae
M Ocejo-Garcia
M Schraders
M Shirahata
M Tessema
M Weller
M Wrensch
ME Halatsch
ME Mullendore
MJ McGirt
MW Smith
N Butowski
N Ikenaga
NF Marko
Nicola VL Serão
P Bhatti
P Shannon
PA Lachenbruch
PS Mischel
R Baskar
R Garcia-Munoz
R Lymbouridou
RA Calogero
RG Verhaak
RK Nibbe
RL Alterman
S Chevillard
S Comincini
S Dong
S Fre
S Hasegawa
S Kesari
S Madhavan
S Mittal
S Pavlides
S Rorive
Sandra L Rodriguez-Zas
SP Reddy
T John
T Onda
T Suzuki
T Watanabe
TA Chan
TJ MacDonald
TK Jenssen
U Petrausch
W Cheng
W Sun
X Castells
Y Fu
Y Liu
Y Zeng
YF Lau
Z Chen
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Abstract Background Glioblastoma is a complex multifactorial disorder that has swift and devastating consequences. Few genes have been consistently identified as prognostic biomarkers of glioblastoma survival. The goal of this study was to identify general and clinical-dependent biomarker genes and biological processes of three complementary events: lifetime, overall and progression-free glioblastoma survival. Methods A novel analytical strategy was developed to identify general associations between the biomarkers and glioblastoma, and associations that depend on cohort groups, such as race, gender, and therapy. Gene network inference, cross-validation and functional analyses further supported the identified biomarkers. Results A total of 61, 47 and 60 gene expression profiles were significantly associated with lifetime, overall, and progression-free survival, respectively. The vast majority of these genes have been previously reported to be associated with glioblastoma (35, 24, and 35 genes, respectively) or with other cancers (10, 19, and 15 genes, respectively) and the rest (16, 4, and 10 genes, respectively) are novel associations. <it>Pik3r1</it>, <it>E2f3, Akr1c3</it>, <it>Csf1</it>, <it>Jag2</it>, <it>Plcg1</it>, <it>Rpl37a</it>, <it>Sod2</it>, <it>Topors</it>, <it>Hras</it>, <it>Mdm2, Camk2g</it>, <it>Fstl1</it>, <it>Il13ra1</it>, <it>Mtap </it>and <it>Tp53 </it>were associated with multiple survival events. Most genes (from 90 to 96%) were associated with survival in a general or cohort-independent manner and thus the same trend is observed across all clinical levels studied. The most extreme associations between profiles and survival were observed for <it>Syne1</it>, <it>Pdcd4</it>, <it>Ighg1</it>, <it>Tgfa</it>, <it>Pla2g7</it>, and <it>Paics</it>. Several genes were found to have a cohort-dependent association with survival and these associations are the basis for individualized prognostic and gene-based therapies. <it>C2</it>, <it>Egfr</it>, <it>Prkcb</it>, <it>Igf2bp3</it>, and <it>Gdf10 </it>had gender-dependent associations; <it>Sox10</it>, <it>Rps20</it>, <it>Rab31</it>, and <it>Vav3 </it>had race-dependent associations; <it>Chi3l1</it>, <it>Prkcb</it>, <it>Polr2d</it>, and <it>Apool </it>had therapy-dependent associations. Biological processes associated glioblastoma survival included morphogenesis, cell cycle, aging, response to stimuli, and programmed cell death. Conclusions Known biomarkers of glioblastoma survival were confirmed, and new general and clinical-dependent gene profiles were uncovered. The comparison of biomarkers across glioblastoma phases and functional analyses offered insights into the role of genes. These findings support the development of more accurate and personalized prognostic tools and gene-based therapies that improve the survival and quality of life of individuals afflicted by glioblastoma multiforme.</p

Springer - Publisher Connector

Investigating ego modules and pathways in osteosarcoma by integrating the EgoNet algorithm and pathway analysis

Author: Benjamini Y
Bernthal NM
Bolstad B
Bolstad BM
Borgatti SP
Chang C-C
Chu W-M
Cohen J
Fletcher CD
Galimberti S
Ganong P
Glazko GV
Goh K-I
Gringhuis SI
Huang J
Irizarry RA
Jordán F
Kamburov A
Kansara M
Kresse SH
Lensen JFM
Liu C
Luborsky J
Ma X
Nahler G
Nakayama Y
Nibbe RK
Ning B
Ning B
Ottaviani G
Penney RB
Routledge R
Szklarczyk D
Vanunu O
Wiontzek M
Wu Y
Yang R
Zhang L
Zhou D
Zhu Y
Publication venue: 'FapUNIFESP (SciELO)'
Publication date: 01/01/2017
Field of study