Search CORE

16 research outputs found

Prioritization of candidate genes in QTL regions based on associations between traits and biological processes

Author: A Sifrim
A Subramanian
Aalt DJ van Dijk
AM Hancock
AN Egan
B Han
C Chen
C Herold
C Jung
C Zhang
D Bornigen
D Shriner
DJ Schaid
DM Goodstein
E Durand
E Fridman
F Fornara
F Supek
F Tian
Gabino F Sanchez-Perez
H Wuriyanghan
I Lee
J Chen
J Jin
J Li
J Ni
JA Fawcett
Jan-Peter Nap
JN Cobb
Joachim W Bargsten
JW Bargsten
JX Shan
K Wang
K Youens-Clark
K Zhao
KC Falke
L Hou
L Sun
LA Hindorff
M Ashburner
M Falda
M Fujisawa
M Ikeda
M Mutwil
M Rauf
N Atias
P Armengaud
P Gour
P Holmans
P Radivojac
P Wang
PY Chibon
R Breitling
R Monclus
RDC Team
RK Varshney
RS Meyer
S Atwell
S Grossmann
S Ouyang
S Wang
SC Chantha
SY Rhee
T Lenser
T Liu
W Qi
W Wu
X Bai
X Dai
X Gao
X Huang
X Huang
X Wei
X Zhang
Y Benjamini
Y Liu
Y Makita
Y Makita
Y Moreau
Y Su
Y Xiong
YA Kourmpetis
YA Kourmpetis
Z Gao
Z Milec
ZK Li
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Bayesian Markov Random Field Analysis for Protein Function Prediction Based on Network Data

Author: A Kuzniar
A Vazquez
Aalt D. J. van Dijk
AJ Enright
C Moler
Cajo J. F. ter Braak
CJF Ter Braak
CJF Ter Braak
CM Federovitch
DJC MacKay
GD Bader
GR Lanckriet
H Lee
I Kosmidis
I Ulitsky
Iddo Friedberg
IM Cheeseman
J Besag
JA Hanley
L Milligan
L Peña Castillo
M Ashburner
M Deng
M Deng
M Punta
Marco C. A. M. Bink
N Nariai
NJ Mulder
P McCullagh
R Sharan
RI Kondor
Roeland C. H. J. van Ham
S Ferré
S Geman
S Letovsky
S Mostafavi
SF Altschul
SR Collins
SZ Li
T Gabaldon
U Karaoz
V Vethantham
XL Chen
Y Chen
Y Guan
Yiannis A. I. Kourmpetis
Z Barutcuoglu
Z Wei
Publication venue: Public Library of Science
Publication date: 01/01/2010
Field of study

Inference of protein functions is one of the most important aims of modern biology. To fully exploit the large volumes of genomic data typically produced in modern-day genomic experiments, automated computational methods for protein function prediction are urgently needed. Established methods use sequence or structure similarity to infer functions but those types of data do not suffice to determine the biological context in which proteins act. Current high-throughput biological experiments produce large amounts of data on the interactions between proteins. Such data can be used to infer interaction networks and to predict the biological process that the protein is involved in. Here, we develop a probabilistic approach for protein function prediction using network data, such as protein-protein interaction measurements. We take a Bayesian approach to an existing Markov Random Field method by performing simultaneous estimation of the model parameters and prediction of protein functions. We use an adaptive Markov Chain Monte Carlo algorithm that leads to more accurate parameter estimates and consequently to improved prediction performance compared to the standard Markov Random Fields method. We tested our method using a high quality S.cereviciae validation network with 1622 proteins against 90 Gene Ontology terms of different levels of abstraction. Compared to three other protein function prediction methods, our approach shows very good prediction performance. Our method can be directly applied to protein-protein interaction or coexpression networks, but also can be extended to use multiple data sources. We apply our method to physical protein interaction data from S. cerevisiae and provide novel predictions, using 340 Gene Ontology terms, for 1170 unannotated proteins and we evaluate the predictions using the available literature

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Wageningen University & Research Publications

Finished genome of the fungal wheat pathogen Mycosphaerella graminicola Reveals dispensome structure, chromosome plasticity, and stealth pathogenesis.

201

Repository Open Access to Scientific Information from Embrapa

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

RCAAP - Repositório Científico de Acesso Aberto de Portugal

Identification of Colorectal Cancer Related Genes with mRMR and Shortest Path in Protein-Protein Interaction Network

Author: B Bakall
B Hoeft
BC Christensen
Bi-Qing Li
C Deves
C Hiranuma
CA Borgono
D Landi
D Liu
D Menendez
D Szklarczyk
DN Georgiou
DW Parsons
E Dijkstra
E Nabieva
EP Diamandis
EP Diamandis
G Lagger
G Thomas
GP Zhou
GP Zhou
GP Zhou
GR Howe
H Mohabatkar
H Mohabatkar
H Peng
H Stohr
H Tsukahara
HE MacLean
I Niittymaki
I Ohkubo
IJ Kim
IW Althaus
J Andraos
J Cui
J Li
J Sabates-Bellver
JH Friedman
JL Huret
JR Reeves
K Hibi
K Imai
K Yu
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KL Ng
Kuo-Chen Chou
L Castagnetta
L Chen
L Chen
L Hu
L Hu
LD Wood
Lei Liu
LL Hu
M Esmaeili
M Katoh
M Levesque
M Talieri
M Thangaraju
MG Catalano
ML Slattery
MS Kim
MW Medina
P Bogdanov
P Polakis
Paulo Lee Ho
Q Gu
Q Liu
R Sharan
RA Irizarry
S Jones
S Letovsky
SA Gayther
SA Johnson
SH Nagaraj
SM Lipkin
T Denoeux
T Hinoue
T Huang
T Huang
T Huang
T Huang
T Huang
T Huang
T Huang
T Huang
T Huang
T Huang
T Morikawa
Tao Huang
TS Keshava Prasad
U Karaoz
W Huang da
W van Criekinge
WL Allen
X Xiao
XY Yang
Y Benjamini
Y Cai
YA Kourmpetis
YD Cai
Yu-Dong Cai
ZC Wu
Publication venue: Public Library of Science
Publication date: 01/01/2012
Field of study

One of the most important and challenging problems in biomedicine and genomics is how to identify the disease genes. In this study, we developed a computational method to identify colorectal cancer-related genes based on (i) the gene expression profiles, and (ii) the shortest path analysis of functional protein association networks. The former has been used to select differentially expressed genes as disease genes for quite a long time, while the latter has been widely used to study the mechanism of diseases. With the existing protein-protein interaction data from STRING (Search Tool for the Retrieval of Interacting Genes), a weighted functional protein association network was constructed. By means of the mRMR (Maximum Relevance Minimum Redundancy) approach, six genes were identified that can distinguish the colorectal tumors and normal adjacent colonic tissues from their gene expression profiles. Meanwhile, according to the shortest path approach, we further found an additional 35 genes, of which some have been reported to be relevant to colorectal cancer and some are very likely to be relevant to it. Interestingly, the genes we identified from both the gene expression profiles and the functional protein association network have more cancer genes than the genes identified from the gene expression profiles alone. Besides, these genes also had greater functional similarity with the reported colorectal cancer genes than the genes identified from the gene expression profiles alone. All these indicate that our method as presented in this paper is quite promising. The method may become a useful tool, or at least plays a complementary role to the existing method, for identifying colorectal cancer genes. It has not escaped our notice that the method can be applied to identify the genes of other diseases as well

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

FigShare

Metric Labeling and Semi-metric Embedding for Protein Annotation Prediction

Author: A. Schlicker
A. Vazquez
A.C. Gavin
A.P. Gasch
B. Schwikowski
C. Chekuri
C. Stark
D. Dotan-Cohen
D. Lin
D. Lin
E. Nabieva
H. Hishigaki
H. Lee
J. Cheng
J. Chuzhoy
J.C. Rain
L.J. Jensen
M. Deng
M.D. Kui
M.P. Kumar
N. Komodakis
P. Resnik
P. Uetz
R. Sharan
S.Z. Li
T. Ito
U. Karaoz
W.K. Huh
Y. Boykov
Y. Ho
Y.A. Kourmpetis
Publication venue
Publication date: 01/01/2011
Field of study

Computational techniques have been successful at predicting protein function from relational data (functional or physical interactions). These prediction techniques have been used to generate hypotheses and to direct experimental validation. With few exceptions, these predictive tasks are modeled as multi-label classification problems where the labels (functions) are treated independently or semi-independently. However, databases such as the Gene Ontology provide more information about the similarities between functions. It is a largely open question how much the use of relationships between functions can improve the quality of function prediction techniques. In this paper, we explore the use of the Metric Labeling combinatorial optimization problem to make use of heuristically computed distances between functions to make more accurate predictions of protein function in networks derived from both physical interactions and a combination of other data types. To do this, we give a new technique (based on convex optimization) for converting heuristic semimetric distances (from, e.g. Gene Ontology) into a metric that finds an embedding of the semimetric into a metric with minimum least-squares distortion (LSD). The Metric Labeling approach is shown to outperform 5 existing techniques for inferring function from networks. These results suggest Metric Labeling is useful for protein function prediction, and that our LSD minimization approach can help solve the problem of converting heuristic distances to a metric. 1

CiteSeerX

Crossref

Similarities between plant traits based on their connection to underlying gene functions

Author: A. Gusev
A.M. Wentzell
Aalt D. J. van Dijk
B. Patra
B. Patra
C. Hu
C. Riedelsheimer
D. Arends
D.J. Slotboom
F. Matsuda
F. Supek
G.K. Mazandu
G.S. Maloney
Gabino F. Sanchez-Perez
H. Caniza
H. Chen
H.J. Chen
I. Dalle-Donne
J. Lisec
J. Ni
J. Wang
J.D. Peleman
J.E. Melaragno
J.N. Cobb
J.W. Bargsten
J.W. Bargsten
Jan-Peter Nap
K. Youens-Clark
L. Gong
Lewis Lukens
M. Ashburner
M. Falda
M. Salehin
M.A. Matamoros
M.E. Smoot
N. Fahlgren
N.E. Soltis
P. Krajewski
P.B. Goud
R. Sulpice
S. Xiao
W. Wen
W. Wen
W.L. Araujo
X. He
X. Zhang
Y. Yin
Y.A. Kourmpetis
Y.A. Kourmpetis
Z. Gu
Publication venue: 'Public Library of Science (PLoS)'
Publication date
Field of study

Crossref

Protein function prediction by collective classification with explicit and implicit edges in protein-protein interaction networks

Author: A Ruepp
A Ruepp
A Vazquez
A Wallace
A Wallace
B Adamcsek
B Rost
B Schwikowski
C Brun
C Stark
D Bu
E Becker
E Nabieva
H Chua
H Taubig
HN Chua
Hui Liu
I Friedberg
Jihong Guan
KL Ng
L Hu
M Ashburner
M Deng
N Hulo
P Bogdanov
P Sen
R Dunn
R Sharan
R Sleator
RE Fan
S Altschul
S Damian
S Letovsky
Shuigeng Zhou
U Güldener
U Karaoz
V Arnau
W Xiong
Wei Xiong
WR Gilks
Y Ye
YAI Kourmpetis
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Evolved hexose transporter enhances xylose uptake and glucose/xylose co-utilization in Saccharomyces cerevisiae

Author: A Farwick
A Madhavan
AA Saloheimo
C Snowdon
C Wang
CF Wahlbom
CF Wahlbom
D Brat
D Runquist
D Solis-Escalante
E Young
E Young
EM Young
EM Young
F Fang
F Zhang
FM Gírio
H Viklund
H Zhou
I Ulitsky
J Du
J Nielsen
J Zha
JA Barnett
JD Keasling
JG Nijland
JT Robinson
K-K Hong
L Sun
LA Kelley
LN Latimer
M Gárdonyi
M Kuyper
M Kuyper
M Sedlak
M Walfridsson
M Walfridsson
MH Toivari
MJ Leandro
MJ Leandro
NS Parachin
NW Ho
P Kötter
RD Gietz
RE Hector
S Ozcan
S Pitre
SR Kim
T Hamacher
T Kasahara
T Subtil
T Weierstall
T-H Lee
TK Sato
TS Ham
Y Shen
Y-S Jin
YAI Kourmpetis
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2016
Field of study

Enhancing xylose utilization has been a major focus in Saccharomyces cerevisiae strain-engineering efforts. The incentive for these studies arises from the need to use all sugars in the typical carbon mixtures that comprise standard renewable plant-biomass-based carbon sources. While major advances have been made in developing utilization pathways, the efficient import of five carbon sugars into the cell remains an important bottleneck in this endeavor. Here we use an engineered S. cerevisiae BY4742 strain, containing an established heterologous xylose utilization pathway, and imposed a laboratory evolution regime with xylose as the sole carbon source. We obtained several evolved strains with improved growth phenotypes and evaluated the best candidate using genome resequencing. We observed remarkably few single nucleotide polymorphisms in the evolved strain, among which we confirmed a single amino acid change in the hexose transporter HXT7 coding sequence to be responsible for the evolved phenotype. The mutant HXT7(F79S) shows improved xylose uptake rates (Vmax = 186.4 ± 20.1 nmol•min(−1)•mg(−1)) that allows the S. cerevisiae strain to show significant growth with xylose as the sole carbon source, as well as partial co-utilization of glucose and xylose in a mixed sugar cultivation

Crossref

PubMed Central

eScholarship - University of California

A domain-centric solution to functional genomics via dcGO Predictor

Author: A Andreeva
A Sebe-Pedros
A Subramanian
AA Parikesit
BT Sherman
C Chothia
C Vogel
D Lee
D Wilson
D Wilson
DV Lavrov
F Denoeud
G Manning
GA Reeves
H Fang
H Fang
H Ledford
Hai Fang
I Friedberg
I Ruiz-Trillo
J Gough
Julian Gough
K Drew
L Malmstrom
L Pena-Castillo
M Conejo
M Madera
M Punta
MF Rogers
MJ Davis
MK Basu
ML Metzker
N King
N Nariai
OG Troyanskaya
R Pethica
R Rentzsch
RE Michod
S Chavali
S Hunter
S Velankar
SR Eddy
T Hawkins
WA Lim
Y Benjamini
YA Kourmpetis
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Large-scale identification of human protein function using topological features of interaction network

Author: A Baudot
A Benso
A Sokolov
A Vazquez
B Liu
CC Chang
D Cozzetto
D Davis
D Piovesan
E Becker
E Boutet
E Nabieva
G Yu
H Lele
H Peng
H Wang
HN Chua
I Dubchak
J Hou
J Lee
J Lee
K Teilum
KC Kao
KL Ng
L Fu
L Lan
L Yao
M Ashburner
M Cao
M Cao
M Hulsman
MH Schaefer
MN Wass
N Youngs
NM Goldenberg
P Radivojac
Q Lv
Q Wu
R Kumar
S Letovsky
S Linse
S Maslov
T Kire
U Stelzl
X Chi
XF Zhang
Y Ofran
YA Kourmpetis
Z Wang
ZL Peng
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref