Search CORE

16 research outputs found

Predicting cancer involvement of genes from heterogeneous data

Author: A Aouacheria
A Chatr-aryamontri
A Subramanian
AC Gavin
AC Gavin
AL Barabasi
AL Welm
B Schwikowski
B Vogelstein
Baldo Oliva
C Alfarano
C Fan
C Stark
Chris Sander
D Hanahan
DA Notterman
DB Allison
DR Rhodes
DR Rhodes
DR Rhodes
DR Rhodes
DX Nguyen
E Kunze
E Segal
EH Davidson
G Joshi-Tope
HJ Lee
J Lim
J Ptacek
JB Welsh
JH Bielas
JJ Hong
JP Mathew
K Lage
KI Goh
L Espana
L Salwinski
LJ Jensen
MA Harris
ME Higgins
O Mendez
P Hu
P Pagel
PA Futreal
PF Jonsson
Q Tian
R Aragues
R Hoffmann
R Ihaka
R Lucito
R Sharan
Ramon Aragues
S Draghici
S Kerrien
S Peri
S Varambally
SA Tomlins
SJ Furney
SM Dhanasekaran
T Mehta
TK Gandhi
VK Mootha
WC Cho
WK Huh
WP Kuo
Y Yuan
Publication venue: BioMed Central
Publication date: 01/01/2008
Field of study

Abstract Background Systematic approaches for identifying proteins involved in different types of cancer are needed. Experimental techniques such as microarrays are being used to characterize cancer, but validating their results can be a laborious task. Computational approaches are used to prioritize between genes putatively involved in cancer, usually based on further analyzing experimental data. Results We implemented a systematic method using the PIANA software that predicts cancer involvement of genes by integrating heterogeneous datasets. Specifically, we produced lists of genes likely to be involved in cancer by relying on: (i) protein-protein interactions; (ii) differential expression data; and (iii) structural and functional properties of cancer genes. The integrative approach that combines multiple sources of data obtained positive predictive values ranging from 23% (on a list of 811 genes) to 73% (on a list of 22 genes), outperforming the use of any of the data sources alone. We analyze a list of 20 cancer gene predictions, finding that most of them have been recently linked to cancer in literature. Conclusion Our approach to identifying and prioritizing candidate cancer genes can be used to produce lists of genes likely to be involved in cancer. Our results suggest that differential expression studies yielding high numbers of candidate cancer genes can be filtered using protein interaction networks. </p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

UPF Digital Repository

EnRICH: Extraction and Ranking using Integration and Criteria Heuristics

Author: Greenlee M. Heather
Serb Jeanne
Serb Jeanne
Zhang Xia
Publication venue: Iowa State University Digital Repository
Publication date: 01/01/2013
Field of study

Background: High throughput screening technologies enable biologists to generate candidate genes at a rate that, due to time and cost constraints, cannot be studied by experimental approaches in the laboratory. Thus, it has become increasingly important to prioritize candidate genes for experiments. To accomplish this, researchers need to apply selection requirements based on their knowledge, which necessitates qualitative integration of heterogeneous data sources and filtration using multiple criteria. A similar approach can also be applied to putative candidate gene relationships. While automation can assist in this routine and imperative procedure, flexibility of data sources and criteria must not be sacrificed. A tool that can optimize the trade-off between automation and flexibility to simultaneously filter and qualitatively integrate data is needed to prioritize candidate genes and generate composite networks from heterogeneous data sources. Results: We developed the java application, EnRICH (Extraction and Ranking using Integration and Criteria Heuristics), in order to alleviate this need. Here we present a case study in which we used EnRICH to integrate and filter multiple candidate gene lists in order to identify potential retinal disease genes. As a result of this procedure, a candidate pool of several hundred genes was narrowed down to five candidate genes, of which four are confirmed retinal disease genes and one is associated with a retinal disease state. Conclusions: We developed a platform-independent tool that is able to qualitatively integrate multiple heterogeneous datasets and use different selection criteria to filter each of them, provided the datasets are tables that have distinct identifiers (required) and attributes (optional). With the flexibility to specify data sources and filtering criteria, EnRICH automatically prioritizes candidate genes or gene relationships for biologists based on their specific requirements. Here, we also demonstrate that this tool can be effectively and easily used to apply highly specific user-defined criteria and can efficiently identify high quality candidate genes from relatively sparse datasets

Digital Repository @ Iowa State University (ISU)

Crossref

PubMed Central

A comparative study of cancer proteins in the human protein-protein interaction network

Author: Sun Jingchun
Zhao Zhongming
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Crossref

Springer - Publisher Connector

PubMed Central

Biana: a software framework for compiling biological interactions and analyzing networks

Author: Aragues Ramon
Garcia-Garcia Javier
Guney Emre
Oliva Baldo
Planas-Iglesias Joan
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Abstract Background The analysis and usage of biological data is hindered by the spread of information across multiple repositories and the difficulties posed by different nomenclature systems and storage formats. In particular, there is an important need for data unification in the study and use of protein-protein interactions. Without good integration strategies, it is difficult to analyze the whole set of available data and its properties. Results We introduce BIANA (Biologic Interactions and Network Analysis), a tool for biological information integration and network management. BIANA is a Python framework designed to achieve two major goals: i) the integration of multiple sources of biological information, including biological entities and their relationships, and ii) the management of biological information as a network where entities are nodes and relationships are edges. Moreover, BIANA uses properties of proteins and genes to infer latent biomolecular relationships by transferring edges to entities sharing similar properties. BIANA is also provided as a plugin for Cytoscape, which allows users to visualize and interactively manage the data. A web interface to BIANA providing basic functionalities is also available. The software can be downloaded under GNU GPL license from <url>http://sbi.imim.es/web/BIANA.php</url>. Conclusions BIANA's approach to data unification solves many of the nomenclature issues common to systems dealing with biological data. BIANA can easily be extended to handle new specific data repositories and new specific data types. The unification protocol allows BIANA to be a flexible tool suitable for different user requirements: non-expert users can use a suggested unification protocol while expert users can define their own specific unification rules.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Warwick Research Archives Portal Repository

UPF Digital Repository

Associations of SNPs located at candidate genes to bovine growth traits, prioritized with an interaction networks construction approach

Author: A Chatr-Aryamontri
A Franceschini
AK Lindholm-Perry
Aldo Segura Cabrera
Ana María Sifuentes-Rincón
BR Voldborg
C Li
CA Ball
Carlos Armando García Pérez
D Lim
D Yin
DE Machugh
DE Moody
EM Marcotte
F Rousset
FF Qiu
Francisco Alejandro Paredes-Sánchez
G Östlund
Gaspar Manuel Parra Bracamonte
HN Chua
I Akisa
I Hulsegge
I Lee
I Lee
I Lee
I Lee
I Lee
I Lee
J Wilson-Rawls
JE Womack
JF Fontaine
JR Garbe
M Bionaz
M Punta
M Raymond
M Zhu
MA Lee
ME Buzanskas
MR Fortes
N Jager De
NS Morsci
Pascuala Ambriz Morales
R Aragues
R Zhang
RA Curi
S Hwang
S Kerrien
T Barrett
T Imai
WK Kim
X Luo
YT Utsunomiya
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Dominating Biological Networks

Author: Bonato Anthony
Memišević Vesna
Milenković Tijana
Pržulj Nataša
Publication venue: Public Library of Science
Publication date: 26/08/2011
Field of study

Proteins are essential macromolecules of life that carry out most cellular processes. Since proteins aggregate to perform function, and since protein-protein interaction (PPI) networks model these aggregations, one would expect to uncover new biology from PPI network topology. Hence, using PPI networks to predict protein function and role of protein pathways in disease has received attention. A debate remains open about whether network properties of “biologically central (BC)” genes (i.e., their protein products), such as those involved in aging, cancer, infectious diseases, or signaling and drug-targeted pathways, exhibit some topological centrality compared to the rest of the proteins in the human PPI network

Public Library of Science (PLOS)

Directory of Open Access Journals

PubMed Central

UCL Discovery

eScholarship - University of California

GraphCrunch 2: Software tool for network modeling, alignment and clustering

Author: Hayes Wayne
Kuchaiev Oleksii
Pržulj Nataša
Stevanović Aleksandar
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Abstract Background Recent advancements in experimental biotechnology have produced large amounts of protein-protein interaction (PPI) data. The topology of PPI networks is believed to have a strong link to their function. Hence, the abundance of PPI data for many organisms stimulates the development of computational techniques for the modeling, comparison, alignment, and clustering of networks. In addition, finding representative models for PPI networks will improve our understanding of the cell just as a model of gravity has helped us understand planetary motion. To decide if a model is representative, we need quantitative comparisons of model networks to real ones. However, exact network comparison is computationally intractable and therefore several heuristics have been used instead. Some of these heuristics are easily computable "network properties," such as the degree distribution, or the clustering coefficient. An important special case of network comparison is the network alignment problem. Analogous to sequence alignment, this problem asks to find the "best" mapping between regions in two networks. It is expected that network alignment might have as strong an impact on our understanding of biology as sequence alignment has had. Topology-based clustering of nodes in PPI networks is another example of an important network analysis problem that can uncover relationships between interaction patterns and phenotype. Results We introduce the GraphCrunch 2 software tool, which addresses these problems. It is a significant extension of GraphCrunch which implements the most popular random network models and compares them with the data networks with respect to many network properties. Also, GraphCrunch 2 implements the GRAph ALigner algorithm ("GRAAL") for purely topological network alignment. GRAAL can align any pair of networks and exposes large, dense, contiguous regions of topological and functional similarities far larger than any other existing tool. Finally, GraphCruch 2 implements an algorithm for clustering nodes within a network based solely on their topological similarities. Using GraphCrunch 2, we demonstrate that eukaryotic and viral PPI networks may belong to different graph model families and show that topology-based clustering can reveal important functional similarities between proteins within yeast and human PPI networks. Conclusions GraphCrunch 2 is a software tool that implements the latest research on biological network analysis. It parallelizes computationally intensive tasks to fully utilize the potential of modern multi-core CPUs. It is open-source and freely available for research use. It runs under the Windows and Linux platforms.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

UCL Discovery

eScholarship - University of California

Human Cancer Protein-Protein Interaction Network: A Structural Perspective

Author: A Friedler
A Hamosh
A Patil
AA Bogan
AL Barabasi
AS Aytuna
Attila Gursoy
B Luo
BA Shoemaker
C Reynolds
CJ Tsai
D Dell'Orco
D Maglott
E Frank
E Gasteiger
E Guney
E Ravasz
G Dawelbait
GD Bader
GD Bader
Gozde Kar
H Jeong
H Yu
H Zhu
HM Berman
HY Chuang
I Xenarios
IM Nooren
J Schymkowitz
J Tormo
JD Han
JT Jones
K Sakai
KI Goh
L Lo Conte
LM Iakoucheva
M Girvan
M Higurashi
M Kanehisa
M Lee
MA Harris
MA Pujana
MG Kann
N Tuncbag
N Tuncbag
O Keskin
O Keskin
O Keskin
O Keskin
Ozlem Keskin
P Aloy
P Aloy
P Aloy
P Pagel
P Shannon
PA Futreal
PF Jonsson
PM Kim
R Aragues
R Aragues
R Guerois
R Sharan
RA Laskowski
RB Jones
RP Bahadur
Ruth Nussinov
S Coulomb
S Efroni
S Jones
S Jones
S Jones
S Liang
S Maere
S Maslov
S Wachi
SF Altschul
SJ Furney
SJ Hubbard
U Ogmen
W Humphrey
YJ Huang
Z Hu
Publication venue: Public Library of Science
Publication date: 01/12/2009
Field of study

Protein-protein interaction networks provide a global picture of cellular function and biological processes. Some proteins act as hub proteins, highly connected to others, whereas some others have few interactions. The dysfunction of some interactions causes many diseases, including cancer. Proteins interact through their interfaces. Therefore, studying the interface properties of cancer-related proteins will help explain their role in the interaction networks. Similar or overlapping binding sites should be used repeatedly in single interface hub proteins, making them promiscuous. Alternatively, multi-interface hub proteins make use of several distinct binding sites to bind to different partners. We propose a methodology to integrate protein interfaces into cancer interaction networks (ciSPIN, cancer structural protein interface network). The interactions in the human protein interaction network are replaced by interfaces, coming from either known or predicted complexes. We provide a detailed analysis of cancer related human protein-protein interfaces and the topological properties of the cancer network. The results reveal that cancer-related proteins have smaller, more planar, more charged and less hydrophobic binding sites than non-cancer proteins, which may indicate low affinity and high specificity of the cancer-related interactions. We also classified the genes in ciSPIN according to phenotypes. Within phenotypes, for breast cancer, colorectal cancer and leukemia, interface properties were found to be discriminating from non-cancer interfaces with an accuracy of 71%, 67%, 61%, respectively. In addition, cancer-related proteins tend to interact with their partners through distinct interfaces, corresponding mostly to multi-interface hubs, which comprise 56% of cancer-related proteins, and constituting the nodes with higher essentiality in the network (76%). We illustrate the interface related affinity properties of two cancer-related hub proteins: Erbb3, a multi interface, and Raf1, a single interface hub. The results reveal that affinity of interactions of the multi-interface hub tends to be higher than that of the single-interface hub. These findings might be important in obtaining new targets in cancer as well as finding the details of specific binding regions of putative cancer drug candidates

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Koç University Digital Collections

Integrative computational biology for cancer research

Author: Fortney Kristen
Jurisica Igor
Publication venue: Springer-Verlag
Publication date: 01/01/2011
Field of study

Over the past two decades, high-throughput (HTP) technologies such as microarrays and mass spectrometry have fundamentally changed clinical cancer research. They have revealed novel molecular markers of cancer subtypes, metastasis, and drug sensitivity and resistance. Some have been translated into the clinic as tools for early disease diagnosis, prognosis, and individualized treatment and response monitoring. Despite these successes, many challenges remain: HTP platforms are often noisy and suffer from false positives and false negatives; optimal analysis and successful validation require complex workflows; and great volumes of data are accumulating at a rapid pace. Here we discuss these challenges, and show how integrative computational biology can help diminish them by creating new software tools, analytical methods, and data standards

Springer - Publisher Connector

PubMed Central