Search CORE

289 research outputs found

Predicting protein function via downward random walks on a gene ontology

Author
Publication venue: BioMed Central
Publication date: 27/08/2015
Field of study

Benchmark datasets for biomedical knowledge graphs with negative statements

Author: Pesquita Catia
Silva Sara
Sousa Rita T.
Publication venue
Publication date: 21/07/2023
Field of study

Knowledge graphs represent facts about real-world entities. Most of these facts are defined as positive statements. The negative statements are scarce but highly relevant under the open-world assumption. Furthermore, they have been demonstrated to improve the performance of several applications, namely in the biomedical domain. However, no benchmark dataset supports the evaluation of the methods that consider these negative statements. We present a collection of datasets for three relation prediction tasks - protein-protein interaction prediction, gene-disease association prediction and disease prediction - that aim at circumventing the difficulties in building benchmarks for knowledge graphs with negative statements. These datasets include data from two successful biomedical ontologies, Gene Ontology and Human Phenotype Ontology, enriched with negative statements. We also generate knowledge graph embeddings for each dataset with two popular path-based methods and evaluate the performance in each task. The results show that the negative statements can improve the performance of knowledge graph embeddings

arXiv.org e-Print Archive

Graph Representation Learning in Biomedicine

Author: Huang Kexin
Li Michelle M.
Zitnik Marinka
Publication venue
Publication date: 05/11/2021
Field of study

Biomedical networks are universal descriptors of systems of interacting elements, from protein interactions to disease networks, all the way to healthcare systems and scientific knowledge. With the remarkable success of representation learning in providing powerful predictions and insights, we have witnessed a rapid expansion of representation learning techniques into modeling, analyzing, and learning with such networks. In this review, we put forward an observation that long-standing principles of networks in biology and medicine -- while often unspoken in machine learning research -- can provide the conceptual grounding for representation learning, explain its current successes and limitations, and inform future advances. We synthesize a spectrum of algorithmic approaches that, at their core, leverage graph topology to embed networks into compact vector spaces, and capture the breadth of ways in which representation learning is proving useful. Areas of profound impact include identifying variants underlying complex traits, disentangling behaviors of single cells and their effects on health, assisting in diagnosis and treatment of patients, and developing safe and effective medicines

arXiv.org e-Print Archive

Interspecies gene function prediction using semantic similarity

Author
Publication venue: BioMed Central
Publication date: 23/12/2016
Field of study

Springer - Publisher Connector

Computational approaches to identify genetic interactions for cancer therapeutics

Author: Benstead-Hume Graeme
Pearl Frances M G
Wooller Sarah
Publication venue: 'Walter de Gruyter GmbH'
Publication date: 01/09/2017
Field of study

The development of improved cancer therapies is frequently cited as an urgent unmet medical need. Here we describe how genetic interactions are being therapeutically exploited to identify novel targeted treatments for cancer. We discuss the current methodologies that use ‘omics data to identify genetic interactions, in particular focusing on synthetic sickness lethality (SSL) and synthetic dosage lethality (SDL). We describe the experimen- tal and computational approaches undertaken both in humans and model organisms to identify these interac- tions. Finally we discuss some of the identified targets with licensed drugs, inhibitors in clinical trials or with compounds under development

Directory of Open Access Journals

Sussex Research Online

Interspecies gene function prediction using semantic similarity

Author: A Benso
A Holzinger
A Mitrofanova
A Schlicker
BM Good
C Pesquita
C Pesquita
CL Myers
D Lee
FM Couto
G Valentini
G Yu
G Yu
G Yu
G Yu
G Yu
GO Consortium
Guangyuan Fu
Guoxian Yu
H Yang
J Demsar
J Wu
JL Sevilla
Jun Wang
JZ Wang
L Wilcoxon
M Ashburner
M Cao
M Mistry
ML Zhang
MS Alexandra
MW Hahn
N Cesa-Bianchi
OD King
P Legrain
P Radivojac
PD Thomas
PH Guzzi
PW Lord
Q Zou
Q Zou
R Rada
R Sharan
RJ Roberts
S Mostafavi
SY Rhee
Wei Luo
X Zeng
Y Loewenstein
Y Tao
Z Barutcuoglu
Z Teng
Z Wu
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Computational Labeling, Partitioning, and Balancing of Molecular Networks

Author: Jiang Biaobin
Publication venue: 'Purdue University (bepress)'
Publication date: 01/01/2016
Field of study

Recent advances in high throughput techniques enable large-scale molecular quantification with high accuracy, including mRNAs, proteins and metabolites. Differential expression of these molecules in case and control samples provides a way to select phenotype-associated molecules with statistically significant changes. However, given the significance ranking list of molecular changes, how those molecules work together to drive phenotype formation is still unclear. In particular, the changes in molecular quantities are insufficient to interpret the changes in their functional behavior. My study is aimed at answering this question by integrating molecular network data to systematically model and estimate the changes of molecular functional behaviors. We build three computational models to label, partition, and balance molecular networks using modern machine learning techniques. (1) Due to the incompleteness of protein functional annotation, we develop AptRank, an adaptive PageRank model for protein function prediction on bilayer networks. By integrating Gene Ontology (GO) hierarchy with protein-protein interaction network, our AptRank outperforms four state-of-the-art methods in a comprehensive evaluation using benchmark datasets. (2) We next extend our AptRank into a network partitioning method, BioSweeper, to identify functional network modules in which molecules share similar functions and also densely connect to each other. Compared to traditional network partitioning methods using only network connections, BioSweeper, which integrates the GO hierarchy, can automatically identify functionally enriched network modules. (3) Finally, we conduct a differential interaction analysis, namely difFBA, on protein-protein interaction networks by simulating protein fluxes using flux balance analysis (FBA). We test difFBA using quantitative proteomic data from colon cancer, and demonstrate that difFBA offers more insights into functional changes in molecular behavior than does protein quantity changes alone. We conclude that our integrative network model increases the observational dimensions of complex biological systems, and enables us to more deeply understand the causal relationships between genotypes and phenotypes

Purdue E-Pubs