Search CORE

1,767 research outputs found

MorphDB : prioritizing genes for specialized metabolism pathways and gene ontology categories in plants

Author: Amar David
Diels Tim
Shamir Ron
Tzfadia Oren
Van de Peer Yves
Van Parys Thomas
Zwaenepoel Arthur
Publication venue: 'Frontiers Media SA'
Publication date: 01/01/2018
Field of study

Recent times have seen an enormous growth of "omics" data, of which high-throughput gene expression data are arguably the most important from a functional perspective. Despite huge improvements in computational techniques for the functional classification of gene sequences, common similarity-based methods often fall short of providing full and reliable functional information. Recently, the combination of comparative genomics with approaches in functional genomics has received considerable interest for gene function analysis, leveraging both gene expression based guilt-by-association methods and annotation efforts in closely related model organisms. Besides the identification of missing genes in pathways, these methods also typically enable the discovery of biological regulators (i.e., transcription factors or signaling genes). A previously built guilt-by-association method is MORPH, which was proven to be an efficient algorithm that performs particularly well in identifying and prioritizing missing genes in plant metabolic pathways. Here, we present MorphDB, a resource where MORPH-based candidate genes for large-scale functional annotations (Gene Ontology, MapMan bins) are integrated across multiple plant species. Besides a gene centric query utility, we present a comparative network approach that enables researchers to efficiently browse MORPH predictions across functional gene sets and species, facilitating efficient gene discovery and candidate gene prioritization. MorphDB is available at http://bioinformatics.psb.ugent.be/webtools/morphdb/morphDB/index/. We also provide a toolkit, named "MORPH bulk" (https://github.com/arzwa/morph-bulk), for running MORPH in bulk mode on novel data sets, enabling researchers to apply MORPH to their own species of interest

Ghent University Academic Bibliography

Frontiers - Publisher Connector

UPSpace at the University of Pretoria

NET-GE: a novel NETwork-based Gene Enrichment for detecting biological processes associated to Mendelian diseases

Author: Casadio Rita
Di Lena Pietro
Fariselli Piero
Martelli Pier Luigi
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2015
Field of study

Enrichment analysis is a widely applied procedure for shedding light on the molecular mechanisms and functions at the basis of phenotypes, for enlarging the dataset of possibly related genes/proteins and for helping interpretation and prioritization of newly determined variations. Several standard and Network-based enrichment methods are available. Both approaches rely on the annotations that characterize the genes/proteins included in the input set; network based ones also include in different ways physical and functional relationships among different genes or proteins that can be extracted from the available biological networks of interactions

PubMed Central

Archivio istituzionale della ricerca - Alma Mater Studiorum Università di Bologna

Archivio istituzionale della ricerca - Università di Padova

Improving disease gene prioritization using the semantic similarity of Gene Ontology terms

Author: Adie
Adie
Aerts
Ala
Altshuler
Andreas Schlicker
Ashburner
Berglund
Blake
Chatr-Aryamontri
Chen
Chen
Cho
Cordell
Feldman
Franke
Freudenberg
Gibson
Goh
Hubbard
Ideker
Jimenez-Sanchez
Kann
Kann
Kelso
Kerrien
Lage
Lee
Lin
Lowe
Mario Albrecht
Navlakha
O'Connor
Ortutay
Oti
Ozgür
Perez-Iratxeta
Perez-Iratxeta
Prasad
Reference Genome Group of the Gene Ontology Consortium
Robinson
Ruepp
Salwinski
Schlicker
Schlicker
Schreiber
Shriner
Smith
Teare
Thomas Lengauer
Tiffin
Tranchevent
Turner
UniProt Consortium
van Driel
van Driel
Velankar
Wu
Yilmaz
Yu
Publication venue: Oxford University Press
Publication date: 01/01/2010
Field of study

Motivation: Many hereditary human diseases are polygenic, resulting from sequence alterations in multiple genes. Genomic linkage and association studies are commonly performed for identifying disease-related genes. Such studies often yield lists of up to several hundred candidate genes, which have to be prioritized and validated further. Recent studies discovered that genes involved in phenotypically similar diseases are often functionally related on the molecular level

FunSimMat update: new features for exploring functional similarity

Author: Albrecht Mario
Schlicker Andreas
Publication venue: Oxford University Press
Publication date: 01/01/2010
Field of study

Quantifying the functional similarity of genes and their products based on Gene Ontology annotation is an important tool for diverse applications like the analysis of gene expression data, the prediction and validation of protein functions and interactions, and the prioritization of disease genes. The Functional Similarity Matrix (FunSimMat, http://www.funsimmat.de) is a comprehensive database providing various precomputed functional similarity values for proteins in UniProtKB and for protein families in Pfam and SMART. With this update, we significantly increase the coverage of FunSimMat by adding data from the Gene Ontology Annotation project as well as new functional similarity measures. The applicability of the database is greatly extended by the implementation of a new Gene Ontology-based method for disease gene prioritization. Two new visualization tools allow an interactive analysis of the functional relationships between proteins or protein families. This is enhanced further by the introduction of an automatically derived hierarchy of annotation classes. Additional changes include a revised user front-end and a new RESTlike interface for improving the user-friendliness and online accessibility of FunSimMat

CiteSeerX

PubMed Central

MPG.PuRe

Constructing a gene semantic similarity network for the inference of disease genes

Author: Gan Mingxin
He Peng
Jiang Rui
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Abstract Motivation The inference of genes that are truly associated with inherited human diseases from a set of candidates resulting from genetic linkage studies has been one of the most challenging tasks in human genetics. Although several computational approaches have been proposed to prioritize candidate genes relying on protein-protein interaction (PPI) networks, these methods can usually cover less than half of known human genes. Results We propose to rely on the biological process domain of the gene ontology to construct a gene semantic similarity network and then use the network to infer disease genes. We show that the constructed network covers about 50% more genes than a typical PPI network. By analyzing the gene semantic similarity network with the PPI network, we show that gene pairs tend to have higher semantic similarity scores if the corresponding proteins are closer to each other in the PPI network. By analyzing the gene semantic similarity network with a phenotype similarity network, we show that semantic similarity scores of genes associated with similar diseases are significantly different from those of genes selected at random, and that genes with higher semantic similarity scores tend to be associated with diseases with higher phenotype similarity scores. We further use the gene semantic similarity network with a random walk with restart model to infer disease genes. Through a series of large-scale leave-one-out cross-validation experiments, we show that the gene semantic similarity network can achieve not only higher coverage but also higher accuracy than the PPI network in the inference of disease genes. Contact <email>[email protected]</email></p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

ToppGene Suite for gene list enrichment analysis and candidate gene prioritization

Author: A. G. Jegga
Adie
Aerts
B. J. Aronow
Bader
Barrett
Berger
Chen
Chen
Clarke
Dhandapany
E. E. Bardes
Fisher
Franke
Freudenberg
Hunt
J. Chen
Jimenez-Sanchez
Junker
Kohler
Peri
Rual
Stelzl
Thornblad
Tiffin
Tiffin
Turner
van Bokhoven
Villani
Wu
Zhu
Publication venue: Oxford University Press
Publication date: 01/01/2009
Field of study

ToppGene Suite (http://toppgene.cchmc.org; this web site is free and open to all users and does not require a login to access) is a one-stop portal for (i) gene list functional enrichment, (ii) candidate gene prioritization using either functional annotations or network analysis and (iii) identification and prioritization of novel disease candidate genes in the interactome. Functional annotation-based disease candidate gene prioritization uses a fuzzy-based similarity measure to compute the similarity between any two genes based on semantic annotations. The similarity scores from individual features are combined into an overall score using statistical meta-analysis. A P-value of each annotation of a test gene is derived by random sampling of the whole genome. The protein–protein interaction network (PPIN)-based disease candidate gene prioritization uses social and Web networks analysis algorithms (extended versions of the PageRank and HITS algorithms, and the K-Step Markov method). We demonstrate the utility of ToppGene Suite using 20 recently reported GWAS-based gene–disease associations (including novel disease genes) representing five diseases. ToppGene ranked 19 of 20 (95%) candidate genes within the top 20%, while ToppNet ranked 12 of 16 (75%) candidate genes among the top 20%

CiteSeerX

Crossref

PubMed Central