27 research outputs found

Comparing biological networks via graph compression

Author: A Kocsor
AR Mushegian
BP Kelley
DJ Cook
H Morgan
H Ogata
J Yang
L Peshkin
M Adler
M Hayashida
M Kanehisa
M Li
M Zaslavskiy
Morihiro Hayashida
N Krasnogor
R Singh
RY Pinter
S Wernicke
T Ito
Tatsuya Akutsu
Y Tohsato
Z Li
Z Liang
Publication venue: BioMed Central
Publication date: 01/01/2010

Background: Comparison of various kinds of biological data is one of the main problems in bioinformatics and systems biology. Data compression methods have been applied to comparison of large sequence data and protein structure data. Since it is still difficult to compare global structures of large biological networks, it is reasonable to try to apply data compression methods to comparison of biological networks. In existing compression methods, the uniqueness of compression results is not guaranteed because there is some ambiguity in selection of overlapping edges. Results: This paper proposes novel efficient methods, CompressEdge and CompressVertices, for comparing large biological networks. In the proposed methods, an original network structure is compressed by iteratively contracting identical edges and sets of connected edges. Then, the similarity of two networks is measured by a compression ratio of the concatenated networks. The proposed methods are applied to comparison of metabolic networks of several organisms (H. sapiens, M. musculus, A. thaliana, D. melanogaster, C. elegans, E. coli, S. cerevisiae, and B. subtilis) and are compared with an existing method. These results suggest that our methods can efficiently measure the similarities between metabolic networks. Conclusions: Our proposed algorithms, which compress node-labeled networks, are useful for measuring the similarity of large biological networks.

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Kyoto University Research Information Repository

Retrieval, alignment, and clustering of computational models based on semantic annotations

Author: Becker J
Budanitsky A
Edda Klipp
Falko Krause
Fielding R
Henkel R
Jiang J
Liebermeister W
Lin D
Marvin Schulz
Nicolas Le Novère
Resnik P
Salton G
Salton G
Tohsato Y
van Rijsbergen C
Wolfram Liebermeister
Publication venue: Nature Publishing Group

As the number of computational systems biology models increases, new methods are needed to explore their content and build connections with experimental data. In this Perspective article, the authors propose a flexible semantic framework that can help achieve these aims.

Crossref

PubMed Central

Defining genes: a computational framework

Author: BO Palsson
Christian V. Forst
CV Forst
D Karolchik
David C. Krakauer
E Dicou
E Pennisi
G Berry
H Pearson
I Brigandt
JD Walton
K Scherrer
L Duret
MB Gerstein
MD Laubichler
MM Krem
Peter F. Stadler
RG Taylor
S Griffiths-Jones
SJ Prohaska
Sonja J. Prohaska
TR Gingeras
TS Furey
Y Tohsato
Publication venue: Springer-Verlag
Publication date: 01/01/2009

The precise elucidation of the gene concept has become the subject of intense discussion in light of results from several large, high-throughput surveys of transcriptomes and proteomes. In previous work, we proposed an approach for constructing gene concepts that combines genomic heritability with elements of function. Here, we introduce a definition of the gene within a computational framework of cellular interactions. The definition seeks to satisfy the practical requirements imposed by annotation, capture logical aspects of regulation, and encompass the evolutionary property of homology.

Crossref

Springer - Publisher Connector

Fraunhofer-ePrints

PubMed Central

Metabolic pathway alignment between species using a comprehensive and flexible similarity measure

Author: BP Kelley
CV Forst
D Croes
D Hwang
DA Fell
Dick de Ridder
E Ravasz
E Sandmeier
H Jeong
HW Ma
JC Clemente
JC Clemente
JD Hughes
JJ Díaz-Mejía
L Krishnamurthy
L Zhenping
LV Hedges
M Heymans
Marcel JT Reinders
Marco JL de Groot
Q Yang
R Guimerà
R Küffner
R Sharan
RY Pinter
S Goto
T Dandekar
The UniProt Consortium
Y Li
Y Tohsato
Yunlei Li
Publication venue: BioMed Central
Publication date: 01/01/2008

Comparative analysis of metabolic networks in multiple species yields important information on their evolution, and has great practical value in metabolic engineering, human disease analysis, drug design, etc. In this work, we aim to systematically search for conserved pathways in two species, quantify their similarities, and focus on the variations between them.

Electrical Engineering, Mathematics and Computer Scienc

Crossref

TU Delft Repository

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Semantic Similarity for Automatic Classification of Chemical Compounds

Author: A Mehta
AM Richard
B Chandrasekaran
C Cortes
C Pesquita
C Pesquita
C Pesquita
D Healy
DR Flower
Francisco M. Couto
FV So
G Lehne
GW Bemis
GW Bemis
H Wolosker
IH Witten
JD Amsterdam
JE Penzotti
John B. O. Mitchell
João D. Ferreira
JP Keogh
JW Raymond
JW Raymond
L Markiewicz
LG Ranilla
M Kanehisa
MF Ullah
N Nikolova
P Baldi
P De Matos
P Jaccard
P Resnik
P Willett
PW Lord
R Dias
R Gentleman
R Guha
R Mishra
RJ Miksicek
RM Harris
RSR Zand
S Doniger
SK Kearsley
SM Ross
SQ Le
T Grego
T Joachims
V Svetnik
W Tong
Y Fukunishi
Y Fukunishi
Y Tohsato
Y Xue
YC Martin
Publication venue: Public Library of Science
Publication date: 01/09/2010

With the increasing amount of data made available in the chemical field, there is a strong need for systems capable of comparing and classifying chemical compounds in an efficient and effective way. The best approaches existing today are based on the structure-activity relationship premise, which states that the biological activity of a molecule is strongly related to its structural or physicochemical properties. This work presents a novel approach to the automatic classification of chemical compounds by integrating semantic similarity with existing structural comparison methods. Our approach was assessed using the Matthews Correlation Coefficient of its predictions, and achieved values of 0.810 for blood-brain barrier permeability, 0.694 for P-glycoprotein substrate, and 0.673 for estrogen receptor binding activity. These results show a significant improvement over the currently existing methods, whose best performances were 0.628, 0.591, and 0.647 respectively. It was demonstrated that the integration of semantic similarity is a feasible and effective way to improve existing chemical compound classification systems. Among other possible uses, this tool helps the study of the evolution of metabolic pathways, the study of the correlation of metabolic networks with properties of those networks, or the improvement of ontologies that represent chemical information.
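
The abstract above reports its results as Matthews Correlation Coefficient (MCC) values. As a reminder of what that score measures, here is a minimal sketch of the standard MCC formula for a binary classifier, computed from confusion-matrix counts. The function name and the example counts are hypothetical and are not taken from the paper; returning 0.0 for a zero denominator is a common convention, assumed here.

```python
import math


def matthews_corrcoef(tp, tn, fp, fn):
    """Matthews Correlation Coefficient from binary confusion-matrix counts.

    Ranges from -1 (total disagreement) through 0 (no better than chance)
    to +1 (perfect prediction).
    """
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    if denom == 0:
        # Assumed convention: an undefined MCC is reported as 0.0.
        return 0.0
    return (tp * tn - fp * fn) / denom


if __name__ == "__main__":
    # Hypothetical counts, for illustration only.
    print(matthews_corrcoef(tp=45, tn=40, fp=5, fn=10))
```

Unlike plain accuracy, the score uses all four cells of the confusion matrix, so it stays informative when the two classes are unbalanced.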

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Propagating semantic information in biochemical network models

Author: A Levchenko
B Dost
BP Kelley
C Huang
E Nabieva
Edda Klipp
F Hynne
F Krause
G Salton
J Becker
J Gamalielsson
J Sevilla
K Degtyarenko
K Moutselos
M Goodfellow
M Hattori
M Hucka
M Kanehisa
M Schulz
M Schulz
Marvin Schulz
N Le Novère
N Le Novère
P Lord
Q Yang
R Pinter
R Randhawa
R Singh
S Gay
S Wernicke
T Shlomi
V Fionda
Wolfram Liebermeister
Y Tohsato
YT Wang
Publication venue: BioMed Central
Publication date: 01/01/2012

Background: To enable automatic searches, alignments, and model combination, the elements of systems biology models need to be compared and matched across models. Elements can be identified by machine-readable biological annotations, but assigning such annotations and matching non-annotated elements is tedious work and calls for automation. Results: A new method called "semantic propagation" allows the comparison of model elements based not only on their own annotations, but also on annotations of surrounding elements in the network. One may either propagate feature vectors, describing the annotations of individual elements, or quantitative similarities between elements from different models. Based on semantic propagation, we align partially annotated models and find annotations for non-annotated model elements. Conclusions: Semantic propagation and model alignment are included in the open-source library semanticSBML, available on sourceforge. Online services for model alignment and for annotation prediction can be used at http://www.semanticsbml.org.

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Optimized ancestral state reconstruction using Sankoff parsimony

Author: A Mazurie
AWF Edwards
B Kolaczkowski
BA Malcolm
BS Gaut
CV Forst
D Sankoff
D Sankoff
DA Shagin
DE Knuth
DL Swofford
DS Gladstein
F Ronquist
Gabriel Valiente
H Akashi
HW Ma
J Felsenstein
J Felsenstein
J Felsenstein
J Ma
J Wang
J Zhang
JC Clemente
José C Clemente
JP Huelsenbeck
JT Bridgham
JW Thornton
K Fan
Kazuho Ikeo
LR Murphy
M Cieplak
M Heymans
M Kanehisa
M Kimura
MK Kuhner
MS Waterman
N Saitou
NB Adey
NM Krishnan
PA Goloboff
PA Goloboff
PHA Sneath
RF Smith
T Tanaka
Takashi Gojobori
TH Jukes
WC Liu
WC Wheeler
WM Fitch
Y Inagaki
Y Tohsato
Z Jiang
ZS Yang
Publication venue: BioMed Central
Publication date: 01/01/2009

Background: Parsimony methods are widely used in molecular evolution to estimate the most plausible phylogeny for a set of characters. Sankoff parsimony determines the minimum number of changes required in a given phylogeny when a cost is associated to transitions between character states. Although optimizations exist to reduce the computations in the number of taxa, the original algorithm takes time O(n^2) in the number of states, making it impractical for large values of n. Results: In this study we introduce an optimization of Sankoff parsimony for the reconstruction of ancestral states when ultrametric or additive cost matrices are used. We analyzed its performance for randomly generated matrices, Jukes-Cantor and Kimura's two-parameter models of DNA evolution, and in the reconstruction of elongation factor-1α and ancestral metabolic states of a group of eukaryotes, showing that in all cases the execution time is significantly less than with the original implementation. Conclusion: The algorithms here presented provide a fast computation of Sankoff parsimony for a given phylogeny. Problems where the number of states is large, such as reconstruction of ancestral metabolism, are particularly adequate for this optimization. Since we are reducing the computations required to calculate the parsimony cost of a single tree, our method can be combined with optimizations in the number of taxa that aim at finding the most parsimonious tree.
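
For orientation, the baseline that the abstract above refers to can be sketched as follows. This is a minimal version of the classic Sankoff dynamic program, not the optimized algorithm the paper proposes for ultrametric or additive cost matrices. The tree encoding (nested tuples of leaf names), the function name, and the example data are all assumptions made for illustration. The inner minimum over child states for every parent state is where the O(n^2) cost per tree node comes from.

```python
INF = float("inf")


def sankoff_costs(tree, leaf_states, cost):
    """Return the per-state minimum cost vector at the root of `tree`.

    tree        -- a leaf name (str) or a tuple of subtrees (hypothetical encoding)
    leaf_states -- dict mapping each leaf name to its observed state index
    cost        -- n x n matrix, cost[s][t] = cost of changing state s into t
    """
    n = len(cost)

    def visit(node):
        if isinstance(node, str):
            # A leaf costs 0 in its observed state and is impossible otherwise.
            return [0 if s == leaf_states[node] else INF for s in range(n)]
        child_vectors = [visit(child) for child in node]
        # For each parent state s, add up the cheapest transition into each child.
        return [
            sum(min(cost[s][t] + vec[t] for t in range(n)) for vec in child_vectors)
            for s in range(n)
        ]

    return visit(tree)


if __name__ == "__main__":
    # Hypothetical example: four leaves, four states, unit cost per change.
    unit = [[0 if s == t else 1 for t in range(4)] for s in range(4)]
    tree = (("A", "B"), ("C", "D"))
    states = {"A": 0, "B": 1, "C": 0, "D": 0}
    root = sankoff_costs(tree, states, unit)
    print(root, "parsimony cost:", min(root))
```

The parsimony cost of the tree is the minimum entry of the root vector. Ancestral states can then be recovered by a top-down pass that records which child state achieved each minimum, a step omitted here for brevity.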

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Trends in life science grid: from computing grid to knowledge grid

Author: A Birnbaum
A Emerson
A Falzone
A Fukuzaki
A Jones
A Konagaya
A Konagaya
A Konagaya
A Konagaya
A Krishnan
A Krishnan
A Shahab
A Stell
Akihiko Konagaya
C Blanchet
D Sulakhe
E Bartocci
F Konishi
F Konishi
F Konishi
H Imade
H Lee
H Shimosaka
H Sugawara
I Nonaka
J Rajapakse
J Salzemann
J Seo
K Satou
K Satou
L Seitz
M Fato
M Hartzwood
M Pan
M Schroeder
M Sugimoto
M Taiji
N Cannata
N Jacq
N Zhang
P Arzberger
R Sinnott
R Sinnott
R Umetsu
S DAscia
S Kimura
S Kimura
S Loong
S Masuno
T Arbona
T Oinn
V Breton
W Li
W Li
Y Tohsato
Publication venue: BioMed Central
Publication date: 01/01/2006

BACKGROUND: Grid computing has great potential to become a standard cyberinfrastructure for the life sciences, which often require high-performance computing and large-scale data handling that exceed the computing capacity of a single institution. RESULTS: This survey reviews the latest grid technologies from the viewpoints of computing grid, data grid and knowledge grid. Computing grid technologies have matured enough to solve high-throughput real-world life science problems. Data grid technologies are strong candidates for realizing a "resourceome" for bioinformatics. Knowledge grids should be designed not only for sharing explicit knowledge on computers but also for community formation, so that tacit knowledge can be shared within a community. CONCLUSION: Extending the concept of grid from computing grid to knowledge grid, it is possible to make use of a grid not only as sharable computing resources, but also as a time and place in which people work together, create knowledge, and share knowledge and experiences in a community.

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

ENVIRONMENTAL DEPENDENCY OF GENE KNOCKOUTS ON PHENOTYPE MICROARRAY ANALYSIS IN ESCHERICHIA COLI

Author: Baba T.
BARRY L. WANNER
HIROTADA MORI
Kato J.
MASAHIRO ITO
Tohsato Y.
TOMOYA BABA
YUKAKO TOHSATO
YUSAKU MAZAKI
Publication venue: 'World Scientific Pub Co Pte Lt'

Crossref

SubMAP: Aligning Metabolic Pathways with Subnetwork Mappings

Author: Austrin P.
Ay F.
Berman P.
Clemente J.
Deutscher D.
Dost B.
Ferhat Ay
Garg A.
Grochow J.
Koyuturk M.
Koyuturk M.
Manolis Kellis
Saunders P.
Singh R.
Sridhar P.
Tamer Kahveci
Tohsato Y.
Tohsato Y.
Webb E.
Publication venue: 'Mary Ann Liebert Inc'
Publication date: 01/03/2011

We consider the problem of aligning two metabolic pathways. Unlike traditional approaches, we do not restrict the alignment to one-to-one mappings between the molecules (nodes) of the input pathways (graphs). We follow the observation that, in nature, different organisms can perform the same or similar functions through different sets of reactions and molecules. The number and the topology of the molecules in these alternative sets often vary from one organism to another. With the motivation that an accurate biological alignment should be able to reveal these functionally similar molecule sets across different species, we develop an algorithm that first measures the similarities between different nodes using a mixture of homology and topological similarity. We combine the two metrics by employing an eigenvalue formulation. We then search for an alignment between the two input pathways that maximizes a similarity score, evaluated as the sum of the similarities of the mapped subnetworks of size at most a given integer k, and also does not contain any conflicting mappings. Here we prove that this maximization is NP-hard by a reduction from the maximum weight independent set (MWIS) problem. We then convert our problem to an instance of MWIS and use an efficient vertex-selection strategy to extract the mappings that constitute our alignment. We name our algorithm SubMAP (Subnetwork Mappings in Alignment of Pathways). We evaluate its accuracy and performance on real datasets. Our empirical results demonstrate that SubMAP can identify biologically relevant mappings that are missed by traditional alignment methods. Furthermore, we observe that SubMAP is scalable for metabolic pathways of arbitrary topology, including searching for a query pathway of size 70 against the complete KEGG database of 1,842 pathways. Implementation in C++ is available at http://bioinformatics.cise.ufl.edu/SubMAP.html.

National Science Foundation (U.S.) (Grant CCF-0829867); National Science Foundation (U.S.) (Grant IIS-0845439)

DSpace@MIT

Crossref

PubMed Central