Search CORE

54 research outputs found

Graph-based analysis and visualization of experimental results with ONDEX

Author: Baumbach J.
Koehler J.
Philippi S.
Rawlings C. J.
Ruegg A.
Skusa A.
Specht M.
Taubert J.
Verrier P. J.
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/01/2006
Field of study

Motivation: Assembling the relevant information needed to interpret the output from high-throughput, genome scale, experiments such as gene expression microarrays is challenging. Analysis reveals genes that show statistically significant changes in expression levels, but more information is needed to determine their biological relevance. The challenge is to bring these genes together with biological information distributed across hundreds of databases or buried in the scientific literature (millions of articles). Software tools are needed to automate this task which at present is labor-intensive and requires considerable informatics and biological expertise. Results: This article describes ONDEX and how it can be applied to the task of interpreting gene expression results. ONDEX is a database system that combines the features of semantic database integration and text mining with methods for graph-based analysis. An overview of the ONDEX system is presented, concentrating on recently developed features for graph-based analysis and visualization. A case study is used to show how ONDEX can help to identify causal relationships between stress response genes and metabolic pathways from gene expression data. ONDEX also discovered functional annotations for most of the genes that emerged as significant in the microarray experiment, but were previously of unknown function

Rothamsted Repository

BacillOndex: An Integrated Data Resource for Systems and Synthetic Biology

Author: Allenby Nick
Hallinan Jennifer S.
James Katherine
Misirli Goksel
Mullen Joseph
Pocock Matthew
Smith Wendy
Wipat Anil
Publication venue: 'Walter de Gruyter GmbH'
Publication date: 01/01/2013
Field of study

BacillOndex is an extension of the Ondex data integration system, providing a semantically annotated, integrated knowledge base for the model Gram-positive bacterium Bacillus subtilis. This application allows a user to mine a variety of B. subtilis data sources, and analyse the resulting integrated dataset, which contains data about genes, gene products and their interactions. The data can be analysed either manually, by browsing using Ondex, or computationally via a Web services interface. We describe the process of creating a BacillOndex instance, and describe the use of the system for the analysis of single nucleotide polymorphisms in B. subtilis Marburg. The Marburg strain is the progenitor of the widely-used laboratory strain B. subtilis 168. We identified 27 SNPs with predictable phenotypic effects, including genetic traits for known phenotypes. We conclude that BacillOndex is a valuable tool for the systems-level investigation of, and hypothesis generation about, this important biotechnology workhorse. Such understanding contributes to our ability to construct synthetic genetic circuits in this organism

Keele Research Repository

Northumbria University Research Portal

Crossref

Macquarie University ResearchOnline

PHI-base update: additions to the pathogen–host interaction database

Author: A. Beacham
Baldwin
C. Rawlings
DiGuistini
H. Hansen
J. Kohler
Jeon
K. E. Hammond-Kosack
Lindeberg
M. Lindeberg
M. Urban
Philippi
R. Winnenburg
S. Holland
T. K. Baldwin
Tunlid
Winnenburg
Publication venue: Oxford University Press
Publication date: 01/01/2008
Field of study

The pathogen–host interaction database (PHI-base) is a web-accessible database that catalogues experimentally verified pathogenicity, virulence and effector genes from bacterial, fungal and Oomycete pathogens, which infect human, animal, plant, insect, fish and fungal hosts. Plant endophytes are also included. PHI-base is therefore an invaluable resource for the discovery of genes in medically and agronomically important pathogens, which may be potential targets for chemical intervention. The database is freely accessible to both academic and non-academic users. This publication describes recent additions to the database and both current and future applications. The number of fields that characterize PHI-base entries has almost doubled. Important additional fields deal with new experimental methods, strain information, pathogenicity islands and external references that link the database to external resources, for example, gene ontology terms and Locus IDs. Another important addition is the inclusion of anti-infectives and their target genes that makes it possible to predict the compounds, that may interact with newly identified virulence factors. In parallel, the curation process has been improved and now involves several external experts. On the technical side, several new search tools have been provided and the database is also now distributed in XML format. PHI-base is available at: http://www.phi-base.org/

Crossref

PubMed Central

Munin - Open Research Archive

NORA - Norwegian Open Research Archives

Rothamsted Repository

Bayesian integration of networks without gold standards

Author: Anil Wipat
Bader
Braun
Breitkreutz
Cerami
Chatr-aryamontri
Cheung
Darren J. Wilkinson
Eisenberg
Gelfand
Guldener
Hermjakob
James
Jennifer Hallinan
Jeong
Jochen Weile
Kass
Katherine James
Kerrien
Koehler
Lee
Lycett
Phillip Lord
Simon J. Cockell
Smith
Stein
Troyanskaya
Venkatesan
von Mering
Publication venue: Oxford University Press
Publication date: 01/01/2012
Field of study

Motivation: Biological experiments give insight into networks of processes inside a cell, but are subject to error and uncertainty. However, due to the overlap between the large number of experiments reported in public databases it is possible to assess the chances of individual observations being correct. In order to do so, existing methods rely on high-quality ‘gold standard’ reference networks, but such reference networks are not always available

Crossref

PubMed Central

Macquarie University ResearchOnline

The potential of text mining in data integration and network biology for plant research : a case study on Arabidopsis

Author: De Bodt Stefanie
Drebert Zuzanna
Inzé Dirk
Van de Peer Yves
Van Landeghem Sofie
Publication venue: 'American Society of Plant Biologists (ASPB)'
Publication date: 01/01/2013
Field of study

Despite the availability of various data repositories for plant research, a wealth of information currently remains hidden within the biomolecular literature. Text mining provides the necessary means to retrieve these data through automated processing of texts. However, only recently has advanced text mining methodology been implemented with sufficient computational power to process texts at a large scale. In this study, we assess the potential of large-scale text mining for plant biology research in general and for network biology in particular using a state-of-the-art text mining system applied to all PubMed abstracts and PubMed Central full texts. We present extensive evaluation of the textual data for Arabidopsis thaliana, assessing the overall accuracy of this new resource for usage in plant network analyses. Furthermore, we combine text mining information with both protein-protein and regulatory interactions from experimental databases. Clusters of tightly connected genes are delineated from the resulting network, illustrating how such an integrative approach is essential to grasp the current knowledge available for Arabidopsis and to uncover gene information through guilt by association. All large-scale data sets, as well as the manually curated textual data, are made publicly available, hereby stimulating the application of text mining data in future plant biology studies

Ghent University Academic Bibliography

PubMed Central

Arena3D: visualization of biological networks in 3D

Author: O'Donoghue Seán I
Pafilis Evangelos
Pavlopoulos Georgios A
Satagopam Venkata P
Schneider Reinhard
Soldatos Theodoros G
Publication venue: BioMed Central
Publication date: 01/01/2008
Field of study

Abstract Background Complexity is a key problem when visualizing biological networks; as the number of entities increases, most graphical views become incomprehensible. Our goal is to enable many thousands of entities to be visualized meaningfully and with high performance. Results We present a new visualization tool, Arena3D, which introduces a new concept of staggered layers in 3D space. Related data – such as proteins, chemicals, or pathways – can be grouped onto separate layers and arranged via layout algorithms, such as Fruchterman-Reingold, distance geometry, and a novel hierarchical layout. Data on a layer can be clustered via k-means, affinity propagation, Markov clustering, neighbor joining, tree clustering, or UPGMA ('unweighted pair-group method with arithmetic mean'). A simple input format defines the name and URL for each node, and defines connections or similarity scores between pairs of nodes. The use of Arena3D is illustrated with datasets related to Huntington's disease. Conclusion Arena3D is a user friendly visualization tool that is able to visualize biological or any other network in 3D space. It is free for academic use and runs on any platform. It can be downloaded or lunched directly from <url>http://arena3d.org</url>. Java3D library and Java 1.5 need to be pre-installed for the software to run.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

UNSWorks

Open Repository and Bibliography - Luxembourg

IntegromeDB: an integrated system and biological search engine

Author: Baitaluk Michael
Dubinina Yulia
Kozhenkov Sergey
Ponomarenko Julia
Publication venue: BioMed Central
Publication date: 01/01/2012
Field of study

Abstract Background With the growth of biological data in volume and heterogeneity, web search engines become key tools for researchers. However, general-purpose search engines are not specialized for the search of biological data. Description Here, we present an approach at developing a biological web search engine based on the Semantic Web technologies and demonstrate its implementation for retrieving gene- and protein-centered knowledge. The engine is available at http://www.integromedb.org. Conclusions The IntegromeDB search engine allows scanning data on gene regulation, gene expression, protein-protein interactions, pathways, metagenomics, mutations, diseases, and other gene- and protein-related data that are automatically retrieved from publicly available databases and web pages using biological ontologies. To perfect the resource design and usability, we welcome and encourage community feedback

Crossref

Springer - Publisher Connector

PubMed Central

eScholarship - University of California

WebGIVI: a web-based gene enrichment analysis and visualization tool

Author: A Bateman
A Mitchell
A. S. M. Ashique Mahmood
B Breitkreutz
Carl J. Schmidt
Catalina O. Tudor
CO Tudor
D Croft
DW Huang
DW Huang
E Eden
F Iragne
GA Pavlopoulos
GA Salazar
H Ogata
J Köhler
JA Blake
Jia Ren
Jian Chen
K. Vijay-Shanker
L Sun
Liang Sun
M Bostock
P Shannon
PJ Kersey
Q Zheng
SD Hooper
TC Freeman
Yongnan Zhu
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Customizable views on semantically integrated networks for systems biology

Author: Achard
Addinall
Altschul
Andrews
Anil Wipat
Ashburner
Balaji
Blasco
Brinkley
Bruckmann
Chaudhri
Cherry
Cheung
Cheung
Darren Wilkinson
David Lydall
Elbing
Eva−Maria Holstein
Garvik
Gavin
Haider
Heimbigner
Herrgard
James M. Dewar
Jennifer Hallinan
Jiang
Jochen Weile
Keseler
Kohler
Krogan
Lin
Longhese
Matthew Pocock
McBride
Nugent
Phillip Lord
Prlić
Ptacek
Sandell
Schmidt
Schwartz
Shannon
Simon J. Cockell
Smith
Spellman
Stark
Stein
Tanay
Taubert
Tong
Tzivion
Usui
Vincent
Weinert
Publication venue: Oxford University Press
Publication date: 01/01/2011
Field of study

Motivation: The rise of high-throughput technologies in the post-genomic era has led to the production of large amounts of biological data. Many of these datasets are freely available on the Internet. Making optimal use of these data is a significant challenge for bioinformaticians. Various strategies for integrating data have been proposed to address this challenge. One of the most promising approaches is the development of semantically rich integrated datasets. Although well suited to computational manipulation, such integrated datasets are typically too large and complex for easy visualization and interactive exploration

Crossref

PubMed Central

Macquarie University ResearchOnline

cPath: open source software for collecting, storing, and querying biological pathways

Author: A Birkland
A Bouchie
A Zanzoni
Benjamin E Gross
C Sander
CF Schaefer
Chris Sander
CM Lloyd
D Hanahan
EM Zdobnov
Ethan G Cerami
F Campagne
F Iragne
G Joshi-Tope
Gary D Bader
GD Bader
H Hermjakob
H Hermjakob
H Kitano
H Ogata
I Xenarios
J Kohler
KH Buetow
L Salwinski
L Stein
LD Stein
M Hucka
M Kanehisa
MP Cary
N Le Novere
N Le Novere
P Nurse
P Shannon
PD Karp
PD Karp
PJ Kersey
R Aragues
RT Fielding
S Peri
SP Shah
T Ideker
T Ideker
WC Hahn
Publication venue: BioMed Central
Publication date: 01/01/2006
Field of study

BACKGROUND: Biological pathways, including metabolic pathways, protein interaction networks, signal transduction pathways, and gene regulatory networks, are currently represented in over 220 diverse databases. These data are crucial for the study of specific biological processes, including human diseases. Standard exchange formats for pathway information, such as BioPAX, CellML, SBML and PSI-MI, enable convenient collection of this data for biological research, but mechanisms for common storage and communication are required. RESULTS: We have developed cPath, an open source database and web application for collecting, storing, and querying biological pathway data. cPath makes it easy to aggregate custom pathway data sets available in standard exchange formats from multiple databases, present pathway data to biologists via a customizable web interface, and export pathway data via a web service to third-party software, such as Cytoscape, for visualization and analysis. cPath is software only, and does not include new pathway information. Key features include: a built-in identifier mapping service for linking identical interactors and linking to external resources; built-in support for PSI-MI and BioPAX standard pathway exchange formats; a web service interface for searching and retrieving pathway data sets; and thorough documentation. The cPath software is freely available under the LGPL open source license for academic and commercial use. CONCLUSION: cPath is a robust, scalable, modular, professional-grade software platform for collecting, storing, and querying biological pathways. It can serve as the core data handling component in information systems for pathway visualization, analysis and modeling

University of Toronto Research Repository

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central