1,105,878 research outputs found
WikiPathways: building research communities on biological pathways.
Here, we describe the development of WikiPathways (http://www.wikipathways.org), a public wiki for pathway curation, since it was first published in 2008. New features are discussed, as well as developments in the community of contributors. New features include a zoomable pathway viewer, support for pathway ontology annotations, the ability to mark pathways as private for a limited time and the availability of stable hyperlinks to pathways and the elements therein. WikiPathways content is freely available in a variety of formats such as the BioPAX standard, and the content is increasingly adopted by external databases and tools, including Wikipedia. A recent development is the use of WikiPathways as a staging ground for centrally curated databases such as Reactome. WikiPathways is seeing steady growth in the number of users, page views and edits for each pathway. To assess whether the community curation experiment can be considered successful, here we analyze the relation between use and contribution, which gives results in line with other wiki projects. The novel use of pathway pages as supplementary material to publications, as well as the addition of tailored content for research domains, is expected to stimulate growth further
Reactome - a knowledgebase of human biological pathways
Pathway curation is a powerful tool for systematically associating gene products with functions. Reactome (www.reactome.org) is a manually curated human pathway knowledgebase describing a wide range of biological processes in a computationally accessible manner. The core unit of the Reactome data model is the Reaction, whose instances form a network of biological interactions through entities that are consumed, produced, or act as catalysts. Entities are distinguished by their molecular identities and cellular locations. Set objects allow grouping of related entities. Curation is based on communication between expert authors and staff curators, facilitated by freely available data entry tools. Manually curated data are subjected to quality control and peer review by a second expert. Reactome data are released quarterly. At release time, electronic orthology inference performed on human data produces reaction predictions in 22 species ranging from mouse to bacteria. Cross-references to a large number of publicly available databases are attached, providing multiple entry points into the database. The Reactome Mart allows query submission and data retrieval from Reactome and across other databases. The SkyPainter tool provides visualization and statistical analysis of user supplied data, e.g. from microarray experiments. Reactome data are freely available in a number of data formats (e.g. BioPax, SBML)
PathExpand: Extending biological pathways using molecular interaction networks
We present a methodology for extending pre-defined protein sets representing cellular pathways and processes by mapping them onto a protein-protein interaction network, and extending them to include densely interconnected interaction partners. The added proteins display distinctive network topological features and molecular function annotations, and can be proposed as putative new components, and/or as regulators of the communication between the different cellular processes. Finally, these extended pathways and processes are used to analyze their enrichment in cancer mutated genes. Significant associations between mutated genes and certain processes are identified, enabling an analysis of the influence of previously non-annotated cancer mutated genes
Perturbation Detection Through Modeling of Gene Expression on a Latent Biological Pathway Network: A Bayesian hierarchical approach
Cellular response to a perturbation is the result of a dynamic system of
biological variables linked in a complex network. A major challenge in drug and
disease studies is identifying the key factors of a biological network that are
essential in determining the cell's fate.
Here our goal is the identification of perturbed pathways from
high-throughput gene expression data. We develop a three-level hierarchical
model, where (i) the first level captures the relationship between gene
expression and biological pathways using confirmatory factor analysis, (ii) the
second level models the behavior within an underlying network of pathways
induced by an unknown perturbation using a conditional autoregressive model,
and (iii) the third level is a spike-and-slab prior on the perturbations. We
then identify perturbations through posterior-based variable selection.
We illustrate our approach using gene transcription drug perturbation
profiles from the DREAM7 drug sensitivity predication challenge data set. Our
proposed method identified regulatory pathways that are known to play a
causative role and that were not readily resolved using gene set enrichment
analysis or exploratory factor models. Simulation results are presented
assessing the performance of this model relative to a network-free variant and
its robustness to inaccuracies in biological databases
Sirtuin functions and modulation: from chemistry to the clinic
Sirtuins are NAD+
-dependent histone deacetylases regulating important metabolic pathways in prokaryotes and
eukaryotes and are involved in many biological processes such as cell survival, senescence, proliferation, apoptosis,
DNA repair, cell metabolism, and caloric restriction. The seven members of this family of enzymes are considered
potential targets for the treatment of human pathologies including neurodegenerative diseases, cardiovascular
diseases, and cancer. Furthermore, recent interest focusing on sirtuin modulators as epigenetic players in the
regulation of fundamental biological pathways has prompted increased efforts to discover new small molecules
able to modify sirtuin activity. Here, we review the role, mechanism of action, and biological function of the
seven sirtuins, as well as their inhibitors and activators
Extracting Functional Modules from Biological Pathways
It has been proposed that functional modules are the fundamental units of cellular function. Methods to identify these modules have thus far relied on gene expression data or protein-protein interaction (PPI) data, but have a few limitations. We propose a new method, using biological pathway data to identify functional modules, that can potentially overcome these limitations. We also construct a network of these modules using functionally relevant PPI data. This network displays the flow and integration of information between modules and can be used to map cellular function
Representing and analysing molecular and cellular function in the computer
Determining the biological function of a myriad of genes, and understanding how they interact to yield a living cell, is the major challenge of the post genome-sequencing era. The complexity of biological systems is such that this cannot be envisaged without the help of powerful computer systems capable of representing and analysing the intricate networks of physical and functional interactions between the different cellular components. In this review we try to provide the reader with an appreciation of where we stand in this regard. We discuss some of the inherent problems in describing the different facets of biological function, give an overview of how information on function is currently represented in the major biological databases, and describe different systems for organising and categorising the functions of gene products. In a second part, we present a new general data model, currently under development, which describes information on molecular function and cellular processes in a rigorous manner. The model is capable of representing a large variety of biochemical processes, including metabolic pathways, regulation of gene expression and signal transduction. It also incorporates taxonomies for categorising molecular entities, interactions and processes, and it offers means of viewing the information at different levels of resolution, and dealing with incomplete knowledge. The data model has been implemented in the database on protein function and cellular processes 'aMAZE' (http://www.ebi.ac.uk/research/pfbp/), which presently covers metabolic pathways and their regulation. Several tools for querying, displaying, and performing analyses on such pathways are briefly described in order to illustrate the practical applications enabled by the model
- …
