Search CORE

347 research outputs found

The Ontology Lookup Service: more data and better tools for controlled vocabulary queries

Author: Cote
H. Hermjakob
Hull
L. Martens
Orchard
P. Jones
R. Apweiler
R. G. Cote
Smith
Publication venue: Oxford University Press
Publication date: 01/01/2008
Field of study

The Ontology Lookup Service (OLS) (http://www.ebi.ac.uk/ols) provides interactive and programmatic interfaces to query, browse and navigate an ever increasing number of biomedical ontologies and controlled vocabularies. The volume of data available for querying has more than quadrupled since it went into production and OLS functionality has been integrated into several high-usage databases and data entry tools. Improvements have been made to both OLS query interfaces, based on user feedback and requirements, to improve usability and service interoperability and provide novel ways to perform queries

CiteSeerX

Crossref

Ghent University Academic Bibliography

PubMed Central

ComplexViewer: visualization of curated macromolecular complexes.

Author: Balakrishnan
Birgit H M Meldal
Colin W Combe
Combe
Gos Micklem
Henning Hermjakob
Hermjakob
Jonathan Wren
Joshua Heimbach
Juri Rappsilber
Kerrien
Kerrien
Marine (Dumousseau) Sivade
Meldal
Orchard
Orchard
Sandra Orchard
Smith
Publication venue: Bioinformatics
Publication date: 03/08/2017
Field of study

SUMMARY: Proteins frequently function as parts of complexes, assemblages of multiple proteins and other biomolecules, yet network visualizations usually only show proteins as parts of binary interactions. ComplexViewer visualizes interactions with more than two participants and thereby avoids the need to first expand these into multiple binary interactions. Furthermore, if binding regions between molecules are known then these can be displayed in the context of the larger complex. AVAILABILITY AND IMPLEMENTATION: freely available under Apache version 2 license; EMBL-EBI Complex Portal: http://www.ebi.ac.uk/complexportal; Source code: https://github.com/MICommunity/ComplexViewer; Package: https://www.npmjs.com/package/complexviewer; http://biojs.io/d/complexviewer. Language: JavaScript; Web technology: Scalable Vector Graphics; Libraries: D3.js. CONTACT: [email protected] or [email protected]

Crossref

Edinburgh Research Explorer

Apollo (Cambridge)

The Reactome BioMart

Author: Croft D.
D'Eustachio P.
Haw R. A.
Hermjakob H.
Ndegwa N.
Stein L. D.
Yung C. K.
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/01/2011
Field of study

Reactome is an open source, expert-authored, manually curated and peer-reviewed database of reactions, pathways and biological processes. We provide an intuitive web-based user interface to pathway knowledge and a suite of data analysis tools. The Reactome BioMart provides biologists and bioinformaticians with a single web interface for performing simple or elaborate queries of the Reactome database, aggregating data from different sources and providing an opportunity to integrate experimental and computational results with information relating to biological pathways. Database URL: http://www.reactome.org

Cold Spring Harbor Laboratory Institutional Repository

PubMed Central

ISPIDER Central: an integrated database web-server for proteomics

Author: Belhajjame K.
Côté R.
Embury S.M.
Goble C.A.
Hermjakob H.
Hubbard S.J.
Jones D.T.
Jones P.
Martin Nigel
Oliver S.G.
Orengo C.A.
Paton N.W.
Pentony M.M.
Poulovassilis Alexandra
Selley J.N.
Siepen J.A.
Stevens R.
Zamboulis Lucas
Publication venue: 'Oxford University Press (OUP)'
Publication date: 25/04/2008
Field of study

Despite the growing volumes of proteomic data, integration of the underlying results remains problematic owing to differences in formats, data captured, protein accessions and services available from the individual repositories. To address this, we present the ISPIDER Central Proteomic Database search (http://www.ispider.manchester.ac.uk/cgi-bin/ProteomicSearch.pl), an integration service offering novel search capabilities over leading, mature, proteomic repositories including PRoteomics IDEntifications database (PRIDE), PepSeeker, PeptideAtlas and the Global Proteome Machine. It enables users to search for proteins and peptides that have been characterised in mass spectrometry-based proteomics experiments from different groups, stored in different databases, and view the collated results with specialist viewers/clients. In order to overcome limitations imposed by the great variability in protein accessions used by individual laboratories, the European Bioinformatics Institute's Protein Identifier Cross-Reference (PICR) service is used to resolve accessions from different sequence repositories. Custom-built clients allow users to view peptide/protein identifications in different contexts from multiple experiments and repositories, as well as integration with the Dasty2 client supporting any annotations available from Distributed Annotation System servers. Further information on the protein hits may also be added via external web services able to take a protein as input. This web server offers the first truly integrated access to proteomics repositories and provides a unique service to biologists interested in mass spectrometry-based proteomics

PubMed Central

Birkbeck Institutional Research Online

The Australian National University

The University of Manchester - Institutional Repository

Development of data representation standards by the human proteome organization proteomics standards initiative.

Author: Albar J.P.
Binz P.A.
Deutsch E.W.
Eisenacher M.
Hermjakob H.
Jones A.R.
Mayer G.
Omenn G.S.
Orchard S.
Vizcaíno J.A.
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/01/2015
Field of study

OBJECTIVE: To describe the goals of the Proteomics Standards Initiative (PSI) of the Human Proteome Organization, the methods that the PSI has employed to create data standards, the resulting output of the PSI, lessons learned from the PSI's evolution, and future directions and synergies for the group. MATERIALS AND METHODS: The PSI has 5 categories of deliverables that have guided the group. These are minimum information guidelines, data formats, controlled vocabularies, resources and software tools, and dissemination activities. These deliverables are produced via the leadership and working group organization of the initiative, driven by frequent workshops and ongoing communication within the working groups. Official standards are subjected to a rigorous document process that includes several levels of peer review prior to release. RESULTS: We have produced and published minimum information guidelines describing what information should be provided when making data public, either via public repositories or other means. The PSI has produced a series of standard formats covering mass spectrometer input, mass spectrometer output, results of informatics analysis (both qualitative and quantitative analyses), reports of molecular interaction data, and gel electrophoresis analyses. We have produced controlled vocabularies that ensure that concepts are uniformly annotated in the formats and engaged in extensive software development and dissemination efforts so that the standards can efficiently be used by the community.Conclusion In its first dozen years of operation, the PSI has produced many standards that have accelerated the field of proteomics by facilitating data exchange and deposition to data repositories. We look to the future to continue developing standards for new proteomics technologies and workflows and mechanisms for integration with other omics data types. Our products facilitate the translation of genomics and proteomics findings to clinical and biological phenotypes. The PSI website can be accessed at http://www.psidev.info

Crossref

Serveur académique lausannois

PubMed Central

Gene regulation knowledge commons: community action takes care of DNA binding transcription factors

Author: Blake JA
Chawla K
Christie KR
Hermjakob H
Huntley RP
Kuiper M
Lægreid A
Orchard S
Thommesen L
Tripathi S
Vercruysse S
Publication venue
Publication date: 01/01/2016
Field of study

A large gap remains between the amount of knowledge in scientific literature and the fraction that gets curated into standardized databases, despite many curation initiatives. Yet the availability of comprehensive knowledge in databases is crucial for exploiting existing background knowledge, both for designing follow-up experiments and for interpreting new experimental data. Structured resources also underpin the computational integration and modeling of regulatory pathways, which further aids our understanding of regulatory dynamics. We argue how cooperation between the scientific community and professional curators can increase the capacity of capturing precise knowledge from literature. We demonstrate this with a project in which we mobilize biological domain experts who curate large amounts of DNA binding transcription factors, and show that they, although new to the field of curation, can make valuable contributions by harvesting reported knowledge from scientific papers. Such community curation can enhance the scientific epistemic process.Database URL: http://www.tfcheckpoint.org

Crossref

The Jackson Laboratory: The Mouseion at the JAXlibrary

UCL Discovery

PubMed Central

NORA - Norwegian Open Research Archives

Merging and scoring molecular interactions utilising existing community standards: tools, use-cases and a case study.

Author: Askenazi M.
Choi H.
Del-Toro N.
Duesbury M.
Dumousseau M.
Habermann B.
Hermjakob H.
Jimenez R.
Orchard S.
Ping P.
Porras P.
Villaveces J.
Zong N.
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/01/2015
Field of study

The evidence that two molecules interact in a living cell is often inferred from multiple different experiments. Experimental data is captured in multiple repositories, but there is no simple way to assess the evidence of an interaction occurring in a cellular environment. Merging and scoring of data are commonly required operations after querying for the details of specific molecular interactions, to remove redundancy and assess the strength of accompanying experimental evidence. We have developed both a merging algorithm and a scoring system for molecular interactions based on the proteomics standard initiative-molecular interaction standards. In this manuscript, we introduce these two algorithms and provide community access to the tool suite, describe examples of how these tools are useful to selectively present molecular interaction data and demonstrate a case where the algorithms were successfully used to identify a systematic error in an existing dataset

PubMed Central

MPG.PuRe

R spider: a network-based analysis of gene lists by combining signaling and metabolic pathways from Reactome and KEGG databases

Author: A. V. Antonov
Antonov
Antonov
Antonov
Antonov
Berger
Berriz
Boutet
Draghici
E. E. Schmidt
H. Hermjakob
Khatri
Khatri
Liu
M. Krestyaninova
Ma'ayan
Martin
Masseroli
S. Dietmann
Shannon
Vastrik
Vermeer
Wheeler
Publication venue: Oxford University Press
Publication date: 01/01/2010
Field of study

R spider is a web-based tool for the analysis of a gene list using the systematic knowledge of core pathways and reactions in human biology accumulated in the Reactome and KEGG databases. R spider implements a network-based statistical framework, which provides a global understanding of gene relations in the supplied gene list, and fully exploits the Reactome and KEGG knowledge bases. R spider provides a user-friendly dialog-driven web interface for several model organisms and supports most available gene identifiers. R spider is freely available at http://mips.helmholtz-muenchen.de/proj/rspider

PRIDE: Quality control in a proteomics data repository

Author: A. Csordas
Barsnes
Bradshaw
Craig
D. Ovelleiro
D. Rios
Deutsch
Eisenacher
H. Hermjakob
J. A. Vizcaino
J. M. Foster
Montecchi-Palazzi
R. Wang
Slotta
Taylor
Wang
Publication venue: Oxford University Press
Publication date
Field of study

The PRoteomics IDEntifications (PRIDE) database is a large public proteomics data repository, containing over 270 million mass spectra (by November 2011). PRIDE is an archival database, providing the proteomics data supporting specific scientific publications in a computationally accessible manner. While PRIDE faces rapid increases in data deposition size as well as number of depositions, the major challenge is to ensure a high quality of data depositions in the context of highly diverse proteomics work flows and data representations. Here, we describe the PRIDE curation pipeline and its practical application in quality control of complex data depositions

Crossref

PubMed Central

Perspectives on tracking data reuse across biodata resources.

Author: Bastian F.B.
Bateman A.
Buys M.
Cook C.E.
D'Eustachio P.
Harrison M.
Hermjakob H.
Li D.
Lord P.
Natale D.A.
Peters B.
Ross K.E.
Sternberg P.W.
Su A.I.
Thakur M.
Thomas P.D.
Publication venue
Publication date: 01/01/2024
Field of study

Data reuse is a common and vital practice in molecular biology and enables the knowledge gathered over recent decades to drive discovery and innovation in the life sciences. Much of this knowledge has been collated into molecular biology databases, such as UniProtKB, and these resources derive enormous value from sharing data among themselves. However, quantifying and documenting this kind of data reuse remains a challenge. The article reports on a one-day virtual workshop hosted by the UniProt Consortium in March 2023, attended by representatives from biodata resources, experts in data management, and NIH program managers. Workshop discussions focused on strategies for tracking data reuse, best practices for reusing data, and the challenges associated with data reuse and tracking. Surveys and discussions showed that data reuse is widespread, but critical information for reproducibility is sometimes lacking. Challenges include costs of tracking data reuse, tensions between tracking data and open sharing, restrictive licenses, and difficulties in tracking commercial data use. Recommendations that emerged from the discussion include: development of standardized formats for documenting data reuse, education about the obstacles posed by restrictive licenses, and continued recognition by funding agencies that data management is a critical activity that requires dedicated resources. Summaries of survey results are available at: https://docs.google.com/forms/d/1j-VU2ifEKb9C-sW6l3ATB79dgHdRk5v_lESv2hawnso/viewanalytics (survey of data providers) and https://docs.google.com/forms/d/18WbJFutUd7qiZoEzbOytFYXSfWFT61hVce0vjvIwIjk/viewanalytics (survey of users)

Serveur académique lausannois