Search CORE

30 research outputs found

Genetic Variations and Diseases in UniProtKB/Swiss-Prot: The Ins and Outs of Expert Manual Curation.

Author: Bolleman J.
Bougueleret L.
Breuza L.
Bridge A.
Estreicher A.
Famiglietti M.L.
Gos A.
Géhant S.
Poux S.
Redaschi N.
Xenarios I.
Publication venue: 'Wiley'
Publication date: 01/01/2014
Field of study

During the last few years, next-generation sequencing (NGS) technologies have accelerated the detection of genetic variants resulting in the rapid discovery of new disease-associated genes. However, the wealth of variation data made available by NGS alone is not sufficient to understand the mechanisms underlying disease pathogenesis and manifestation. Multidisciplinary approaches combining sequence and clinical data with prior biological knowledge are needed to unravel the role of genetic variants in human health and disease. In this context, it is crucial that these data are linked, organized, and made readily available through reliable online resources. The Swiss-Prot section of the Universal Protein Knowledgebase (UniProtKB/Swiss-Prot) provides the scientific community with a collection of information on protein functions, interactions, biological pathways, as well as human genetic diseases and variants, all manually reviewed by experts. In this article, we present an overview of the information content of UniProtKB/Swiss-Prot to show how this knowledgebase can support researchers in the elucidation of the mechanisms leading from a molecular defect to a disease phenotype

Serveur académique lausannois

PubMed Central

Interoperability and FAIRness through a novel combination of Web technologies

Author: Bolleman Jerven T.
Bonino da Silva Santos Luiz Olavo
Ciccarese Paolo
Clark Tim
Dumontier Michel
Gavai Anand
Gray Alasdair J. G.
Kaliyaperumal Rajaram
Kelpin Fleur D. L.
Kuzniar Arnold
Schultes Erik A.
Swertz Morris A.
Thompson Mark
van Mulligen Erik M.
Verborgh Ruben
Wilkinson Mark D.
Publication venue: 'PeerJ'
Publication date: 01/01/2017
Field of study

Data in the life sciences are extremely diverse and are stored in a broad spectrum of repositories ranging from those designed for particular data types (such as KEGG for pathway data or UniProt for protein data) to those that are general-purpose (such as FigShare, Zenodo, Dataverse or EUDAT). These data have widely different levels of sensitivity and security considerations. For example, clinical observations about genetic mutations in patients are highly sensitive, while observations of species diversity are generally not. The lack of uniformity in data models from one repository to another, and in the richness and availability of metadata descriptions, makes integration and analysis of these data a manual, time-consuming task with no scalability. Here we explore a set of resource-oriented Web design patterns for data discovery, accessibility, transformation, and integration that can be implemented by any general- or special-purpose repository as a means to assist users in finding and reusing their data holdings. We show that by using off-the-shelf technologies, interoperability can be achieved atthe level of an individual spreadsheet cell. We note that the behaviours of this architecture compare favourably to the desiderata defined by the FAIR Data Principles, and can therefore represent an exemplar implementation of those principles. The proposed interoperability design patterns may be used to improve discovery and integration of both new and legacy data, maximizing the utility of all scholarly outputs

Maastricht University Research Portal

Heriot Watt Pure

Proceedings - University of Groningen

Crossref

University of Groningen

ARTS repository - University of Groningen

Ghent University Academic Bibliography

Directory of Open Access Journals

Dissertations of the University of Groningen

Navigating in vitro bioactivity data by investigating available resources using model compounds.

Author: Augsburger F.
Bolleman J.T.
Bridge A.J.
Ilmjärv S.
Jaquet V.
Krause K.H.
Liechti R.
Sandström J.
Xenarios I.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2019
Field of study

The number of chemical compounds and associated experimental data in public databases is growing, but presently there is no simple way to access these data in a quick and synoptic manner. Instead, data are fragmented across different resources and interested parties need to invest invaluable time and effort to navigate these systems

Serveur académique lausannois

Archive ouverte UNIGE

Interaction between Record Matching and Data Repairing

Author: Aiken A.
Bolleman J.
Bravo L.
Cong G.
Gartner
Nan Tang
Naumann F.
Rahm E.
Shuai Ma
Wegener I.
Wenfei Fan
Wenyuan Yu
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/05/2014
Field of study

Crossref

Edinburgh Research Explorer

Supplemental Information 2: Example dataset description

Access to consistent, high-quality metadata is critical to finding, understanding, and reusing scientific data. However, while there are many relevant vocabularies for the annotation of a dataset, none sufficiently captures all the necessary metadata. This prevents uniform indexing and querying of dataset repositories. Towards providing a practical guide for producing a high quality description of biomedical datasets, the W3C Semantic Web for Health Care and the Life Sciences Interest Group (HCLSIG) identified Resource Description Framework (RDF) vocabularies that could be used to specify common metadata elements and their value sets. The resulting guideline covers elements of description, identification, attribution, versioning, provenance, and content summarization. This guideline reuses existing vocabularies, and is intended to meet key functional requirements including indexing, discovery, exchange, query, and retrieval of datasets, thereby enabling the publication of FAIR data. The resulting metadata profile is generic and could be used by other domains with an interest in providing machine readable descriptions of versioned datasets

Maastricht University Research Portal

Crossref

Heriot Watt Pure

Directory of Open Access Journals

PubMed Central

eScholarship - University of California

Oxford University Research Archive

The health care and life sciences community profile for dataset descriptions

Carleton University's Institutional Repository

Locustella luscinioides, found breeding in the province of Friesland (Netherlands)

Author: Van Der Veen P J Bolleman
Publication venue: Leyden.
Publication date: 01/01/1896
Field of study

Volume: 18Start Page: 160End Page: 16

Biodiversity Heritage Library OAI Repository

Het natuurlijk evenwicht en de mensch : natuur en cultuur

Author: Bolleman van der Veen P. J.
Publication venue
Publication date: 01/01/1910
Field of study

Utrecht University Repository

Symposium Een postmoderne vredescultuur? (verslag)

Author: Bolleman Th.G.
Kunneman H.P.
Mul J. de
Vriens L.
Weerdenburg J.
Publication venue
Publication date: 01/07/1989
Field of study

De kritiek die van de zijde van postmodernen wordt uitgeoefend op alle vormen van vooruitgangsgeloof prikkelde mijn nieuwsgierigheid, in het bijzonder waar deze kritiek steun aan de vredesbeweging als een zinloze, zo niet huichelachtige, mensonwaardige, activiteit zou bestempelen. In hoeverre zou deze postmoderne kritiek verwantschap vertonen met pre-moderne, traditionele vermaningen en bedreigingen uit de mond van hen die elk streven naar wereldvrede als absurd en staatsgevaarlijk bestempelden? Zijn de postmodernen de vijanden van alle vormen van emancipatie, de nieuwe vrienden van oude conservatieve machthebbers, een soort neo-fascisten

Utrecht University Repository

FALDO: a semantic standard for describing the location of nucleotide and protein feature annotation

Author: Baran Joachim
Bolleman Jerven T.
Bonnal Raoul J. P.
Buels Robert
Cock Peter J. A.
Dumontier Michel
Fujisawa Takatomo
Hoehndorf Robert
Katayama Toshiaki
Mungall Christopher J.
Strozzi Francesco
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2016
Field of study

BACKGROUND: Nucleotide and protein sequence feature annotations are essential to understand biology on the genomic, transcriptomic, and proteomic level. Using Semantic Web technologies to query biological annotations, there was no standard that described this potentially complex location information as subject-predicate-object triples. DESCRIPTION: We have developed an ontology, the Feature Annotation Location Description Ontology (FALDO), to describe the positions of annotated features on linear and circular sequences. FALDO can be used to describe nucleotide features in sequence records, protein annotations, and glycan binding sites, among other features in coordinate systems of the aforementioned “omics” areas. Using the same data format to represent sequence positions that are independent of file formats allows us to integrate sequence data from multiple sources and data types. The genome browser JBrowse is used to demonstrate accessing multiple SPARQL endpoints to display genomic feature annotations, as well as protein annotations from UniProt mapped to genomic locations. CONCLUSIONS: Our ontology allows users to uniformly describe – and potentially merge – sequence annotations from multiple sources. Data sources using FALDO can prospectively be retrieved using federalised SPARQL queries against public SPARQL endpoints and/or local private triple stores

Maastricht University Research Portal

Crossref

Springer - Publisher Connector

PubMed Central

eScholarship - University of California