Search CORE

RERO DOC Digital Library

Interoperability and FAIRness through a novel combination of Web technologies

Author: Bolleman Jerven T.
Bonino da Silva Santos Luiz Olavo
Ciccarese Paolo
Clark Tim
Dumontier Michel
Gavai Anand
Gray Alasdair J. G.
Kaliyaperumal Rajaram
Kelpin Fleur D. L.
Kuzniar Arnold
Schultes Erik A.
Swertz Morris A.
Thompson Mark
van Mulligen Erik M.
Verborgh Ruben
Wilkinson Mark D.
Publication venue: 'PeerJ'
Publication date: 01/01/2017
Field of study

Data in the life sciences are extremely diverse and are stored in a broad spectrum of repositories ranging from those designed for particular data types (such as KEGG for pathway data or UniProt for protein data) to those that are general-purpose (such as FigShare, Zenodo, Dataverse or EUDAT). These data have widely different levels of sensitivity and security considerations. For example, clinical observations about genetic mutations in patients are highly sensitive, while observations of species diversity are generally not. The lack of uniformity in data models from one repository to another, and in the richness and availability of metadata descriptions, makes integration and analysis of these data a manual, time-consuming task with no scalability. Here we explore a set of resource-oriented Web design patterns for data discovery, accessibility, transformation, and integration that can be implemented by any general- or special-purpose repository as a means to assist users in finding and reusing their data holdings. We show that by using off-the-shelf technologies, interoperability can be achieved atthe level of an individual spreadsheet cell. We note that the behaviours of this architecture compare favourably to the desiderata defined by the FAIR Data Principles, and can therefore represent an exemplar implementation of those principles. The proposed interoperability design patterns may be used to improve discovery and integration of both new and legacy data, maximizing the utility of all scholarly outputs

Proceedings - University of Groningen

Heriot Watt Pure

ARTS repository - University of Groningen

University of Groningen

Ghent University Academic Bibliography

Directory of Open Access Journals

Dissertations of the University of Groningen

The EBI RDF platform: linked open data for the life sciences

Author: Birney Ewan
Bolleman Jerven
Brandizi Marco
Davies Mark
Garcia Leyla
Gaulton Anna
Gehant Sebastien
Jenkinson Andrew M.
Jupp Simon
Laibe Camille
Le Novère Nicolas
Malone James
Martin Maria
Parkinson Helen
Redaschi Nicole
Wimalaratne Sarala M.
Publication venue
Publication date: 02/08/2017
Field of study

Motivation: Resource description framework (RDF) is an emerging technology for describing, publishing and linking life science data. As a major provider of bioinformatics data and services, the European Bioinformatics Institute (EBI) is committed to making data readily accessible to the community in ways that meet existing demand. The EBI RDF platform has been developed to meet an increasing demand to coordinate RDF activities across the institute and provides a new entry point to querying and exploring integrated resources available at the EBI. Availability: http://www.ebi.ac.uk/rdf Contact: [email protected]

RERO DOC Digital Library

Supplemental Information 2: Example dataset description

Access to consistent, high-quality metadata is critical to finding, understanding, and reusing scientific data. However, while there are many relevant vocabularies for the annotation of a dataset, none sufficiently captures all the necessary metadata. This prevents uniform indexing and querying of dataset repositories. Towards providing a practical guide for producing a high quality description of biomedical datasets, the W3C Semantic Web for Health Care and the Life Sciences Interest Group (HCLSIG) identified Resource Description Framework (RDF) vocabularies that could be used to specify common metadata elements and their value sets. The resulting guideline covers elements of description, identification, attribution, versioning, provenance, and content summarization. This guideline reuses existing vocabularies, and is intended to meet key functional requirements including indexing, discovery, exchange, query, and retrieval of datasets, thereby enabling the publication of FAIR data. The resulting metadata profile is generic and could be used by other domains with an interest in providing machine readable descriptions of versioned datasets

Directory of Open Access Journals

Heriot Watt Pure

eScholarship - University of California

Oxford University Research Archive

The 3rd DBCLS BioHackathon: improving life science data integration with Semantic Web technologies.

Author: Aerts Jan
Afzal Hammad
Antezana Erick
Arakawa Kazuharu
Aranda Bruno
Asai Kiyoshi
Belleau Francois
Bolleman Jerven
Bonnal Raoul Jp
Chapman Brad
Chun Hong-Woo
Cock Peter Ja
Eriksson Tore
Gordon Paul Mk
Goto Naohisa
Hayashi Kazuhiro
Horn Heiko
Ishiwata Ryosuke
Kaminuma Eli
Kasprzyk Arek
Katayama Toshiaki
Kawaji Hideya
Kawamoto Shoko
Kawashima Shuichi
Kido Nobuhiro
Kim Young Joo
Kinjo Akira R
Konishi Fumikazu
Kwon Kyung-Hoon
Labarga Alberto
Lamprecht Anna-Lena
Lin Yu
Lindenbaum Pierre
McCarthy Luke
Micklem Gos
Morita Hideyuki
Murakami Katsuhiko
Nagao Koji
Nakao Mitsuteru
Nishida Kozo
Nishimura Kunihiro
Nishizawa Tatsuya
Ogishima Soichi
Okamoto Shinobu
Okubo Kosaku
Ono Keiichiro
Oouchida Kenta
Oshita Kazuki
Park Keun-Joon
Prins Pjotr
Saito Taro L
Samwald Matthias
Satagopam Venkata P
Shigemoto Yasumasa
Smith Richard
Splendiani Andrea
Sugawara Hideaki
Takagi Toshihisa
Taylor James
Vos Rutger A
Wilkinson Mark D
Withers David
Yamaguchi Atsuko
Yamamoto Yasunori
Yamasaki Chisato
Zmasek Christian M
Publication venue: J Biomed Semantics
Publication date: 01/01/2013
Field of study

BACKGROUND: BioHackathon 2010 was the third in a series of meetings hosted by the Database Center for Life Sciences (DBCLS) in Tokyo, Japan. The overall goal of the BioHackathon series is to improve the quality and accessibility of life science research data on the Web by bringing together representatives from public databases, analytical tool providers, and cyber-infrastructure researchers to jointly tackle important challenges in the area of in silico biological research. RESULTS: The theme of BioHackathon 2010 was the 'Semantic Web', and all attendees gathered with the shared goal of producing Semantic Web data from their respective resources, and/or consuming or interacting those data using their tools and interfaces. We discussed on topics including guidelines for designing semantic data and interoperability of resources. We consequently developed tools and clients for analysis and visualization. CONCLUSION: We provide a meeting report from BioHackathon 2010, in which we describe the discussions, decisions, and breakthroughs made as we moved towards compliance with Semantic Web technologies - from source provider, through middleware, to the end-consumer.RIGHTS : This article is licensed under the BioMed Central licence at http://www.biomedcentral.com/about/license which is similar to the 'Creative Commons Attribution Licence'. In brief you may : copy, distribute, and display the work; make derivative works; or make commercial use of the work - under the following conditions: the original author must be given credit; for any reuse or distribution, it must be made clear to others what the license terms of this work are

Springer - Publisher Connector

Copenhagen University Research Information System

eScholarship - University of California

Apollo (Cambridge)

BioHackathon series in 2011 and 2012: penetration of ontology and linked data in life science domains

Author: Aerts Jan
Akune Yukie
Antezana Erick
Aoki-Kinoshita Kiyoko F
Arakawa Kazuharu
Aranda Bruno
Baran Joachim
Bolleman Jerven
Bonnal Raoul JP
Bono Hidemasa
Buttigieg Pier Luigi
Campbell Matthew P
Chen Yi-an
Chiba Hirokazu
Cock Peter JA
Cohen K Bretonnel
Constantin Alexandru
Duck Geraint
Dumontier Michel
Fujisawa Takatomo
Fujiwara Toyofumi
Goto Naohisa
Hoehndorf Robert
Igarashi Yoshinobu
Itaya Hidetoshi
Ito Maori
Iwasaki Wataru
Kalaš Matúš
Kano Yoshinobu
Katayama Toshiaki
Katoda Takeo
Kawamoto Shoko
Kawano Shin
Kawashima Shuichi
Kim Jin-Dong
Kim Taehong
Kocbek Simon
Kokubu Anna
Komiyama Yusuke
Kotera Masaaki
Laibe Camille
Lapp Hilmar
Lütteke Thomas
Marshall M Scott
Mori Hiroshi
Mori Takaaki
Morita Mizuki
Murakami Katsuhiko
Nakao Mitsuteru
Narimatsu Hisashi
Nishide Hiroyo
Nishimura Yosuke
Nystrom-Persson Johan
Ogishima Soichi
Okamoto Shinobu
Okamura Yasunobu
Okuda Shujiro
Ono Hiromasa
Oshita Kazuki
Packer Nicki H
Prins Pjotr
Ranzinger Rene
Rocca-Serra Philippe
Sansone Susanna
Sawaki Hiromichi
Shin Sung-Ho
Splendiani Andrea
Strozzi Francesco
Tadaka Shu
Takagi Toshihisa
Toukach Philip
Uchiyama Ikuo
Umezaki Masahito
Vos Rutger
Wang Yue
Whetzel Patricia L
Wilkinson Mark D
Wu Hongyan
Yamada Issaku
Yamaguchi Atsuko
Yamamoto Yasunori
Yamasaki Chisato
Yamashita Riu
York William S
Zmasek Christian M
Publication venue
Publication date: 01/01/2014
Field of study

The application of semantic technologies to the integration of biological data and the interoperability of bioinformatics analysis and visualization tools has been the common theme of a series of annual BioHackathons hosted in Japan for the past five years. Here we provide a review of the activities and outcomes from the BioHackathons held in 2011 in Kyoto and 2012 in Toyama. In order to efficiently implement semantic technologies in the life sciences, participants formed various sub-groups and worked on the following topics: Resource Description Framework (RDF) models for specific domains, text mining of the literature, ontology development, essential metadata for biological databases, platforms to enable efficient Semantic Web technology development and interoperability, and the development of applications for Semantic Web data. In this review, we briefly introduce the themes covered by these sub-groups. The observations made, conclusions drawn, and software development projects that emerged from these activities are discussed

University of Bergen

Aberystwyth Research Portal

Springer - Publisher Connector