Search CORE

VU Research Portal

FigShare

Interoperability and FAIRness through a novel combination of Web technologies

Author: Bolleman Jerven T.
Bonino da Silva Santos Luiz Olavo
Ciccarese Paolo
Clark Tim
Dumontier Michel
Gavai Anand
Gray Alasdair J. G.
Kaliyaperumal Rajaram
Kelpin Fleur D. L.
Kuzniar Arnold
Schultes Erik A.
Swertz Morris A.
Thompson Mark
van Mulligen Erik M.
Verborgh Ruben
Wilkinson Mark D.
Publication venue: 'PeerJ'
Publication date: 01/01/2017
Field of study

Data in the life sciences are extremely diverse and are stored in a broad spectrum of repositories ranging from those designed for particular data types (such as KEGG for pathway data or UniProt for protein data) to those that are general-purpose (such as FigShare, Zenodo, Dataverse or EUDAT). These data have widely different levels of sensitivity and security considerations. For example, clinical observations about genetic mutations in patients are highly sensitive, while observations of species diversity are generally not. The lack of uniformity in data models from one repository to another, and in the richness and availability of metadata descriptions, makes integration and analysis of these data a manual, time-consuming task with no scalability. Here we explore a set of resource-oriented Web design patterns for data discovery, accessibility, transformation, and integration that can be implemented by any general- or special-purpose repository as a means to assist users in finding and reusing their data holdings. We show that by using off-the-shelf technologies, interoperability can be achieved atthe level of an individual spreadsheet cell. We note that the behaviours of this architecture compare favourably to the desiderata defined by the FAIR Data Principles, and can therefore represent an exemplar implementation of those principles. The proposed interoperability design patterns may be used to improve discovery and integration of both new and legacy data, maximizing the utility of all scholarly outputs

Proceedings - University of Groningen

Heriot Watt Pure

ARTS repository - University of Groningen

University of Groningen

Ghent University Academic Bibliography

Dissertations of the University of Groningen

The smartAPI ecosystem for making Web APIs FAIR

Author: Afrasiabe C.
Assis P.
Availlach P.
Dastgheib S.
De Pons J.
Dumontier M.
Jagodnik K.
Korodi G.
Pilarczyk M.
Schürer S.
Terryn R.
Verborgh Ruben
Whetzel T.
Wu C
Zaveri A.
Publication venue
Publication date: 01/01/2017
Field of study

Ghent University Academic Bibliography

Semantic Web integration of Cheminformatics resources with the SADI framework

Author: A McNaught
B Chen
BP Vandervalk
C Steinbeck
CA Lipinski
CA Lipinski
DDG Gessler
E Benfenati
F Belleau
J Kietz
Leonid L Chepelev
M DiBernardo
MD Wilkinson
MD Wilkinson
Michel Dumontier
P Lord
PB Neerincx
R Guha
R Stevens
T Kuhn
T Vitvar
Publication venue: BioMed Central
Publication date: 01/05/2011
Field of study

Abstract Background The diversity and the largely independent nature of chemical research efforts over the past half century are, most likely, the major contributors to the current poor state of chemical computational resource and database interoperability. While open software for chemical format interconversion and database entry cross-linking have partially addressed database interoperability, computational resource integration is hindered by the great diversity of software interfaces, languages, access methods, and platforms, among others. This has, in turn, translated into limited reproducibility of computational experiments and the need for application-specific computational workflow construction and semi-automated enactment by human experts, especially where emerging interdisciplinary fields, such as systems chemistry, are pursued. Fortunately, the advent of the Semantic Web, and the very recent introduction of RESTful Semantic Web Services (SWS) may present an opportunity to integrate all of the existing computational and database resources in chemistry into a machine-understandable, unified system that draws on the entirety of the Semantic Web. Results We have created a prototype framework of Semantic Automated Discovery and Integration (SADI) framework SWS that exposes the QSAR descriptor functionality of the Chemistry Development Kit. Since each of these services has formal ontology-defined input and output classes, and each service consumes and produces RDF graphs, clients can automatically reason about the services and available reference information necessary to complete a given overall computational task specified through a simple SPARQL query. We demonstrate this capability by carrying out QSAR analysis backed by a simple formal ontology to determine whether a given molecule is drug-like. Further, we discuss parameter-based control over the execution of SADI SWS. Finally, we demonstrate the value of computational resource envelopment as SADI services through service reuse and ease of integration of computational functionality into formal ontologies. Conclusions The work we present here may trigger a major paradigm shift in the distribution of computational resources in chemistry. We conclude that envelopment of chemical computational resources as SADI SWS facilitates interdisciplinary research by enabling the definition of computational problems in terms of ontologies and formal logical statements instead of cumbersome and application-specific tasks and workflows.</p

Springer - Publisher Connector

Repository@Hull - Worktribe

PubMed Central

Assaying Rho GTPase–dependent processes in Dictyostelium discoideum

Author: A Hall
A Müller-Taubenberger
BP Somesh
CY Chung
D Dormann
DM Veltman
DM Veltman
F Rivero
G Vlahou
G Vlahou
J Condeelis
J Faix
JW Han
KC Park
M Dumontier
M Marinović
M Roche de la
R Kessin
T Howard
V Bernard
V Filić
V Filić
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 30/07/2018
Field of study

The model organism D. discoideum is well-suited to investigate basic questions of molecular and cell biology, particularly those related to the structure, regulation and dynamics of the cytoskeleton, signal transduction, cell-cell adhesion and development. D. discoideum cells make use of Rho-regulated signaling pathways to reorganize the actin cytoskeleton during chemotaxis, endocytosis and cytokinesis. In this organism the Rho family encompasses 20 members, several belonging to the Rac subfamily, but there are no representatives of the Cdc42 and Rho subfamilies. Here we present protocols suitable for monitoring the actin polymerization response and the activation of Rac upon stimulation of aggregation competent cells with the chemoattractant cAMP, and for monitoring the localization and dynamics of Rac activity in live cells

eScholarship - University of California

Optimizing Nervous System-Specific Gene Targeting with Cre Driver Lines: Prevalence of Germline Recombination and Influencing Factors.

Author: Abumaria Nashat
Ambrozkiewicz Mateusz C
Beier Kevin T
Benseler Fritz
Brose Nils
Burgess Harold A
Cepko Constance L
Chen Cui
Cloutier Jean-François
Craig Ann Marie
Dumontier Emilie
Eroglu Cagla
Falkner Susanne
Furlanis Elisabetta
Goebbels Sandra
Gomez Andrea M
Hoshina Naosuke
Huang Wei-Hsiang
Hutchison Mary Anne
Itoh-Maruoka Yu
Kaeser Pascal S
Kawabe Hiroshi
Kay Jeremy N
Lavery Laura A
Li Wei
Lu Wei
Luo Lin
Luo Liqun
Mandai Kenji
Maruo Tomohiko
McBain Chris J
Motohashi Junko
Nave Klaus-Armin
Pai Emily Ling-Lin
Pelkey Kenneth A
Pereira Ariane
Philips Thomas
Prado Marco AM
Prado Vania F
Rothstein Jeffrey
Rubenstein John LR
Saher Gesine
Sakimura Kenji
Sanes Joshua R
Scheiffele Peter
Sinclair Jennifer L
Stogsdill Jeff A
Takai Yoshimi
Traunmüller Lisa
Umemori Hisashi
Verhage Matthijs
Wang Jiexin
Wortel Joke
You Wenjia
Yuzaki Michisuke
Zoghbi Huda Yahya
Publication venue: eScholarship, University of California
Publication date: 01/01/2020
Field of study

The Cre-loxP system is invaluable for spatial and temporal control of gene knockout, knockin, and reporter expression in the mouse nervous system. However, we report varying probabilities of unexpected germline recombination in distinct Cre driver lines designed for nervous system-specific recombination. Selective maternal or paternal germline recombination is showcased with sample Cre lines. Collated data reveal germline recombination in over half of 64 commonly used Cre driver lines, in most cases with a parental sex bias related to Cre expression in sperm or oocytes. Slight differences among Cre driver lines utilizing common transcriptional control elements affect germline recombination rates. Specific target loci demonstrated differential recombination; thus, reporters are not reliable proxies for another locus of interest. Similar principles apply to other recombinase systems and other genetically targeted organisms. We hereby draw attention to the prevalence of germline recombination and provide guidelines to inform future research for the neuroscience and broader molecular genetics communities

VU Research Portal

edoc

Adding a Little Reality to Building Ontologies for Biology

Author: A Rector
AP Seyed
B Russell
B Smith
B Smith
B Smith
B Zeeberg
G Merrill
I Johansson
Iddo Friedberg
J Shrager
K Wolstencroft
M Ashburner
M Dumontier
M Egana
P Grenon
P Lord
Phillip Lord
PL Whetzel
PW Lord
R Stevens
Robert Stevens
S Schulz
T Gruber
W Ceusters
Publication venue: Public Library of Science
Publication date: 01/01/2010
Field of study

BACKGROUND: Many areas of biology are open to mathematical and computational modelling. The application of discrete, logical formalisms defines the field of biomedical ontologies. Ontologies have been put to many uses in bioinformatics. The most widespread is for description of entities about which data have been collected, allowing integration and analysis across multiple resources. There are now over 60 ontologies in active use, increasingly developed as large, international collaborations. There are, however, many opinions on how ontologies should be authored; that is, what is appropriate for representation. Recently, a common opinion has been the "realist" approach that places restrictions upon the style of modelling considered to be appropriate. METHODOLOGY/PRINCIPAL FINDINGS: Here, we use a number of case studies for describing the results of biological experiments. We investigate the ways in which these could be represented using both realist and non-realist approaches; we consider the limitations and advantages of each of these models. CONCLUSIONS/SIGNIFICANCE: From our analysis, we conclude that while realist principles may enable straight-forward modelling for some topics, there are crucial aspects of science and the phenomena it studies that do not fit into this approach; realism appears to be over-simplistic which, perversely, results in overly complex ontological models. We suggest that it is impossible to avoid compromise in modelling ontology; a clearer understanding of these compromises will better enable appropriate modelling, fulfilling the many needs for discrete mathematical models within computational biology

Public Library of Science (PLOS)

CiteSeerX

The University of Manchester - Institutional Repository

PubMed Central

Improving the performance of DomainDiscovery of protein domain boundary assignment using inter-domain linker index

Author: A Andreeva
Abdur R Sikder
Albert Y Zomaya
AR Sikder
FMG Pearl
G Pollastri
G Pollastri
HM Berman
J Cheng
J Liu
J Sim
JE Gewehr
L Kong
M Dumontier
M Suyama
N Nagarajan
OV Galzitskaya
RA George
RL Marsden
S Veretnik
SF Altschul
SJ Wheelan
T Joachims
TA Holland
V Vapnik
Publication venue: BioMed Central
Publication date: 01/01/2006
Field of study

BACKGROUND: Knowledge of protein domain boundaries is critical for the characterisation and understanding of protein function. The ability to identify domains without the knowledge of the structure – by using sequence information only – is an essential step in many types of protein analyses. In this present study, we demonstrate that the performance of DomainDiscovery is improved significantly by including the inter-domain linker index value for domain identification from sequence-based information. Improved DomainDiscovery uses a Support Vector Machine (SVM) approach and a unique training dataset built on the principle of consensus among experts in defining domains in protein structure. The SVM was trained using a PSSM (Position Specific Scoring Matrix), secondary structure, solvent accessibility information and inter-domain linker index to detect possible domain boundaries for a target sequence. RESULTS: Improved DomainDiscovery is compared with other methods by benchmarking against a structurally non-redundant dataset and also CASP5 targets. Improved DomainDiscovery achieves 70% accuracy for domain boundary identification in multi-domains proteins. CONCLUSION: Improved DomainDiscovery compares favourably to the performance of other methods and excels in the identification of domain boundaries for multi-domain proteins as a result of introducing support vector machine with benchmark_2 dataset

Springer - Publisher Connector

PubMed Central

Mathematical model for empirically optimizing large scale production of soluble protein domains

Author: A Fontana
A Kouranov
Atsushi Kurotani
BH Dessailly
C Zhang
D Christ
DT Jones
E Chikayama
Eisuke Chikayama
F Corpet
GE Folkers
GE Tusnady
HM Berman
JM Chandonia
M Dumontier
M Suyama
PB Card
R Kikuno
RL Marsden
S Cabantous
S Dokudovskaya
S Miyazaki
S Miyazaki
Satoshi Miyazaki
SF Altschul
Shigeyuki Yokoyama
SJ Wheelan
T Hondoh
T Kigawa
T Niwa
T Tanaka
Takanori Tanaka
Takashi Yabuki
TC Terwilliger
X Gao
Y Kuroda
Yutaka Kuroda
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study