Search CORE

3,576 research outputs found

Interoperability and FAIRness through a novel combination of Web technologies

Author: Bolleman Jerven T.
Bonino da Silva Santos Luiz Olavo
Ciccarese Paolo
Clark Tim
Dumontier Michel
Gavai Anand
Gray Alasdair J. G.
Kaliyaperumal Rajaram
Kelpin Fleur D. L.
Kuzniar Arnold
Schultes Erik A.
Swertz Morris A.
Thompson Mark
van Mulligen Erik M.
Verborgh Ruben
Wilkinson Mark D.
Publication venue: 'PeerJ'
Publication date: 01/01/2017
Field of study

Data in the life sciences are extremely diverse and are stored in a broad spectrum of repositories ranging from those designed for particular data types (such as KEGG for pathway data or UniProt for protein data) to those that are general-purpose (such as FigShare, Zenodo, Dataverse or EUDAT). These data have widely different levels of sensitivity and security considerations. For example, clinical observations about genetic mutations in patients are highly sensitive, while observations of species diversity are generally not. The lack of uniformity in data models from one repository to another, and in the richness and availability of metadata descriptions, makes integration and analysis of these data a manual, time-consuming task with no scalability. Here we explore a set of resource-oriented Web design patterns for data discovery, accessibility, transformation, and integration that can be implemented by any general- or special-purpose repository as a means to assist users in finding and reusing their data holdings. We show that by using off-the-shelf technologies, interoperability can be achieved atthe level of an individual spreadsheet cell. We note that the behaviours of this architecture compare favourably to the desiderata defined by the FAIR Data Principles, and can therefore represent an exemplar implementation of those principles. The proposed interoperability design patterns may be used to improve discovery and integration of both new and legacy data, maximizing the utility of all scholarly outputs

Maastricht University Research Portal

Heriot Watt Pure

Proceedings - University of Groningen

University of Groningen

ARTS repository - University of Groningen

Ghent University Academic Bibliography

Directory of Open Access Journals

Dissertations of the University of Groningen

Recommended from our members

Finding the traces of behavioral and cognitive processes in big data and naturally occurring datasets.

Author: Griffiths Tom
Paxton Alexandra
Publication venue: eScholarship, University of California
Publication date: 01/10/2017
Field of study

Today, people generate and store more data than ever before as they interact with both real and virtual environments. These digital traces of behavior and cognition offer cognitive scientists and psychologists an unprecedented opportunity to test theories outside the laboratory. Despite general excitement about big data and naturally occurring datasets among researchers, three gaps stand in the way of their wider adoption in theory-driven research: the imagination gap, the skills gap, and the culture gap. We outline an approach to bridging these three gaps while respecting our responsibilities to the public as participants in and consumers of the resulting research. To that end, we introduce Data on the Mind ( http://www.dataonthemind.org ), a community-focused initiative aimed at meeting the unprecedented challenges and opportunities of theory-driven research with big data and naturally occurring datasets. We argue that big data and naturally occurring datasets are most powerfully used to supplement-not supplant-traditional experimental paradigms in order to understand human behavior and cognition, and we highlight emerging ethical issues related to the collection, sharing, and use of these powerful datasets

eScholarship - University of California

Spreadsheet Framework for Visual Exploration of Biomedical Datasets

Author: Boulic R.
Maciel A.
Sarni S.
Thalmann D.
Publication venue
Publication date: 16/01/2007
Field of study

In this paper, we present our spreadsheet framework, which uses a spreadsheet-likeinterface for exploring biomedical datasets. The principles and advantages of this classof visualization systems are illustrated, and a case study for the analysis of hip jointcongruity is presented. Throughout this use case, we see how end users can comparedifferent datasets, apply parallel operations on data, create analysis templates, andhow this helps them in the exploration process

Infoscience - École polytechnique fédérale de Lausanne

Applying a User-centred Approach to Interactive Visualization Design

Author: A. Cooper
A. MacEachren
A. Sutcliffe
B. Latour
B. Shneiderman
C. Chen
C. North
C. van der Lelie
C. Ware
D. Benyon
D. G. Novick
D. Morgan
G. J. Trafton
H. Beyer
H. Javahery
H. Rauwerda
J. A. Landay
J. D. Thompson
J. M. Carroll
J. Nielsen
J. Preece
J. Seo
J. Zhang
K. Dunbar
L. Arnstein
L. E. Wood
M. Clamp
M. Graham
M. Rettig
M. Tory
O. Kulyk
P. A. Pevzner
P. Figueroa
R. Chenna
R. Poppe
R. Spence
S. Westerman
W. E. Mackay
Publication venue: Springer Verlag
Publication date: 01/01/2008
Field of study

Analysing users in their context of work and finding out how and why they use different information resources is essential to provide interactive visualisation systems that match their goals and needs. Designers should actively involve the intended users throughout the whole process. This chapter presents a user-centered approach for the design of interactive visualisation systems. We describe three phases of the iterative visualisation design process: the early envisioning phase, the global specification hase, and the detailed specification phase. The whole design cycle is repeated until some criterion of success is reached. We discuss different techniques for the analysis of users, their tasks and domain. Subsequently, the design of prototypes and evaluation methods in visualisation practice are presented. Finally, we discuss the practical challenges in design and evaluation of collaborative visualisation environments. Our own case studies and those of others are used throughout the whole chapter to illustrate various approaches

VU Research Portal

Crossref

University of Twente Research Information

Multiplierz: An Extensible API Based Desktop Environment for Proteomics Data Analysis

Author: Askenazi Manor
Blank Nathaniel C.
Cashorali Tanya
Ficarro Scott B.
Marto Jarrod A.
Parikh Jignesh R.
Webber James T.
Zhang Yi
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2009
Field of study

BACKGROUND. Efficient analysis of results from mass spectrometry-based proteomics experiments requires access to disparate data types, including native mass spectrometry files, output from algorithms that assign peptide sequence to MS/MS spectra, and annotation for proteins and pathways from various database sources. Moreover, proteomics technologies and experimental methods are not yet standardized; hence a high degree of flexibility is necessary for efficient support of high- and low-throughput data analytic tasks. Development of a desktop environment that is sufficiently robust for deployment in data analytic pipelines, and simultaneously supports customization for programmers and non-programmers alike, has proven to be a significant challenge. RESULTS. We describe multiplierz, a flexible and open-source desktop environment for comprehensive proteomics data analysis. We use this framework to expose a prototype version of our recently proposed common API (mzAPI) designed for direct access to proprietary mass spectrometry files. In addition to routine data analytic tasks, multiplierz supports generation of information rich, portable spreadsheet-based reports. Moreover, multiplierz is designed around a "zero infrastructure" philosophy, meaning that it can be deployed by end users with little or no system administration support. Finally, access to multiplierz functionality is provided via high-level Python scripts, resulting in a fully extensible data analytic environment for rapid development of custom algorithms and deployment of high-throughput data pipelines. CONCLUSION. Collectively, mzAPI and multiplierz facilitate a wide range of data analysis tasks, spanning technology development to biological annotation, for mass spectrometry-based proteomics research.Dana-Farber Cancer Institute; National Human Genome Research Institute (P50HG004233); National Science Foundation Integrative Graduate Education and Research Traineeship grant (DGE-0654108

Crossref

Boston University Institutional Repository (OpenBU)

Springer - Publisher Connector

PubMed Central

Imaging Informatics and the Human Brain Project: the Role of Structure, Review

Author: Brinkley James F
Rosse Cornelius
Publication venue
Publication date: 01/01/2002
Field of study

(no abstract

University of Washington Structural Informatics Group Publications

Visual parameter optimisation for biomedical image processing

Author: AC Ruifrok
AE Carpenter
AJ Pretorius
AJ Pretorius
BB Bederson
C Onwubiko
H Kang
H Piringer
J LeBlanc
J Marks
JA Hartigan
K Matkovic
KL Ma
L Tweedie
L Tweedie
M Beham
M Booshehrian
M Meyer
M Sedlmair
M Sedlmair
M Weber
N Ansari
P McLachlan
R Rao
RA Ruddle
S Bergner
S Bruckner
S Steger
SP Callahan
T Schultz
T Torsney-Weir
T Von Landesberger
T Von Landesberger
T Walter
TJ Jankun-Kelly
W Berger
WA Link
Y Zhou
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2015
Field of study

Background: Biomedical image processing methods require users to optimise input parameters to ensure high quality output. This presents two challenges. First, it is difficult to optimise multiple input parameters for multiple input images. Second, it is difficult to achieve an understanding of underlying algorithms, in particular, relationships between input and output. Results: We present a visualisation method that transforms users’ ability to understand algorithm behaviour by integrating input and output, and by supporting exploration of their relationships. We discuss its application to a colour deconvolution technique for stained histology images and show how it enabled a domain expert to identify suitable parameter values for the deconvolution of two types of images, and metrics to quantify deconvolution performance. It also enabled a breakthrough in understanding by invalidating an underlying assumption about the algorithm. Conclusions: The visualisation method presented here provides analysis capability for multiple inputs and outputs in biomedical image processing that is not supported by previous analysis software. The analysis supported by our method is not feasible with conventional trial-and-error approaches

CLoK

Crossref

Springer - Publisher Connector

PubMed Central

White Rose Research Online