33 research outputs found

Extending Science Gateway Frameworks to Support Big Data Applications in the Cloud

Author: B Ludascher
Carlos Blanco
D Churches
Gabor Terstyanszky
J Dean
L Li
MC Schatz
P Kacsuk
Shashank Gugnani
T Oinn
Tamas Kiss
X Fei
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/2016

Cloud computing offers massive scalability and elasticity required by many scientific and commercial applications. Combining the computational and data handling capabilities of clouds with parallel processing also has the potential to tackle Big Data problems efficiently. Science gateway frameworks and workflow systems enable application developers to implement complex applications and make these available for end-users via simple graphical user interfaces. The integration of such frameworks with Big Data processing tools on the cloud opens new opportunities for application developers. This paper investigates how workflow systems and science gateways can be extended with Big Data processing capabilities. A generic approach based on infrastructure-aware workflows is suggested and a proof of concept is implemented based on the WS-PGRADE/gUSE science gateway framework and its integration with the Hadoop parallel data processing solution based on the MapReduce paradigm in the cloud. The provided analysis demonstrates that the methods described to integrate Big Data processing with workflows and science gateways work well in different cloud infrastructures and application scenarios, and can be used to create massively parallel applications for scientific analysis of Big Data.

Crossref

UCrea

Springer - Publisher Connector

WestminsterResearch

Using Workflows to Explore and Optimise Named Entity Recognition for Chemistry

Author: A Copestake
A Tiwari
Apache
B Florian
B Ludascher
B Mellebeek
B Muller
BalaKrishna Kolluru
C Kolarik
C Kolrik
C Nobata
C Steinbeck
CJ Rupp
CJ Rupp
D Banville
D Ferrucci
D Jiao
I Taylor
J Shon
J Wren
JA Townsend
Junichi Tsujii
K Hettne
K Hettne
Lezan Hawizy
M Hassan
N Kemp
P Corbett
P Corbett
P Murray-Rust
P Murray-Rust
Peter Murray-Rust
R Klinger
R Klinger
SG Vellay
Sophia Ananiadou
T Kuhn
T Kuhn
T Oinn
Tim J. Hubbard
WJ Wilbur
Y Kano
Y Kano
Y Kano
Y Kano
Y Miyao
Y Tsuruoka
Y Tsuruoka
Publication venue: Public Library of Science
Publication date: 01/01/2011

Chemistry text mining tools should be interoperable and adaptable regardless of system-level implementation, installation or even programming issues. We aim to abstract the functionality of these tools from the underlying implementation via reconfigurable workflows for automatically identifying chemical names. To achieve this, we refactored an established named entity recogniser (in the chemistry domain), OSCAR, and studied the impact of each component on the net performance. We developed two reconfigurable workflows from OSCAR using an interoperable text mining framework, U-Compare. These workflows can be altered using the drag-&-drop mechanism of the graphical user interface of U-Compare. These workflows also provide a platform to study the relationship between text mining components such as tokenisation and named entity recognition (using maximum entropy Markov model (MEMM) and pattern recognition based classifiers). Results indicate that, for chemistry in particular, eliminating noise generated by tokenisation techniques leads to a slightly better performance than others, in terms of named entity recognition (NER) accuracy. Poor tokenisation translates into poorer input to the classifier components, which in turn leads to an increase in Type I or Type II errors, thus lowering the overall performance. On the Sciborg corpus, the workflow-based system, which uses a new tokeniser whilst retaining the same MEMM component, increases the F-score from 82.35% to 84.44%. On the PubMed corpus, it recorded an F-score of 84.84% as against 84.23% by OSCAR.

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

The University of Manchester - Institutional Repository

Osiris: accessible and reproducible phylogenetic and phylogenomic analyses within the Galaxy workflow management system

Author: A Loytynoja
A Stamatakis
AJ Drummond
B Giardine
B Ludascher
B Misof
Celia K C Churchill
CO Webb
D Darriba
D Posada
DG MacArthur
DP Faith
E Afgan
E Afgan
E Lord
ELL Sonnhammer
F Abascal
F Nardi
G Talavera
H Shimodaira
I Ebersberger
I Letunic
J Evans
K Katoh
K Tamura
Karl B Lopker
L Liu
L Liu
L Liu
LS Kubatko
M Abouelhoda
M Sabrina Pankey
Markos A Alexandrou
MV Han
NP Brown
O Sakarya
P Kuck
RA Vos
RC Edgar
RD Finn
Roger Ngo
SA Berger
SA Smith
SV Edwards
T Oinn
TH Oakley
Todd H Oakley
William Chen
WP Maddison
WP Maddison
Publication venue: Springer Science and Business Media LLC

Crossref

Non-suicidal Self-Injury in Adolescence

Author: AL Barrocas
AM Brausch
American Psychiatric Association
B Stanley
BJ Casey
BL Hankin
BL Hankin
C Reichl
C Schmahl
CA Hamza
CM Jacobson
D Leo De
D Nitkowski
D Ougrin
Deutsche Gesellschaft für Kinder- und Jugendpsychiatrie PuP
E Osuch
ED Klonsky
EE Lloyd-Richardson
G Fischer
HC Wilcox
JC Franklin
JJ Muehlenkamp
JJ Muehlenkamp
JJ Washburn
K Bentley
K Bresin
K Hawton
K Thomassin
KL Gratz
KR Fox
KR Fox
L Bowes
L Mehlum
LM Taylor
M Frost
M Kaess
M Kaess
M Zetterqvist
M Zetterqvist
MK Nock
MK Nock
MS Andover
MS Andover
N Kapur
O Nakar
P Ludascher
P Ludascher
P Moran
P Plener
P Wilkinson
Paul L. Plener
PL Plener
PL Plener
PL Plener
R Brunner
R Carroll
R Maniglio
R Young
RC Groschwitz
Rebecca C. Brown
S Jarvi
S Reitz
S Ross
SP Lewis
SP Lewis
SP Lewis
SP Lewis
ST Lereya
SV Swannell
T In-Albon
T Tschan
TI Rossouw
TM Yates
TP Beauchaine
UM Nater
Publication venue: Springer Science and Business Media LLC

Crossref

Provenance-based searching and ranking for scientific workflows

Author: Cuevas-Vicenttin V
Ludascher B
Missier P
Publication venue: Springer Verlag
Publication date: 01/01/2015

Newcastle University E-Prints

Provenance-Based Searching and Ranking for Scientific Workflows

Author: Cuevas-Vicenttin V
Ludascher B
Missier P
Publication venue: Springer Fachmedien Wiesbaden GmbH

Newcastle University E-Prints

A Dataflow-Oriented Atomicity and Provenance System for Pipelined Scientific Workflows

Author: B. Ludascher
F. Leymann
H. Garcia-Molina
J. Yu
P. Buneman
P.A. Bernstein
S. Bowers
W. Derks
Publication date: 01/01/2007

Abstract. Scientific workflows have gained great momentum in recent years due to their critical roles in e-Science and cyberinfrastructure applications. However, some tasks of a scientific workflow might fail during execution. A domain scientist might require a region of a scientific workflow to be “atomic”. Data provenance, which determines the source data that are used to produce a data item, is also essential to scientific workflows. In this paper, we propose: (i) an architecture for scientific workflow management systems that supports both provenance and atomicity; (ii) a dataflow-oriented atomicity model that supports the notions of commit and abort; and (iii) a dataflow-oriented provenance model that, in addition to supporting existing provenance graphs and queries, also supports queries related to atomicity and failure.

CiteSeerX

Crossref

Controlling an Iteration-Wise Coherence in Dataflow

Author: B. Ludascher
D. Clarke
E. Pignotti
I. Taylor
L. Wang
P. Velasco Elizondo
S. Limet
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/2012

Crossref

Provenance Storage, Querying, and Visualization in PBase

Author: Chirigati F
Cuevas-Vicenttin V
Dey S
Kianmajd P
Koop D
Ludascher B
Missier P
Wei YX
Publication venue: Springer Fachmedien Wiesbaden GmbH

Newcastle University E-Prints

Coherence and Performance for Interactive Scientific Visualization Applications

Author: B. Hess
B. Ludascher
D. Barseghian
D. Hull
I. Taylor
P. Velasco Elizondo
S.P. Callahan
Z. Zhao
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/2011

Crossref