Search CORE

43 research outputs found

Towards Automatic Capturing of Semi-structured Process Provenance

Author: A. Misra
B. Ludascher
L. Moreau
M. Szomszor
M.D. Allen
T. Oinn
Y. Cui
Y.L. Simmhan
Y.L. Simmhan
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2013
Field of study

Extending Science Gateway Frameworks to Support Big Data Applications in the Cloud

Author: B Ludascher
Carlos Blanco
D Churches
Gabor Terstyanszky
J Dean
L Li
MC Schatz
P Kacsuk
Shashank Gugnani
T Oinn
Tamas Kiss
X Fei
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2016
Field of study

Cloud computing offers massive scalability and elasticity required by many scientific and commercial applications. Combining the computational and data handling capabilities of clouds with parallel processing also has the potential to tackle Big Data problems efficiently. Science gateway frameworks and workflow systems enable application developers to implement complex applications and make these available for end-users via simple graphical user interfaces. The integration of such frameworks with Big Data processing tools on the cloud opens new oppor-tunities for application developers. This paper investigates how workflow sys-tems and science gateways can be extended with Big Data processing capabilities. A generic approach based on infrastructure aware workflows is suggested and a proof of concept is implemented based on the WS-PGRADE/gUSE science gateway framework and its integration with the Hadoop parallel data processing solution based on the MapReduce paradigm in the cloud. The provided analysis demonstrates that the methods described to integrate Big Data processing with workflows and science gateways work well in different cloud infrastructures and application scenarios, and can be used to create massively parallel applications for scientific analysis of Big Data

Crossref

UCrea

Springer - Publisher Connector

WestminsterResearch

Using Workflows to Explore and Optimise Named Entity Recognition for Chemistry

Author: A Copestake
A Tiwari
Apache
B Florian
B Ludascher
B Mellebeek
B Muller
BalaKrishna Kolluru
C Kolarik
C Kolrik
C Nobata
C Steinbeck
CJ Rupp
CJ Rupp
D Banville
D Ferrucci
D Jiao
I Taylor
J Shon
J Wren
JA Townsend
Junichi Tsujii
K Hettne
K Hettne
Lezan Hawizy
M Hassan
N Kemp
P Corbett
P Corbett
P Murray-Rust
P Murray-Rust
Peter Murray-Rust
R Klinger
R Klinger
SG Vellay
Sophia Ananiadou
T Kuhn
T Kuhn
T Oinn
Tim J. Hubbard
WJ Wilbur
Y Kano
Y Kano
Y Kano
Y Kano
Y Miyao
Y Tsuruoka
Y Tsuruoka
Publication venue: Public Library of Science
Publication date: 01/01/2011
Field of study

Chemistry text mining tools should be interoperable and adaptable regardless of system-level implementation, installation or even programming issues. We aim to abstract the functionality of these tools from the underlying implementation via reconfigurable workflows for automatically identifying chemical names. To achieve this, we refactored an established named entity recogniser (in the chemistry domain), OSCAR and studied the impact of each component on the net performance. We developed two reconfigurable workflows from OSCAR using an interoperable text mining framework, U-Compare. These workflows can be altered using the drag-&-drop mechanism of the graphical user interface of U-Compare. These workflows also provide a platform to study the relationship between text mining components such as tokenisation and named entity recognition (using maximum entropy Markov model (MEMM) and pattern recognition based classifiers). Results indicate that, for chemistry in particular, eliminating noise generated by tokenisation techniques lead to a slightly better performance than others, in terms of named entity recognition (NER) accuracy. Poor tokenisation translates into poorer input to the classifier components which in turn leads to an increase in Type I or Type II errors, thus, lowering the overall performance. On the Sciborg corpus, the workflow based system, which uses a new tokeniser whilst retaining the same MEMM component, increases the F-score from 82.35% to 84.44%. On the PubMed corpus, it recorded an F-score of 84.84% as against 84.23% by OSCAR

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

The University of Manchester - Institutional Repository

Osiris: accessible and reproducible phylogenetic and phylogenomic analyses within the Galaxy workflow management system

Author: A Loytynoja
A Stamatakis
AJ Drummond
B Giardine
B Ludascher
B Misof
Celia K C Churchill
CO Webb
D Darriba
D Posada
DG MacArthur
DP Faith
E Afgan
E Afgan
E Lord
ELL Sonnhammer
F Abascal
F Nardi
G Talavera
H Shimodaira
I Ebersberger
I Letunic
J Evans
K Katoh
K Tamura
Karl B Lopker
L Liu
L Liu
L Liu
LS Kubatko
M Abouelhoda
M Sabrina Pankey
Markos A Alexandrou
MV Han
NP Brown
O Sakarya
P Kuck
RA Vos
RC Edgar
RD Finn
Roger Ngo
SA Berger
SA Smith
SV Edwards
T Oinn
TH Oakley
Todd H Oakley
William Chen
WP Maddison
WP Maddison
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Non-suicidal Self-Injury in Adolescence

Author: AL Barrocas
AM Brausch
American Psychiatric Association
B Stanley
BJ Casey
BL Hankin
BL Hankin
C Reichl
C Schmahl
CA Hamza
CM Jacobson
D Leo De
D Nitkowski
D Ougrin
Deutsche Gesellschaft für Kinder- und Jugendpsychiatrie PuP
E Osuch
ED Klonsky
EE Lloyd-Richardson
G Fischer
HC Wilcox
JC Franklin
JJ Muehlenkamp
JJ Muehlenkamp
JJ Washburn
K Bentley
K Bresin
K Hawton
K Thomassin
KL Gratz
KR Fox
KR Fox
L Bowes
L Mehlum
LM Taylor
M Frost
M Kaess
M Kaess
M Zetterqvist
M Zetterqvist
MK Nock
MK Nock
MS Andover
MS Andover
N Kapur
O Nakar
P Ludascher
P Ludascher
P Moran
P Plener
P Wilkinson
Paul L. Plener
PL Plener
PL Plener
PL Plener
R Brunner
R Carroll
R Maniglio
R Young
RC Groschwitz
Rebecca C. Brown
S Jarvi
S Reitz
S Ross
SP Lewis
SP Lewis
SP Lewis
SP Lewis
ST Lereya
SV Swannell
T In-Albon
T Tschan
TI Rossouw
TM Yates
TP Beauchaine
UM Nater
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Provenance-based searching and ranking for scientific workflows

Author: Cuevas-Vicenttin V
Ludascher B
Missier P
Publication venue: Springer Verlag
Publication date: 01/01/2015
Field of study

Newcastle University E-Prints

Framework for Workflow Parallel Execution in Grid Environment

Author: B. Ludascher
I. Foster
L. Huang
O. Bunin
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2007
Field of study

Crossref

eProvenance-Based Searching and Ranking for Scientific Workflows

Author: Cuevas-Vicenttin V
Ludascher B
Missier P
Publication venue: 'Springer Fachmedien Wiesbaden GmbH'
Publication date
Field of study

Newcastle University E-Prints

ESM Workflow

Author: B Ludascher
C Larsson
R Sessions
R. Dunlap
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

A Metadata Integration Assistant Generator for Heterogeneous Distributed Databases

Author: A. Farquhar
B. Ludascher
C. Parent
D. Chamberlin
G. Wiederhold
J. D. Ullman
Publication venue: Springer
Publication date: 01/01/2002
Field of study

Abstract. This paper describes a metadata interchange approach for semi-automated integration of heterogeneous distributed databases. Our system prototype uses distributed metadata to generate a GUI tool for a meta-user (who does the metadata integration) to describe mappings between master and local databases by assigning index numbers and specifying conversion function names; the system uses Quilt as its XML query language. A DDXMI (for Distributed Database XML Metadata Interface) file is generated based on the mappings, and is used to translate queries over the virtual master database into sub-queries to local databases. An experiment testing feasibility is reported in which 3 different bibliography databases are integrated.

CiteSeerX

Crossref