Search CORE

92 research outputs found

Index-driven XML data integration to support functional genomics

Author: Hunt E.
Pafilis E.
Tulloch I.
Wilson J.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2002
Field of study

We identify a new type of data integration problem that arises in functional genomics research in the context of large-scale experiments involving arrays, 2-dimensional protein gels and mass-spectrometry. We explore the current practice of data analysis that involves repeated web queries iterating over long lists of gene or protein names. We postulate a new approach to solve this problem, applicable to data sets stored in XML format. We propose to discover data redundancies using an XML index we construct and to remove them from the results returned by the query. We combine XML indexing with queries carried out on top of relational tables. We believe our approach could support semi-automated data integration such as that required in the interpretation of large-scale biological experiments

Crossref

University of Strathclyde Institutional Repository

Enlighten

OnTheFly: a tool for automated document-based text annotation, data linking and network generation

Author: E. Pafilis
G. A. Pavlopoulos
M. Kuhn
Pitre
R. Schneider
S. D. Hooper
Publication venue: Oxford University Press
Publication date: 01/01/2009
Field of study

OnTheFly is a web-based application that applies biological named entity recognition to enrich Microsoft Office, PDF and plain text documents. The input files are converted into the HTML format and then sent to the Reflect tagging server, which highlights biological entity names like genes, proteins and chemicals, and attaches to them JavaScript code to invoke a summary pop-up window. The window provides an overview of relevant information about the entity, such as a protein description, the domain composition, a link to the 3D structure and links to other relevant online resources. OnTheFly is also able to extract the bioentities mentioned in a set of files and to produce a graphical representation of the networks of the known and predicted associations of these entities by retrieving the information from the STITCH database

Crossref

PubMed Central

Open Repository and Bibliography - Luxembourg

Value, but high costs in post-deposition data curation

Author: Amid C.
Bravakos P.
Buttigieg P.
Cerdeño-Tárraga A.
Cochrane G.
Gibson R.
Kahlke T.
Legaki A.
Narayana Murthy K.
Pafilis E.
Papastefanou G.
Pereira E.
Rossello M.
ten Hoopen P.
Toribio A.
Publication venue
Publication date: 09/02/2016
Field of study

Discoverability of sequence data in primary data archives is proportional to the richness of contextual information associated with the data. Here, we describe an exercise in the improvement of contextual information surrounding sample records associated with metagenomics sequence reads available in the European Nucleotide Archive. We outline the annotation process and summarize findings of this effort aimed at increasing usability of publicly available environmental data. Furthermore, we emphasize the benefits of such an exercise and detail its costs. We conclude that such a third party annotation approach is expensive and has value as an element of curation, but should form only part of a more sustainable submitter-driven approach

MPG.PuRe

Value, but high costs in post-deposition data Curation

Author: Amid C
Bravakos P
Buttigieg PL
Cochrane G
Gibson R
Hoopen PT
Kahlke T
Legaki A
Murthy KN
O-Tárraga AMC
Pafilis E
Papastefanou G
Pereira E
Rossello M
Toribio AL
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/01/2016
Field of study

© The Author(s) 2016. Published by Oxford University Press. Discoverability of sequence data in primary data archives is proportional to the richness of contextual information associated with the data. Here, we describe an exercise in the improvement of contextual information surrounding sample records associated with metagenomics sequence reads available in the European Nucleotide Archive. We outline the annotation process and summarize findings of this effort aimed at increasing usability of publicly available environmental data. Furthermore, we emphasize the benefits of such an exercise and detail its costs. We conclude that such a third party annotation approach is expensive and has value as an element of curation, but should form only part of a more sustainable submitter-driven approach

Crossref

OPUS - University of Technology Sydney

PubMed Central

Electronic Publication Information Center

MPG.PuRe

The environment ontology in 2016: bridging domains with increased scope, semantic density, and interoperation

Author: Buttigieg Pier Luigi
Lewis Suzanna E.
Mungall Christopher J.
Pafilis Evangelos
Schildhauer Mark P.
Walls Ramona L.
Publication venue
Publication date: 01/01/2016
Field of study

Background The Environment Ontology (ENVO; http://www.environmentontology.org/), first described in 2013, is a resource and research target for the semantically controlled description of environmental entities. The ontology's initial aim was the representation of the biomes, environmental features, and environmental materials pertinent to genomic and microbiome-related investigations. However, the need for environmental semantics is common to a multitude of fields, and ENVO's use has steadily grown since its initial description. We have thus expanded, enhanced, and generalised the ontology to support its increasingly diverse applications. Methods We have updated our development suite to promote expressivity, consistency, and speed: we now develop ENVO in the Web Ontology Language (OWL) and employ templating methods to accelerate class creation. We have also taken steps to better align ENVO with the Open Biological and Biomedical Ontologies (OBO) Foundry principles and interoperate with existing OBO ontologies. Further, we applied text-mining approaches to extract habitat information from the Encyclopedia of Life and automatically create experimental habitat classes within ENVO. Results Relative to its state in 2013, ENVO's content, scope, and implementation have been enhanced and much of its existing content revised for improved semantic representation. ENVO now offers representations of habitats, environmental processes, anthropogenic environments, and entities relevant to environmental health initiatives and the global Sustainable Development Agenda for 2030. Several branches of ENVO have been used to incubate and seed new ontologies in previously unrepresented domains such as food and agronomy. The current release version of the ontology, in OWL format, is available at http://purl.obolibrary.org/obo/envo.owl. Conclusions ENVO has been shaped into an ontology which bridges multiple domains including biomedicine, natural and anthropogenic ecology, ‘omics, and socioeconomic development. Through continued interactions with our users and partners, particularly those performing data archiving and sythesis, we anticipate that ENVO’s growth will accelerate in 2017. As always, we invite further contributions and collaboration to advance the semantic representation of the environment, ranging from geographic features and environmental materials, across habitats and ecosystems, to everyday objects in household settings

Crossref

ZENODO

Springer - Publisher Connector

PubMed Central

Electronic Publication Information Center

eScholarship - University of California

NEUROSURGERY ENTHUSIASTIC WOMEN SOCIETY

MPG.PuRe

Research Infrastructures offer capacity to address scientific questions never attempted before: Are all taxa equal?

Author: Appeltans W.
Arvanitidis C.
Bailly N.
Chatzigeorgiou G.
Chatzinikolaou E.
Filiopoulou I.
Gerovasileiou V.
Gougousis A.
Hernandez F.
Keklikoglou K.
Michalakis N.
Oulas A.
Pafilis E.
Panteri E.
Patkos T.
Pavloudi C.
Somerfield P.J.
Vandepitte L.
Vanhoorne B.
Warwick R.M.
Publication venue
Publication date: 01/01/2018
Field of study

Open Marine Archive

Semantic text mining support for lignocellulose research

Author: A Demirbas
A Doms
Adrian Tsang
BC Saha
C Görg
C Murphy
Caitlin Murphy
DC Bernard
E Pafilis
EW Sayers
Greg Butler
H Cunningham
HM Müller
I Jovanovic
Ingo Morgenstern
International Union of Biochemistry and Molecular Biology
Justin Powlowski
K Bontcheva
M Scheer
Marie-Jean Meurs
N Naderi
N Okazaki
R Hoffmann
R Witte
R Witte
R Witte
René Witte
S Ananiadou
S Bringezu
S Federhen
UniProt Consortium
Y Yamamoto
Publication venue: BioMed Central
Publication date: 01/01/2012
Field of study

Biofuels produced from biomass are considered to be promising sustainable alternatives to fossil fuels. The conversion of lignocellulose into fermentable sugars for biofuels production requires the use of enzyme cocktails that can efficiently and economically hydrolyze lignocellulosic biomass. As many fungi naturally break down lignocellulose, the identification and characterization of the enzymes involved is a key challenge in the research and development of biomass-derived products and fuels. One approach to meeting this challenge is to mine the rapidly-expanding repertoire of microbial genomes for enzymes with the appropriate catalytic properties. Semantic technologies, including natural language processing, ontologies, semantic Web services and Web-based collaboration tools, promise to support users in handling complex data, thereby facilitating knowledge-intensive tasks. An ongoing challenge is to select the appropriate technologies and combine them in a coherent system that brings measurable improvements to the users. We present our ongoing development of a semantic infrastructure in support of genomics-based lignocellulose research. Part of this effort is the automated curation of knowledge from information on fungal enzymes that is available in the literature and genome resources. Working closely with fungal biology researchers who manually curate the existing literature, we developed ontological natural language processing pipelines integrated in a Web-based interface to assist them in two main tasks: mining the literature for relevant knowledge, and at the same time providing rich and semantically linked information

Crossref

Springer - Publisher Connector

Concordia University Research Repository

PubMed Central

Entity recognition in the biomedical domain using a hybrid approach

Author: A Tharatipyakul
C Funk
CD Paice
CS Funk
D Campos
D Koning
D Maglott
D Szklarczyk
DM Jessop
E Pafilis
E Tseytlin
F Rinaldi
F Rinaldi
F Rinaldi
F Rinaldi
G Sheikhshab
K Degtyarenko
K Eilbeck
K Verspoor
K Verspoor
M Ashburner
M Bada
M Basaldella
M Basaldella
MF Porter
N Pudota
P Lopez
PD Turney
R Core Team
R Leaman
R Leaman
S Aubin
S Eltyeb
S Tulkens
SA Akhondi
T Groza
T Munkhdalai
U Leser
Y Sasaki
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

A decadal view of biodiversity informatics: challenges and priorities

Author: Addink W.
Aelterman B.
Agosti D.
Amaral-Zettler L.
Ariño A.H.
Arvanitidis C.
Backeljau T.
Bailly N.
Belbin L.
Berendsohn W.
Bertrand N.
Caithness N.
Campbell D.
Cochrane G.
Conruyt N.
Culham A.
Damgaard C.
Davies N.
de Jong Y.
Fady B.
Faulwetter S.
Feest A.
Field D.
Garnier E.
Geser G.
Gilbert J.
Grosche B.
Grosser D.
Hardisty A.
Herbinet B.
Hobern D.
Jones A.
King D.
Knapp S.
Koivula H.
Los W.
Meyer C.
Morris R.A.
Morrison N.
Morse D.
Obst M.
Pafilis E.
Page L.M.
Page R.
Pape T.
Parr C.
Paton A.
Patterson D.
Roberts D.
van Tienderen P.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2013
Field of study

International Migration, Integration and Social Cohesion online publications

Impact of Tail Loss on the Behaviour and Locomotor Performance of Two Sympatric Lampropholis Skink Species

Author: A Kutt
Adam Stow
AR Clause
BE Dial
C Lever
C Prosser
DAA Bellairs
David G. Chapple
DG Chapple
DG Chapple
DG Chapple
DG Chapple
DG Chapple
DG Chapple
DG Chapple
DG Chapple
DRJ Formanowicz
E Ottoni
EN Arnold
F Kraus
F Punzo
Gillian L. Cromie
J Martin
J Martin
J Martin
J Martin
J Martin
JB Losos
JMP Joss
MJ Angilletta
P Doughty
P Pafilis
PW Bateman
R Shine
RE Ballinger
RM Goodman
S Downes
S Wilson
T Hamley
TE Higham
TL Maginnis
WE Cooper
WE Cooper
WE Cooper
ZH Lin
ZH Lin
Publication venue: Public Library of Science
Publication date: 01/01/2012
Field of study

Caudal autotomy is an anti-predator behaviour that is used by many lizard species. Although there is an immediate survival benefit, the subsequent absence of the tail may inhibit locomotor performance, alter activity and habitat use, and increase the individuals' susceptibility to future predation attempts. We used laboratory experiments to examine the impact of tail autotomy on locomotor performance, activity and basking site selection in two lizard species, the delicate skink (Lampropholis delicata) and garden skink (L. guichenoti), that occur sympatrically throughout southeastern Australia and are exposed to an identical suite of potential predators. Post-autotomy tail movement did not differ between the two Lampropholis species, although a positive relationship between the shed tail length and distance moved, but not the duration of movement, was observed. Tail autotomy resulted in a substantial decrease in sprint speed in both species (28–39%), although this impact was limited to the optimal performance temperature (30°C). Although L. delicata was more active than L. guichenoti, tail autotomy resulted in decreased activity in both species. Sheltered basking sites were preferred over open sites by both Lampropholis species, although this preference was stronger in L. delicata. Caudal autotomy did not alter the basking site preferences of either species. Thus, both Lampropholis species had similar behavioural responses to autotomy. Our study also indicates that the impact of tail loss on locomotor performance may be temperature-dependent and highlights that future studies should be conducted over a broad thermal range

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central