60 research outputs found
Reasoning over Taxonomic Change: Exploring Alignments for the Perelleschus Use Case
Classifications and phylogenetic inferences of organismal groups change in
light of new insights. Over time these changes can result in an imperfect
tracking of taxonomic perspectives through the re-/use of Code-compliant or
informal names. To mitigate these limitations, we introduce a novel approach
for aligning taxonomies through the interaction of human experts and logic
reasoners. We explore the performance of this approach with the Perelleschus
use case of Franz & Cardona-Duque (2013). The use case includes six taxonomies
published from 1936 to 2013, 54 taxonomic concepts (i.e., circumscriptions of
names individuated according to their respective source publications), and 75
expert-asserted Region Connection Calculus articulations (e.g., congruence,
proper inclusion, overlap, or exclusion). An Open Source reasoning toolkit is
used to analyze 13 paired Perelleschus taxonomy alignments under heterogeneous
constraints and interpretations. The reasoning workflow optimizes the logical
consistency and expressiveness of the input and infers the set of maximally
informative relations among the entailed taxonomic concepts. The latter are
then used to produce merge visualizations that represent all congruent and
non-congruent taxonomic elements among the aligned input trees. In this small
use case with 6-53 input concepts per alignment, the information gained through
the reasoning process is on average one order of magnitude greater than in the
input. The approach offers scalable solutions for tracking provenance among
succeeding taxonomic perspectives that may have differential biases in naming
conventions, phylogenetic resolution, ingroup and outgroup sampling, or
ostensive (member-referencing) versus intensional (property-referencing)
concepts and articulations.Comment: 30 pages, 16 figure
The PBase Scientific Workflow Provenance Repository
Scientific workflows and their supporting systems are becoming increasingly popular for compute-intensive and data-intensive scientific experiments. The advantages scientific workflows offer include rapid and easy workflow design, software and data reuse, scalable execution, sharing and collaboration, and other advantages that altogether facilitate “reproducible science”. In this context, provenance – information about the origin, context, derivation, ownership, or history of some artifact – plays a key role, since scientists are interested in examining and auditing the results of scientific experiments. However, in order to perform such analyses on scientific results as part of extended research collaborations, an adequate environment and tools are required. Concretely, the need arises for a repository that will facilitate the sharing of scientific workflows and their associated execution traces in an interoperable manner, also enabling querying and visualization. Furthermore, such functionality should be supported while taking performance and scalability into account. With this purpose in mind, we introduce PBase: a scientific workflow provenance repository implementing the ProvONE proposed standard, which extends the emerging W3C PROV standard for provenance data with workflow specific concepts. PBase is built on the Neo4j graph database, thus offering capabilities such as declarative and efficient querying. Our experiences demonstrate the power gained by supporting various types of queries for provenance data. In addition, PBase is equipped with a user friendly interface tailored for the visualization of scientific workflow provenance data, making the specification of queries and the interpretation of their results easier and more effective
Security and Privacy for Emerging Smart Community Infrastructures
"Las alteraciones químicas de los componentes del ADN conducen a errores en la matriz
que dirige la biosíntesis de los ácidos nucleicos y en consecuencia lleva a mutagénesis y
cancerogénesis. Un gran número de modi caciones de los ácidos nucleicos son el resultado
de la acción de químicos o radiación sobre los ácidos nucleicos. Un tipo de modi cación de
la base que probablemente es el más importante en mutagénesis, cancerogénesis y envejecimiento es la transformación oxidativa de la guanina a la 8-oxoGuanina (8-oxoG), bajo la infuencia de radiación de otros factores endogénos y exógenos.
Provenance storage, querying, and visualization in PBase
We present PBase, a repository for scientific workflows and their corresponding provenance information that facilitates the sharing of experiments among the scientific community. PBase is interoperable since it uses ProvONE, a standard provenance model for scientific workflows. Workflows and traces are stored in RDF, and with the support of SPARQL and the tree cover encoding, the repository provides a scalable infrastructure for querying the provenance data. Furthermore, through its user interface, it is possible to: visualize workflows and execution traces; visualize reachability relations within these traces; issue SPARQL queries; and visualize query results.</p
Provenance storage, querying, and visualization in PBase
We present PBase, a repository for scientific workflows and their corresponding provenance information that facilitates the sharing of experiments among the scientific community. PBase is interoperable since it uses ProvONE, a standard provenance model for scientific workflows. Workflows and traces are stored in RDF, and with the support of SPARQL and the tree cover encoding, the repository provides a scalable infrastructure for querying the provenance data. Furthermore, through its user interface, it is possible to: visualize workflows and execution traces; visualize reachability relations within these traces; issue SPARQL queries; and visualize query results.</p
figureS3-3_mir
SUPPLEMENTARY MATERIALS S2. – Set of Euler/X toolkit output Maximally Informative Relations (MIR) for the input data files provided in the Supplementary Materials S1 and for the entire Prim-UC ("primates-all.csv"). Each output file is saved in .csv format. The MIR files form the basis for analyses of name:meaning relations (Tables 3–5)
- …