3,610 research outputs found
Hypothetical Reasoning via Provenance Abstraction
Data analytics often involves hypothetical reasoning: repeatedly modifying
the data and observing the induced effect on the computation result of a
data-centric application. Previous work has shown that fine-grained data
provenance can help make such an analysis more efficient: instead of a costly
re-execution of the underlying application, hypothetical scenarios are applied
to a pre-computed provenance expression. However, storing provenance for
complex queries and large-scale data leads to a significant overhead, which is
often a barrier to the incorporation of provenance-based solutions.
To this end, we present a framework that allows to reduce provenance size.
Our approach is based on reducing the provenance granularity using user defined
abstraction trees over the provenance variables; the granularity is based on
the anticipated hypothetical scenarios. We formalize the tradeoff between
provenance size and supported granularity of the hypothetical reasoning, and
study the complexity of the resulting optimization problem, provide efficient
algorithms for tractable cases and heuristics for others. We experimentally
study the performance of our solution for various queries and abstraction
trees. Our study shows that the algorithms generally lead to substantial
speedup of hypothetical reasoning, with a reasonable loss of accuracy
Recommended from our members
Capturing Scientific Knowledge on Medical Risk Factors
In this paper, we describe a model for representing scientific knowledge of risk factors in medicine in an explicit format which enables its use for automated reasoning. The resulting model supports linking the conclusions of up-to-date clinical research with data relating to individual patients. This model, which we have implemented as an ontology-based system using Linked Data, enables the capture of risk factor knowledge and serves as a translational research tool to apply that knowledge to assist with patient treatment, lifestyle, and education. Knowledge captured using this model can be disseminated for other intelligent systems to use for a variety of purposes, for example, to explore the state of the available medical knowledge
A Memetic Analysis of a Phrase by Beethoven: Calvinian Perspectives on Similarity and Lexicon-Abstraction
This article discusses some general issues arising from the study of similarity in music, both human-conducted and computer-aided, and then progresses to a consideration of similarity relationships between patterns in a phrase by Beethoven, from the first movement of the Piano Sonata in A flat major op. 110 (1821), and various potential memetic precursors. This analysis is followed by a consideration of how the kinds of similarity identified in the Beethoven phrase might be understood in psychological/conceptual and then neurobiological terms, the latter by means of William Calvinâs Hexagonal Cloning Theory. This theory offers a mechanism for the operation of David Copeâs concept of the lexicon, conceived here as a museme allele-class. I conclude by attempting to correlate and map the various spaces within which memetic replication occurs
Requirements for Provenance on the Web
From where did this tweet originate? Was this quote from the New York Times modified? Daily, we rely on data from the Web but often it is difficult or impossible to determine where it came from or how it was produced. This lack of provenance is particularly evident when people and systems deal with Web information or with any environment where information comes from sources of varying quality. Provenance is not captured pervasively in information systems. There are major technical, social, and economic impediments that stand in the way of using provenance effectively. This paper synthesizes requirements for provenance on the Web for a number of dimensions focusing on three key aspects of provenance: the content of provenance, the management of provenance records, and the uses of provenance information. To illustrate these requirements, we use three synthesized scenarios that encompass provenance problems faced by Web users toda
Hypotheses, evidence and relationships: The HypER approach for representing scientific knowledge claims
Biological knowledge is increasingly represented as a collection of (entity-relationship-entity) triplets. These are queried, mined, appended to papers, and published. However, this representation ignores the argumentation contained within a paper and the relationships between hypotheses, claims and evidence put forth in the article. In this paper, we propose an alternate view of the research article as a network of 'hypotheses and evidence'. Our knowledge representation focuses on scientific discourse as a rhetorical activity, which leads to a different direction in the development of tools and processes for modeling this discourse. We propose to extract knowledge from the article to allow the construction of a system where a specific scientific claim is connected, through trails of meaningful relationships, to experimental evidence. We discuss some current efforts and future plans in this area
- âŚ