55 research outputs found
An Operational Approach to Consistent Query Answering
Consistent query answering (CQA) aims to find meaningful answersto queries when databases are inconsistent, i.e., do not conformto their specifications. Such answers must be certainly true in allrepairs, which are consistent databases whose difference from theinconsistent one is minimal, according to some measure. This taskis often computationally intractable, and much of CQA researchconcentrated on finding islands of tractability. Nevertheless, thereare many relevant queries for which no efficient solutions exist,which is reflected by the limited practical applicability of the CQAapproach. To remedy this, one needs to devise a new CQA framework that provides explicit guarantees on the quality of queryanswers. However, the standard notions of repair and certain answers are too coarse to permit more elaborate schemes of queryanswering. Our goal is to provide a new framework for CQA basedon revised definitions of repairs and query answering that opensup the possibility of efficient approximate query answering withexplicit guarantees. The key idea is to replace the current declarative definition of a repair with anoperationalone, which explainshowa repair is constructed, and how likely it is that a consistentinstance is a repair. This allows us to define how certain we arethat a tuple should be in the answer. Using this approach, we studythe complexity of both exact and approximate CQA. Even thoughsome of the problems remain hard, for many common classes ofconstraints we can provide meaningful answers in reasonable time,for queries going far beyond the standard CQA approach
Counting Database Repairs under Primary Keys Revisited
Consistent query answering (CQA) aims to deliver meaningful answers when queries are evaluated over inconsistent databases. Such answers must be certainly true in all repairs, which are consistent databases whose difference from the inconsistent one is somehow minimal. An interesting task in this context is to count the number of repairs that entail the query. This problem has been already studied for conjunctive queries and primary keys; we know that it is #P-complete in data complexity under polynomial-time Turing reductions (a.k.a. Cook reductions). However, as it has been already observed in the literature of counting complexity, there are problems that are ''hard-to-count-easy-to-decide'', which cannot be complete (under reasonable assumptions) for #P under weaker reductions, and, in particular, under standard many-one logspace reductions (a.k.a. parsimonious reductions). For such ''hard-to-count-easy-to-decide'' problems, a crucial question is whether we can determine their exact complexity by looking for subclasses of #P to which they belong. Ideally, we would like to show that such a problem is complete for a subclass of #P under many-one logspace reductions. The main goal of this work is to perform such a refined analysis for the problem of counting the number of repairs under primary keys that entail the query
Detecting Decidable Classes of Finitely Ground Logic Programs with Function Symbols
In this article, we propose a new technique for checking whether the bottom-up evaluation of logic programs with function symbols terminates. The technique is based on the definition of
mappings
from arguments to strings of function symbols, representing possible values which could be taken by arguments during the bottom-up evaluation. Starting from mappings, we identify
mapping-restricted
arguments, a subset of limited arguments, namely arguments that take values from finite domains. Mapping-restricted programs, consisting of rules whose arguments are all mapping restricted, are terminating under the bottom-up computation, as all of its arguments take values from finite domains. We show that mappings can be computed by transforming the original program into a unary logic program: this allows us to establish decidability of checking if a program is mapping restricted. We study the complexity of the presented approach and compare it to other techniques known in the literature. We also introduce an extension of the proposed approach that is able to recognize a wider class of logic programs. The presented technique provides a significant improvement, as it can detect terminating programs not identified by other criteria proposed so far. Furthermore, it can be combined with other techniques to further enlarge the class of programs recognized as terminating under the bottom-up evaluation.
</jats:p
Chase Termination for Guarded Existential Rules
The chase procedure is considered as one of the most fundamental algorithmic tools in database theory. It has been successfully applied to different database problems such as data exchange, and query answering and containment under constraints, to name a few. One of the central problems regarding the chase procedure is all-instance termination, that is, given a set of tuple-generating dependencies (TGDs) (a.k.a. existential rules), decide whether the chase under that set terminates, for every input database. It is well-known that this problem is undecidable, no matter which version of the chase we consider. The crucial question that comes up is whether existing restricted classes of TGDs, proposed in different contexts such as ontological reasoning, make the above problem decidable. In this work, we focus our attention on the oblivious and the semi-oblivious versions of the chase procedure, and we give a positive answer for classes of TGDs that are based on the notion of guardedness
Exploiting Equality Generating Dependencies in Checking Chase Termination
The chase is a well-known algorithm with a wide range of applications in data exchange, data cleaning, data integration, query optimization, and ontological reasoning. Since the chase evaluation might not terminate and it is undecidablewhether it terminates, the problem of defining (decidable) sufficient conditions ensuring termination has received a great deal of interest in recent years. In this regard, several termination criteria have been proposed. One of the mainweaknesses of current approaches is the limited analysis they perform on equality generating dependencies (EGDs). In this paper, we propose sufficient conditions ensuring that a set of dependencies has at least one terminating chase sequence. We propose novel criteria which are able to perform a more accurate analysis of EGDs. Specifically, we propose a new stratification criterion and an adornment algorithm. The latter can both be used as a termination criterion and be combined with current techniques to make them more effective, in that strictly more sets of dependencies are identified. Our techniques identify sets of dependencies thatare not recognized by any of the current criteria.<br/
Querying Data Exchange Settings Beyond Positive Queries
Data exchange, the problem of transferring data from a source schema to a
target schema, has been studied for several years.
The semantics of answering positive queries over the target schema has been
defined in early work, but little attention has been paid to more general
queries. A few proposals of semantics for more general queries exist but they
either do not properly extend the standard semantics under positive queries,
giving rise to counterintuitive answers, or they make query answering
undecidable even for the most important data exchange settings, e.g., with
weakly-acyclic dependencies.
The goal of this paper is to provide a new semantics for data exchange that
is able to deal with general queries. At the same time, we want our semantics
to coincide with the classical one when focusing on positive queries, and to
not trade-off too much in terms of complexity of query answering. We show that
query answering is undecidable in general under the new semantics, but it is
\co\NP\complete when the dependencies are weakly-acyclic.
Moreover, in the latter case, we show that exact answers under our semantics
can be computed by means of logic programs with choice, thus exploiting
existing efficient systems. For more efficient computations, we also show that
our semantics allows for the construction of a representative target instance,
similar in spirit to a universal solution, that can be exploited for computing
approximate answers in polynomial time. Under consideration in Theory and
Practice of Logic Programming (TPLP).Comment: Under consideration in Theory and Practice of Logic Programming
(TPLP
- …
