19 research outputs found

    A Probabilistic Data Model and Its Semantics

    Full text link
    As database systems are increasingly being used in advanced applications, it is becoming common that data in these applications contain some elements of uncertainty. These arise from many factors, such as measurement errors and cognitive errors. As such, many researchers have focused on defining comprehensive uncertainty data models of uncertainty database systems. However, existing uncertainty data models do not adequately support some applications. Moreover, very few works address uncertainty tuple calculus. In this paper we advocate a probabilistic data model for representing uncertain information. In particular, we establish a probabilistic tuple calculus language and its semantics to meet the corresponding probabilistic relational algebra

    Paraconsistent logic and query answering in inconsistent databases

    Full text link
    This paper concerns the paraconsistent logic LPQ⊃,F^{\supset,\mathsf{F}} and an application of it in the area of relational database theory. The notions of a relational database, a query applicable to a relational database, and a consistent answer to a query with respect to a possibly inconsistent relational database are considered from the perspective of this logic. This perspective enables among other things the definition of a consistent answer to a query with respect to a possibly inconsistent database without resort to database repairs. In a previous paper, LPQ⊃,F^{\supset,\mathsf{F}} is presented with a sequent-style natural deduction proof system. In this paper, a sequent calculus proof system is presented because it is common to use a sequent calculus proof system as the basis of proof search procedures and such procedures may form the core of algorithms for computing consistent answers to queries.Comment: 21 pages; revision of v4, some inaccuracies removed and material streamlined at several place

    Coping with Incomplete Data: Recent Advances

    Get PDF
    International audienceHandling incomplete data in a correct manner is a notoriously hard problem in databases. Theoretical approaches rely on the computationally hard notion of certain answers, while practical solutions rely on ad hoc query evaluation techniques based on threevalued logic. Can we find a middle ground, and produce correct answers efficiently? The paper surveys results of the last few years motivated by this question. We reexamine the notion of certainty itself, and show that it is much more varied than previously thought. We identify cases when certain answers can be computed efficiently and, short of that, provide deterministic and probabilistic approximation schemes for them. We look at the role of three-valued logic as used in SQL query evaluation, and discuss the correctness of the choice, as well as the necessity of such a logic for producing query answers

    Coping with Incomplete Data: Recent Advances

    Get PDF
    Handling incomplete data in a correct manner is a notoriously hard problem in databases. Theoretical approaches rely on the computationally hard notion of certain answers, while practical solutions rely on ad hoc query evaluation techniques based on three-valued logic. Can we find a middle ground, and produce correct answers efficiently? The paper surveys results of the last few years motivated by this question. We re-examine the notion of certainty itself, and show that it is much more varied than previously thought. We identify cases when certain answers can be computed efficiently and, short of that, provide deterministic and probabilistic approximation schemes for them. We look at the role of three-valued logic as used in SQL query evaluation, and discuss the correctness of the choice, as well as the necessity of such a logic for producing query answers

    Interoperability of XML and relational data-optimization algorithm

    Get PDF
    "Within the past six years, Extensible Markup Language (XML) has spread rapidly and has gained popularity in the database community with its primary focus in the design of query languages and storage methods to select data from vast amounts of XML data efficiently. In this respect, I discuss some of the research that has been done by presenting three papers that describe different approaches to querying XML documents. This thesis concentrates on the method used by Sadri and Lakshmanan in [1]: viewing an XML document as a relational database upon which the user can write simple SQL queries that can be translated into equivalent XQuery queries. Taking the output of the translation algorithm presented, I further develop an optimization algorithm meant to decrease the running time of the translated queries. I mainly focus on two aspects: the need of the distinct-values() function and the minimization of the number of variables. "--Abstract from author supplied metadata
    corecore