11 research outputs found

    Integrity Constraints Revisited: From Exact to Approximate Implication

    Get PDF
    Integrity constraints such as functional dependencies (FD), and multi-valued dependencies (MVD) are fundamental in database schema design. Likewise, probabilistic conditional independences (CI) are crucial for reasoning about multivariate probability distributions. The implication problem studies whether a set of constraints (antecedents) implies another constraint (consequent), and has been investigated in both the database and the AI literature, under the assumption that all constraints hold exactly. However, many applications today consider constraints that hold only approximately. In this paper we define an approximate implication as a linear inequality between the degree of satisfaction of the antecedents and consequent, and we study the relaxation problem: when does an exact implication relax to an approximate implication? We use information theory to define the degree of satisfaction, and prove several results. First, we show that any implication from a set of data dependencies (MVDs+FDs) can be relaxed to a simple linear inequality with a factor at most quadratic in the number of variables; when the consequent is an FD, the factor can be reduced to 1. Second, we prove that there exists an implication between CIs that does not admit any relaxation; however, we prove that every implication between CIs relaxes "in the limit". Finally, we show that the implication problem for differential constraints in market basket analysis also admits a relaxation with a factor equal to 1. Our results recover, and sometimes extend, several previously known results about the implication problem: implication of MVDs can be checked by considering only 2-tuple relations, and the implication of differential constraints for frequent item sets can be checked by considering only databases containing a single transaction

    Probabilistic Team Semantics

    Get PDF
    Team semantics is a semantical framework for the study of dependence and independence concepts ubiquitous in many areas such as databases and statistics. In recent works team semantics has been generalised to accommodate also multisets and probabilistic dependencies. In this article we study a variant of probabilistic team semantics and relate this framework to a Tarskian two-sorted logic. We also show that very simple quantifier-free formulae of our logic give rise to backslashmathrm NP NP -hard model checking problems.Peer reviewe

    Entropy characterization of commutative partitions.

    Get PDF
    Lo Ying Hang.Thesis (M.Phil.)--Chinese University of Hong Kong, 2004.Includes bibliographical references (leaves 80-81).Abstracts in English and Chinese.Chapter Chapter 1 --- Introduction --- p.1Chapter Chapter 2 --- Background --- p.4Chapter Chapter 3 --- Commutative Partition Pair Analysis --- p.9Chapter Chapter 4 --- Entropy Inequalities for Partition Pair --- p.19Chapter Chapter 5 --- Entropy Characterization of Commutative Partition Pair --- p.32Chapter Chapter 6 --- Ordered Commutative Partitions --- p.43Chapter Chapter 7 --- Running Intersection Property for Partitions --- p.45Chapter Chapter 8 --- Entropy Characterization of Ordered Commutative Partitions --- p.53Chapter Chapter 9 --- Significance and Application --- p.72Chapter Chapter 10 --- Future Plan --- p.78Chapter Chapter 11 --- Conclusion --- p.79Bibliography --- p.8

    Prototipo de tutor inteligente para el aprendizaje de la programación de computadores

    Get PDF
    Trabajo de InvestigaciónLa creciente necesidad de ingenieros, en especial desarrolladores de software, ha incrementado la importancia de la instrucción de estos profesionales. Las dificultades en el aprendizaje de programación resaltan la importancia de localizar los factores críticos que afectan el desempeño de los estudiantes y plantear estrategias que permitan mejorar las posibilidades de cada estudiante en su proceso de aprendizaje. Para esto, se usan métodos de clasificación automática para localizar patrones que relacionen características personales con el desempeño en programación de computadores. Además se sugiere el uso de un modelo para mejorar el proceso de enseñanza.INTRODUCCIÓN 1. GENERALIDADES 2. CARACTERIZACIÓN DE LA ENSEÑANZA DE LA PROGRAMACIÓN 4. TUTORES INTELIGENTES 5. ALGORITMOS PARA EL RECONOCIMIENTO DE PATRONES. 6. DISEÑO DEL MODELO DE DATOS PARA LAS CARACTERISTICAS 7. ANÁLISIS DE RESULTADOS 8. DISEÑO Y CONSTRUCCIÓN DEL TUTOR INTELIGENTE 9. CONCLUSIONES 10. RECOMENDACIONES BIBLIOGRAFÍAPregradoIngeniero de Sistema

    On the Implication Problem for Probabilistic Conditional Independency

    No full text
    The implication problem is to test whether a given set of independencies logically implies another independency. This problem is crucial in the design of a probabilistic reasoning system. We advocate that Bayesian networks are a generalization of standard relational databases. On the contrary, it has been suggested that Bayesian networks are different from the relational databases because the implication problem of these two systems does not coincide for some classes of probabilistic independencies. This remark, however, does not take into consideration one important issue, namely, the solvability of the implication problem
    corecore