114 research outputs found

    Mining data quality rules based on T-dependence

    Get PDF
    Since their introduction in 1976, edit rules have been a standard tool in statistical analysis. Basically, edit rules are a compact representation of non-permitted combinations of values in a dataset. In this paper, we propose a technique to automatically find edit rules by use of the concept of T-dependence. We first generalize the traditional notion of lift, to that of T-lift, where stochastic independence is generalized to T-dependence. A combination of values is declared as an edit rule under a t-norm T if there is a strong negative correlation under T-dependence. We show several interesting properties of this approach. In particular, we show that under the minimum t-norm, edit rules can be computed efficiently by use of frequent pattern trees. Experimental results show that there is a weak to medium correlation in the rank order of edit rules obtained under T_M and T_P, indicating that the semantics of these kinds of dependencies are different

    Comparing fbeta-optimal with distance based merge functions

    Get PDF
    Merge functions informally combine information from a certain universe into a solution over that same universe. This typically results in a, preferably optimal, summarization. In previous research, merge functions over sets have been looked into extensively. A specic case concerns sets that allow elements to appear more than once, multisets. In this paper we compare two types of merge functions over multisets against each other. We examine both general properties as practical usability in a real world application

    Bipolarity in ear biometrics

    Get PDF
    Identifying people using their biometric data is a problem that is getting increasingly more attention. This paper investigates a method that allows the matching of people in the context of victim identification by using their ear biometric data. A high quality picture (taken professionally) is matched against a set of low quality pictures (family albums). In this paper soft computing methods are used to model different kinds of uncertainty that arise when manually annotating the pictures. More specifically, we study the use of bipolar satisfaction degrees to explicitly handle the bipolar information about the available ear biometrics

    A measure-theoretic foundation for data quality

    Get PDF

    Genital sensitivity after genito-urinary surgery

    Get PDF

    Quorumpeps database : chemical space, microbial origin and functionality of quorum sensing peptides

    Get PDF
    Quorum-sensing (QS) peptides are biologically attractive molecules, with a wide diversity of structures and prone to modifications altering or presenting new functionalities. Therefore, the Quorumpeps database (http://quorumpeps.ugent.be) is developed to give a structured overview of the QS oligopeptides, describing their microbial origin (species), functionality (method, result and receptor), peptide links and chemical characteristics (3D-structure-derived physicochemical properties). The chemical diversity observed within this group of QS signalling molecules can be used to develop new synthetic bio-active compounds
    corecore