2 research outputs found

    Incremental discovery of functional dependencies with a bit-vector algorithm

    No full text
    Functional dependencies (fds) were conceived in the early ’70s, and were mainly used to verify database design and assess data quality. Nowadays they are automatically discovered from data since they can be exploited for many different purposes, such as query relaxation, data cleansing, and record matching. In the context of big data, the speed at which new data is being created demand for new efficient algorithms for fd discovery. In this paper, we propose an incremental discovery algorithm for fds, which is able to update the set of holding fds upon modifications to the data instance, without having to restart the discovery process from scratch. It exploits a bit-vector representation of fds, and an upward/downward search strategy aiming to reduce the overall search space. Experimental results show that such algorithm could achieve extremely better time performances with respect to a complete re-execution of the discovery algorithm
    corecore