3 research outputs found

    system architecture for approximate query processing

    Get PDF
    Decision making is an activity that addresses the problem of extracting knowledge and information from data stored in data warehouses, in order to improve the business processes of information systems. Usually, decision making is based on On-Line Analytical Processing, data mining, or approximate query processing. In the last case, answers to analytical queries are provided in a fast manner, although affected with a small percentage of error. In the paper, we present the architecture of an approximate query answering system. Then, we illustrate our ADAP (Analytical Data Profile) system, which is based on an engine able to provide fast responses to the main statistical functions by using orthogonal polynomials series to approximate the data distribution of multi­dimensional relations. Moreover, several experimental results to measure the approximation error are shown and the response-time to analytical queries is reported.</p

    ANALYTICAL PROFILE ESTIMATION IN DATABASE-SYSTEMS

    No full text
    Most parameters which constitute the statistical profile are related to the record selectivity. To estimate record selectivity factors, the nonparametric are better than parametric methods in that they make no a priori assumptions concerning the data distribution and generally provide accurate results. Nonparametric methods are classified into the usual scale-based methods, which function by the scaling of attribute ranges, and analytic methods discussed in this paper, which are scale independent. Our analytic method is based on the computation of a set of parameters, the so-called Canonical Coefficients, which enable the multivariate distribution of the data to be well known. Based on the canonical coefficients, the main parameters of database statistical profiles can be easily defined and efficiently calculated (in terms of computation time and estimation accuracy). In addition, some important applications, which are of peculiar interest to statistical database systems can be developed. Experimental results on real databases are presented which demonstrate the versatility and reliability of the analytic approach
    corecore