217 research outputs found

    CoPhy: A Scalable, Portable, and Interactive Index Advisor for Large Workloads

    Get PDF
    Index tuning, i.e., selecting the indexes appropriate for a workload, is a crucial problem in database system tuning. In this paper, we solve index tuning for large problem instances that are common in practice, e.g., thousands of queries in the workload, thousands of candidate indexes and several hard and soft constraints. Our work is the first to reveal that the index tuning problem has a well structured space of solutions, and this space can be explored efficiently with well known techniques from linear optimization. Experimental results demonstrate that our approach outperforms state-of-the-art commercial and research techniques by a significant margin (up to an order of magnitude).Comment: VLDB201

    Supporting case-based retrieval by similarity skylines: Basic concepts and extensions

    Get PDF
    Conventional approaches to similarity search and case-based retrieval, such as nearest neighbor search, require the speci cation of a global similarity measure which is typically expressed as an aggregation of local measures pertaining to di erent aspects of a case. Since the proper aggregation of local measures is often quite di cult, we propose a novel concept called similarity skyline. Roughly speaking, the similarity skyline of a case base is de ned by the subset of cases that are most similar to a given query in a Pareto sense. Thus, the idea is to proceed from a d-dimensional comparison between cases in terms of d (local) distance measures and to identify those cases that are maximally similar in the sense of the Pareto dominance relation [2]. To re ne the retrieval result, we propose a method for computing maximally diverse subsets of a similarity skyline. Moreover, we propose a generalization of similarity skylines which is able to deal with uncertain data described in terms of interval or fuzzy attribute values. The method is applied to similarity search over uncertain archaeological data
    • …
    corecore