Search CORE

217 research outputs found

CoPhy: A Scalable, Portable, and Interactive Index Advisor for Large Workloads

Author: Ailamaki Anastasia
Dash Debabrata
Polyzotis Neoklis
Publication venue
Publication date: 16/04/2011
Field of study

Index tuning, i.e., selecting the indexes appropriate for a workload, is a crucial problem in database system tuning. In this paper, we solve index tuning for large problem instances that are common in practice, e.g., thousands of queries in the workload, thousands of candidate indexes and several hard and soft constraints. Our work is the first to reveal that the index tuning problem has a well structured space of solutions, and this space can be explored efficiently with well known techniques from linear optimization. Experimental results demonstrate that our approach outperforms state-of-the-art commercial and research techniques by a significant margin (up to an order of magnitude).Comment: VLDB201

arXiv.org e-Print Archive

Infoscience - École polytechnique fédérale de Lausanne

Supporting case-based retrieval by similarity skylines: Basic concepts and extensions

Author: Hüllermeier Eyke
Prados Suárez María Belén
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2008
Field of study

Conventional approaches to similarity search and case-based retrieval, such as nearest neighbor search, require the speci cation of a global similarity measure which is typically expressed as an aggregation of local measures pertaining to di erent aspects of a case. Since the proper aggregation of local measures is often quite di cult, we propose a novel concept called similarity skyline. Roughly speaking, the similarity skyline of a case base is de ned by the subset of cases that are most similar to a given query in a Pareto sense. Thus, the idea is to proceed from a d-dimensional comparison between cases in terms of d (local) distance measures and to identify those cases that are maximally similar in the sense of the Pareto dominance relation [2]. To re ne the retrieval result, we propose a method for computing maximally diverse subsets of a similarity skyline. Moreover, we propose a generalization of similarity skylines which is able to deal with uncertain data described in terms of interval or fuzzy attribute values. The method is applied to similarity search over uncertain archaeological data

Repositorio Institucional Universidad de Granada