17,769 research outputs found
Multiple Query Optimization on the D-Wave 2X Adiabatic Quantum Computer
The D-Wave adiabatic quantum annealer solves hard combinatorial optimization
problems leveraging quantum physics. The newest version features over 1000
qubits and was released in August 2015. We were given access to such a machine,
currently hosted at NASA Ames Research Center in California, to explore the
potential for hard optimization problems that arise in the context of
databases.
In this paper, we tackle the problem of multiple query optimization (MQO). We
show how an MQO problem instance can be transformed into a mathematical formula
that complies with the restrictive input format accepted by the quantum
annealer. This formula is translated into weights on and between qubits such
that the configuration minimizing the input formula can be found via a process
called adiabatic quantum annealing. We analyze the asymptotic growth rate of
the number of required qubits in the MQO problem dimensions as the number of
qubits is currently the main factor restricting applicability. We
experimentally compare the performance of the quantum annealer against other
MQO algorithms executed on a traditional computer. While the problem sizes that
can be treated are currently limited, we already find a class of problem
instances where the quantum annealer is three orders of magnitude faster than
other approaches
Optimal web-scale tiering as a flow problem
We present a fast online solver for large scale parametric max-flow problems as they occur in portfolio optimization, inventory management, computer vision, and logistics. Our algorithm solves an integer linear program in an online fashion. It exploits total unimodularity of the constraint matrix and a Lagrangian relaxation to solve the problem as a convex online game. The algorithm generates approximate solutions of max-flow problems by performing stochastic gradient descent on a set of flows. We apply the algorithm to optimize tier arrangement of over 84 million web pages on a layered set of caches to serve an incoming query stream optimally
Graphing of E-Science Data with varying user requirements
Based on our experience in the Swiss Experiment, exploring experimental, scientific data is often done in a visual way. Starting from a global overview the users are zooming in on interesting events. In case of huge data volumes special data structures have to be introduced to provide fast and easy access to the data. Since it is hard to predict on how users will work with the data a generic approach requires self-adaptation of the required special data structures. In this paper we describe the underlying NP-hard problem and present several approaches to address the problem with varying properties. The approaches are illustrated with a small example and are evaluated with a synthetic data set and user queries
On Optimally Partitioning Variable-Byte Codes
The ubiquitous Variable-Byte encoding is one of the fastest compressed
representation for integer sequences. However, its compression ratio is usually
not competitive with other more sophisticated encoders, especially when the
integers to be compressed are small that is the typical case for inverted
indexes. This paper shows that the compression ratio of Variable-Byte can be
improved by 2x by adopting a partitioned representation of the inverted lists.
This makes Variable-Byte surprisingly competitive in space with the best
bit-aligned encoders, hence disproving the folklore belief that Variable-Byte
is space-inefficient for inverted index compression. Despite the significant
space savings, we show that our optimization almost comes for free, given that:
we introduce an optimal partitioning algorithm that does not affect indexing
time because of its linear-time complexity; we show that the query processing
speed of Variable-Byte is preserved, with an extensive experimental analysis
and comparison with several other state-of-the-art encoders.Comment: Published in IEEE Transactions on Knowledge and Data Engineering
(TKDE), 15 April 201
- …