Search CORE

26 research outputs found

SeeDB: automatically generating query visualizations

Author: Aditya Parameswaran
Manasi Vartak
Mit
Neoklis Polyzotis
Samuel Madden
Publication venue: 'VLDB Endowment'
Publication date: 01/01/2014
Field of study

Data analysts operating on large volumes of data often rely on visualizations to interpret the results of queries. However, finding the right visualization for a query is a laborious and time-consuming task. We demonstrate SeeDB, a system that partially automates this task: given a query, SeeDB explores the space of all possible visualizations, and automatically identifies and recommends to the analyst those visualizations it finds to be most "interesting" or "useful". In our demonstration, conference attendees will see SeeDB in action for a variety of queries on multiple real-world datasets

CiteSeerX

DSpace@MIT

Crossref

Predicting your next OLAP query based on recent analytical sessions

Author: A. Giacometti
C. Sapia
G. Adomavicius
G. Chatzopoulou
N. Khoussainova
N. Khoussainova
R. Baeza-Yates
S. Sarawagi
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2013
Field of study

International audienceIn Business Intelligence systems, users interact with data warehouses by formulating OLAP queries aimed at exploring multidimensional data cubes. Being able to predict the most likely next queries would provide a way to recommend interesting queries to users on the one hand, and could improve the efficiency of OLAP sessions on the other. In particular, query recommendation would proactively guide users in data exploration and improve the quality of their interactive experience. In this paper, we propose a framework to predict the most likely next query and recommend this to the user. Our framework relies on a probabilistic user behavior model built by analyzing previous OLAP sessions and exploiting a query similarity metric. To gain insight in the recommendation precision and on what parameters it depends, we evaluate our approach using different quality assessments

HAL-CentraleSupelec

CiteSeerX

Crossref

Archivio istituzionale della ricerca - Alma Mater Studiorum Università di Bologna

HAL Université de Tours

Optimization of Association Rules Extraction Through Exploitation of Context Dependent Constraints

Author: Botta M.
Esposito R.
Gallo A.
Meo R.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2005
Field of study

Institutional Research Information System University of Turin

Interactive Data Exploration with Smart Drill-Down

Author: Garcia-Molina Hector
Joglekar Manas
Parameswaran Aditya
Publication venue
Publication date: 01/05/2016
Field of study

We present {\em smart drill-down}, an operator for interactively exploring a relational table to discover and summarize "interesting" groups of tuples. Each group of tuples is described by a {\em rule}. For instance, the rule

(a, b, \star, 1000)

tells us that there are a thousand tuples with value

a

in the first column and

b

in the second column (and any value in the third column). Smart drill-down presents an analyst with a list of rules that together describe interesting aspects of the table. The analyst can tailor the definition of interesting, and can interactively apply smart drill-down on an existing rule to explore that part of the table. We demonstrate that the underlying optimization problems are {\sc NP-Hard}, and describe an algorithm for finding the approximately optimal list of rules to display when the user uses a smart drill-down, and a dynamic sampling scheme for efficiently interacting with large tables. Finally, we perform experiments on real datasets on our experimental prototype to demonstrate the usefulness of smart drill-down and study the performance of our algorithms

arXiv.org e-Print Archive

Crossref

PubMed Central

eScholarship - University of California

A Holistic Approach to OLAP Sessions Composition: The Falseto Experience

Author: Aligon Julien
Boulil Kamal
Marcel Patrick
Peralta Verónika
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2014
Field of study

International audienceOLAP is the main paradigm for flexible and effective exploration of multidimensional cubes in data warehouses. During an OLAP session the user analyzes the results of a query and determines a new query that will give her a better understanding of information. Given the huge size of the data space, this exploration process is often tedious and may leave the user disoriented and frustrated. This paper presents an OLAP tool 1 named Falseto (Former AnalyticaL Sessions for lEss Tedious Olap), that is meant to assist query and session composition, by letting the user summarize, browse, query, and reuse former analytical sessions. Falseto's implementation on top of a formal framework is detailed. We also report the experiments we run to obtain and analyze real OLAP sessions and assess Falseto with them. Finally, we discuss how Falseto can be seen as a starting point for bridging OLAP with exploratory search, a search paradigm centered on the user and the evolution of her knowledge

Crossref

HAL Université de Tours

Database Learning: Toward a Database that Becomes Smarter Every Time

Author: Acharya S.
Agrawal S.
Bishop C. M.
Carbonell J. G.
Carlson A.
Condie T.
Ganti V.
Idreos S.
Lawrence N.
Meliou A.
Micchelli C. A.
Mozafari B.
Mozafari B.
Mozafari B.
Olston C.
Park Y.
Rusu F.
Sarawagi S.
Sidirourgos L.
Skilling J.
Wasserman L.
Williams C. K.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 28/03/2017
Field of study

In today's databases, previous query answers rarely benefit answering future queries. For the first time, to the best of our knowledge, we change this paradigm in an approximate query processing (AQP) context. We make the following observation: the answer to each query reveals some degree of knowledge about the answer to another query because their answers stem from the same underlying distribution that has produced the entire dataset. Exploiting and refining this knowledge should allow us to answer queries more analytically, rather than by reading enormous amounts of raw data. Also, processing more queries should continuously enhance our knowledge of the underlying distribution, and hence lead to increasingly faster response times for future queries. We call this novel idea---learning from past query answers---Database Learning. We exploit the principle of maximum entropy to produce answers, which are in expectation guaranteed to be more accurate than existing sample-based approximations. Empowered by this idea, we build a query engine on top of Spark SQL, called Verdict. We conduct extensive experiments on real-world query traces from a large customer of a major database vendor. Our results demonstrate that Verdict supports 73.7% of these queries, speeding them up by up to 23.0x for the same accuracy level compared to existing AQP systems.Comment: This manuscript is an extended report of the work published in ACM SIGMOD conference 201

arXiv.org e-Print Archive

Crossref