Search CORE

3,237 research outputs found

View Selection in Semantic Web Databases

Author: François Goasdoué
François Goasdoué
Ioana Manolescu
Julien Leblay
Julien Leblay
Konstantinos Karanasos
Konstantinos Karanasos
Équipes-projets Leo
Publication venue
Publication date: 01/01/2011
Field of study

We consider the setting of a Semantic Web database, containing both explicit data encoded in RDF triples, and implicit data, implied by the RDF semantics. Based on a query workload, we address the problem of selecting a set of views to be materialized in the database, minimizing a combination of query processing, view storage, and view maintenance costs. Starting from an existing relational view selection method, we devise new algorithms for recommending view sets, and show that they scale significantly beyond the existing relational ones when adapted to the RDF context. To account for implicit triples in query answers, we propose a novel RDF query reformulation algorithm and an innovative way of incorporating it into view selection in order to avoid a combinatorial explosion in the complexity of the selection process. The interest of our techniques is demonstrated through a set of experiments.Comment: VLDB201

arXiv.org e-Print Archive

HAL-CentraleSupelec

CiteSeerX

INRIA a CCSD electronic archive server

Oxford University Research Archive

HAL-Rennes 1

Human behavior based particle swarm optimization for materialized view selection in data warehousing environment

Author: Fadhil Zainab Mahmood
Publication venue: 'International University of Sarajevo'
Publication date: 06/12/2020
Field of study

Because of the Materialized View (MV) space value and repair cost limitation in Data Warehouse (DW) environment, the materialization of all views was practically impossible thus suitable MV selection was one of the smart decisions in building DW to get optimal efficiency, at the same time in the modern world, techniques for enhancing DW quality were appeared continuously such as swarm intelligence. Therefore, this paper presents first framework for speeding up query response time depending on Human Particle Swarm Optimization (HPSO) algorithm for determining the best locations of the views in the DW. The results showed that the proposed method for selecting best MV using HPSO algorithm is better than other algorithms via calculating the ratio of query response time on the base tables of DW and compare it to the response time of the same queries on the MVs. Ratio of implementing the query on the base table takes 14 times more time than the query implementation on the MVs. Where the response time of queries through MVs access equal to 106 milliseconds while by direct access queries equal to 1066 milliseconds. This outlines that the performance of query through MVs access is 1471.698% better than those directly access via DW-logical

Periodicals of Engineering and Natural Sciences (PEN - International University of Sarajevo)

How to Price Shared Optimizations in the Cloud

Author: Balazinska Magdalena
Suciu Dan
Upadhyaya Prasang
Publication venue
Publication date: 01/01/2011
Field of study

Data-management-as-a-service systems are increasingly being used in collaborative settings, where multiple users access common datasets. Cloud providers have the choice to implement various optimizations, such as indexing or materialized views, to accelerate queries over these datasets. Each optimization carries a cost and may benefit multiple users. This creates a major challenge: how to select which optimizations to perform and how to share their cost among users. The problem is especially challenging when users are selfish and will only report their true values for different optimizations if doing so maximizes their utility. In this paper, we present a new approach for selecting and pricing shared optimizations by using Mechanism Design. We first show how to apply the Shapley Value Mechanism to the simple case of selecting and pricing additive optimizations, assuming an offline game where all users access the service for the same time-period. Second, we extend the approach to online scenarios where users come and go. Finally, we consider the case of substitutive optimizations. We show analytically that our mechanisms induce truth- fulness and recover the optimization costs. We also show experimentally that our mechanisms yield higher utility than the state-of-the-art approach based on regret accumulation.Comment: VLDB201

arXiv.org e-Print Archive

CiteSeerX

A Novel Hybrid Optimization With Ensemble Constraint Handling Approach for the Optimal Materialized Views

Author: Boppana Venugopal
Davuluri Suneetha
Gorintla Shobana
Murali Krishna Chinta Venkata
Rao Muvva Venkateswara
Srinivasarao Popuri
Vullam Nagagopiraj
Publication venue: Auricle Global Society of Education and Research
Publication date: 07/10/2023
Field of study

The datawarehouse is extremely challenging to work with, as doing so necessitates a significant investment of both time and space. As a result, it is essential to enable rapid data processing in order to cut down on the amount of time needed to respond to queries that are sent to the warehouse. To effectively solve this problem, one of the significant approaches that should be taken is to take the view of materialization. It is extremely unlikely that all of the views that can be derived from the data will ever be materialized. As a result, view subsets need to be selected intelligently in order to enable rapid data processing for queries coming from a variety of locations. The Materialized view selection problem is addressed by the model that has been proposed. The model is based on the ensemble constraint handling techniques (ECHT). In order to optimize the problem, we must take into account the constraints, which include the self-adaptive penalty, the Epsilon ()-parameter, and the stochastic ranking. For the purpose of making a quicker and more accurate selection of queries from the data warehouse, the proposed model includes the implementation of an innovative algorithm known as the constrained hybrid Ebola with COATI optimization (CHECO) algorithm. For the purpose of computing the best possible fitness, the goals of "processing cost of the query," "response cost," and "maintenance cost" are each defined. The top views are selected by the CHECO algorithm based on whether or not the defined fitness requirements are met. In the final step of the process, the proposed model is compared to the models already in use in order to validate the performance improvement in terms of a variety of performance metrics

International Journal on Recent and Innovation Trends in Computing and Communication

EFFICIENT APPROACH FOR VIEW SELECTION FOR DATA WAREHOUSE USING TREE MINING AND EVOLUTIONARY COMPUTATION

Author: Deshpande Parag
Thakare Atul
Publication venue: 'AGHU University of Science and Technology Press'
Publication date: 25/11/2018
Field of study

Selection of a proper set of views to materialize plays an important role indatabase performance. There are many methods of view selection which uses different techniques and frameworks to select an efficient set of views for materialization. In this paper, we present a new efficient, scalable method for view selection under the given storage constraints using a tree mining approach and evolutionary optimization. Tree mining algorithm is designed to determine the exact frequency of (sub)queries in the historical SQL dataset. Query Cost model achieves the objective of maximizing the performance benefits from the final view set which is derived from the frequent view set given by tree mining algorithm. Performance benefit of a query is defined as a function of queryfrequency, query creation cost, and query maintenance cost. The experimental results shows that the proposed method is successful in recommending a solution which is fairly close to optimal solution

Computer Science Journal (AGH University of Science and Technology, Krakow)