Search CORE

315 research outputs found

Expressing OLAP operators with the TAX XML algebra

Author: Darmont Jérôme
Hachicha Marouane
Mahboubi Hadj
Publication venue
Publication date: 01/01/2008
Field of study

With the rise of XML as a standard for representing business data, XML data warehouses appear as suitable solutions for Web-based decision-support applications. In this context, it is necessary to allow OLAP analyses over XML data cubes (XOLAP). Thus, XQuery extensions are needed. To help define a formal framework and allow much-needed performance optimizations on analytical queries expressed in XQuery, having an algebra at one's disposal is desirable. However, XOLAP approaches and algebras from the literature still largely rely on the relational model and/or only feature a small number of OLAP operators. In opposition, we propose in this paper to express a broad set of OLAP operators with the TAX XML algebra.Comment: in 3rd International Workshop on Database Technologies for Handling XML Information on the Web (DataX-EDBT 08), Nantes : France (2008

arXiv.org e-Print Archive

Crossref

HAL Descartes

Current Status of Radio Source Databases

Author: H Andernach
SA Trushkin
SA Trushkin
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/1997
Field of study

We review the history and present status of radio-source catalogue archiving and on-line retrieval of radio source data. Large efforts were spent by the first author in collecting and restoring electronic versions of new and old source catalogues. Some 67 catalogues with ~520,000 entries were searchable via the "Einstein On-line Service" (EOLS). When EOLS lost maintenance support in 1994 a group at SAO (Russia) started building software tools to search and cross-identify objects between the major radio catalogues, maintained as the "CATalog supporting System" (CATS) at the Special Astrophysical Observatory (SAO, Russia). The independent efforts in east and west have recently been joined. Almost 400 different source lists with ~2,000,000 entries have been archived (and partly prepared) by us. All 5C and Penticton "P"-surveys and many of the published WSRT survey lists are now available. CATS has been developed by O. Verkhodanov, S. Trushkin, V. Chernenkov at SAO primarily to support RATAN-600 radio observations. CATS runs under LINUX and can process requests on the basis of various net protocols and via email. Almost 70 well-known radio source catalogues and tables with about 1.3 Mrecords are now available via ftp from CATS, as well as their documentation files. Twenty of the larger tables may be searched simultaneously for objects in rectangular boxes of coordinates. New routines for cross-matching are in progress. More and more catalogues are being folded into CATS. CATS is supported by RFBR grant 96-07-89075.Comment: 2 pages, no figures; to appear in Proc. "Observational Cosmology with the New Radio Surveys", eds. M. Bremer, N. Jackson & I. Perez-Fournon, Kluwer Acad. Pres

arXiv.org e-Print Archive

CiteSeerX

Crossref

CERN Document Server

Databases and Information Systems in the AI Era: Contributions from ADBIS, TPDL and EDA 2020 Workshops and Doctoral Consortium

Author: Bellatreche Ladjel
Bentayeb Fadila
Bieliková Mária
Boussaid Omar
Catania Barbara
Ceravolo Paolo
Demidova Elena
Gomez Lopez Maria Teresa
Halfeld Ferrari Mirian
Kordić Slavica
Luković Ivan
Manghi Paolo
Mannocci Andrea
Osborne Francesco
Papatheodorou Christos
Ristić Sonja
Romero Oscar
S. Hara Carmem
Sacharidis Dimitris
Salatino Angelo
Talens Guilaine
van Keulen Maurice
Vergoulis Thanasis
Zumer Maja
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2020
Field of study

Research on database and information technologies has been rapidly evolving over the last couple of years. This evolution was lead by three major forces: Big Data, AI and Connected World that open the door to innovative research directions and challenges, yet exploiting four main areas: (i) computational and storage resource modeling and organization; (ii) new programming models, (iii) processing power and (iv) new applications that emerge related to health, environment, education, Cultural Heritage, Banking, etc. The 24th East-European Conference on Advances in Databases and Information Systems (ADBIS 2020), the 24th International Conference on Theory and Practice of Digital Libraries (TPDL 2020) and the 16th Workshop on Business Intelligence and Big Data (EDA 2020), held during August 25–27, 2020, at Lyon, France, and associated satellite events aimed at covering some emerging issues related to database and information system research in these areas. The aim of this paper is to present such events, their motivations, and topics of interest, as well as briefly outline the papers selected for presentations. The selected papers will then be included in the remainder of this volume

Crossref

AIR Universita degli studi di Milano

Open Research Online (The Open University)

Archivio istituzionale della ricerca - Università di Genova

University of Twente Research Information

idUS. Depósito de Investigación Universidad de Sevilla

Data Mining-based Fragmentation of XML Data Warehouses

Author: Darmont Jérôme
Mahboubi Hadj
Publication venue
Publication date: 01/01/2008
Field of study

With the multiplication of XML data sources, many XML data warehouse models have been proposed to handle data heterogeneity and complexity in a way relational data warehouses fail to achieve. However, XML-native database systems currently suffer from limited performances, both in terms of manageable data volume and response time. Fragmentation helps address both these issues. Derived horizontal fragmentation is typically used in relational data warehouses and can definitely be adapted to the XML context. However, the number of fragments produced by classical algorithms is difficult to control. In this paper, we propose the use of a k-means-based fragmentation approach that allows to master the number of fragments through its

k

parameter. We experimentally compare its efficiency to classical derived horizontal fragmentation algorithms adapted to XML data warehouses and show its superiority

arXiv.org e-Print Archive

CiteSeerX

Crossref

HAL Descartes

Resource Allocation for Query Optimization in Data Grid Systems: Static Load Balancing Strategies

Author: Epimakhov Igor
Hameurlain Abdelkader
MORVAN Franck
Yin Shaoyi
Publication venue: HAL CCSD
Publication date: 01/01/2013
Field of study

International audienceResource allocation is one of the principal stages of relational query processing in data grid systems. Static allocation methods allocate nodes to relational operations during query compilation. Existing heuristics did not take into account the multi-queries environment, where some nodes may become overloaded because they are allocated to too many concurrent queries. Dynamic resource allocation mechanisms are currently developed to modify the physical plan during query execution. In fact, when a node is detected to be overloaded, some of the operations on it will migrate. However, if the resource contention is too heavy in the initial execution plan, the operation migration cost may be very high. In this paper, we propose two load balancing strategies adopted during the static resource allocation phase, so that the workload is balanced at the beginning, the operation migration cost is decreased during the query execution, and therefore the average response time is reduced

Crossref

Scientific Publications of the University of Toulouse II Le Mirail

Open Archive Toulouse Archive Ouverte

Implementation of multidimensional databases in column-oriented NoSQL systems

Author: Chevalier Max
El Malki Mohammed
Kopliku Arlind
Teste Olivier
Tournier Ronan
Publication venue: HAL CCSD
Publication date: 01/01/2015
Field of study

International audienceNoSQL (Not Only SQL) systems are becoming popular due to known advantages such as horizontal scalability and elasticity. In this paper, we study the implementation of multidimensional data warehouses with columnoriented NoSQL systems. We define mapping rules that transform the conceptual multidimensional data model to logical column-oriented models. We consider three different logical models and we use them to instantiate data warehouses. We focus on data loading, model-to-model conversion and OLAP cuboid computation

Crossref

Scientific Publications of the University of Toulouse II Le Mirail

Open Archive Toulouse Archive Ouverte

Toulouse Capitole Publications

Toulouse 1 Capitole Publications

Knowledge and Metadata Integration for Warehousing Complex Data

Author: Darmont Jérôme
Ralaivao Jean-Christian
Publication venue
Publication date: 01/01/2007
Field of study

With the ever-growing availability of so-called complex data, especially on the Web, decision-support systems such as data warehouses must store and process data that are not only numerical or symbolic. Warehousing and analyzing such data requires the joint exploitation of metadata and domain-related knowledge, which must thereby be integrated. In this paper, we survey the types of knowledge and metadata that are needed for managing complex data, discuss the issue of knowledge and metadata integration, and propose a CWM-compliant integration solution that we incorporate into an XML complex data warehousing framework we previously designed.Comment: 6th International Conference on Information Systems Technology and its Applications (ISTA 07), Kharkiv : Ukraine (2007

arXiv.org e-Print Archive

HAL Descartes