315 research outputs found

    Expressing OLAP operators with the TAX XML algebra

    Full text link
    With the rise of XML as a standard for representing business data, XML data warehouses appear as suitable solutions for Web-based decision-support applications. In this context, it is necessary to allow OLAP analyses over XML data cubes (XOLAP). Thus, XQuery extensions are needed. To help define a formal framework and allow much-needed performance optimizations on analytical queries expressed in XQuery, having an algebra at one's disposal is desirable. However, XOLAP approaches and algebras from the literature still largely rely on the relational model and/or only feature a small number of OLAP operators. In opposition, we propose in this paper to express a broad set of OLAP operators with the TAX XML algebra.Comment: in 3rd International Workshop on Database Technologies for Handling XML Information on the Web (DataX-EDBT 08), Nantes : France (2008

    Current Status of Radio Source Databases

    Get PDF
    We review the history and present status of radio-source catalogue archiving and on-line retrieval of radio source data. Large efforts were spent by the first author in collecting and restoring electronic versions of new and old source catalogues. Some 67 catalogues with ~520,000 entries were searchable via the "Einstein On-line Service" (EOLS). When EOLS lost maintenance support in 1994 a group at SAO (Russia) started building software tools to search and cross-identify objects between the major radio catalogues, maintained as the "CATalog supporting System" (CATS) at the Special Astrophysical Observatory (SAO, Russia). The independent efforts in east and west have recently been joined. Almost 400 different source lists with ~2,000,000 entries have been archived (and partly prepared) by us. All 5C and Penticton "P"-surveys and many of the published WSRT survey lists are now available. CATS has been developed by O. Verkhodanov, S. Trushkin, V. Chernenkov at SAO primarily to support RATAN-600 radio observations. CATS runs under LINUX and can process requests on the basis of various net protocols and via email. Almost 70 well-known radio source catalogues and tables with about 1.3 Mrecords are now available via ftp from CATS, as well as their documentation files. Twenty of the larger tables may be searched simultaneously for objects in rectangular boxes of coordinates. New routines for cross-matching are in progress. More and more catalogues are being folded into CATS. CATS is supported by RFBR grant 96-07-89075.Comment: 2 pages, no figures; to appear in Proc. "Observational Cosmology with the New Radio Surveys", eds. M. Bremer, N. Jackson & I. Perez-Fournon, Kluwer Acad. Pres

    Databases and Information Systems in the AI Era: Contributions from ADBIS, TPDL and EDA 2020 Workshops and Doctoral Consortium

    Get PDF
    Research on database and information technologies has been rapidly evolving over the last couple of years. This evolution was lead by three major forces: Big Data, AI and Connected World that open the door to innovative research directions and challenges, yet exploiting four main areas: (i) computational and storage resource modeling and organization; (ii) new programming models, (iii) processing power and (iv) new applications that emerge related to health, environment, education, Cultural Heritage, Banking, etc. The 24th East-European Conference on Advances in Databases and Information Systems (ADBIS 2020), the 24th International Conference on Theory and Practice of Digital Libraries (TPDL 2020) and the 16th Workshop on Business Intelligence and Big Data (EDA 2020), held during August 25–27, 2020, at Lyon, France, and associated satellite events aimed at covering some emerging issues related to database and information system research in these areas. The aim of this paper is to present such events, their motivations, and topics of interest, as well as briefly outline the papers selected for presentations. The selected papers will then be included in the remainder of this volume

    Data Mining-based Fragmentation of XML Data Warehouses

    Full text link
    With the multiplication of XML data sources, many XML data warehouse models have been proposed to handle data heterogeneity and complexity in a way relational data warehouses fail to achieve. However, XML-native database systems currently suffer from limited performances, both in terms of manageable data volume and response time. Fragmentation helps address both these issues. Derived horizontal fragmentation is typically used in relational data warehouses and can definitely be adapted to the XML context. However, the number of fragments produced by classical algorithms is difficult to control. In this paper, we propose the use of a k-means-based fragmentation approach that allows to master the number of fragments through its kk parameter. We experimentally compare its efficiency to classical derived horizontal fragmentation algorithms adapted to XML data warehouses and show its superiority

    Resource Allocation for Query Optimization in Data Grid Systems: Static Load Balancing Strategies

    Get PDF
    International audienceResource allocation is one of the principal stages of relational query processing in data grid systems. Static allocation methods allocate nodes to relational operations during query compilation. Existing heuristics did not take into account the multi-queries environment, where some nodes may become overloaded because they are allocated to too many concurrent queries. Dynamic resource allocation mechanisms are currently developed to modify the physical plan during query execution. In fact, when a node is detected to be overloaded, some of the operations on it will migrate. However, if the resource contention is too heavy in the initial execution plan, the operation migration cost may be very high. In this paper, we propose two load balancing strategies adopted during the static resource allocation phase, so that the workload is balanced at the beginning, the operation migration cost is decreased during the query execution, and therefore the average response time is reduced

    Implementation of multidimensional databases in column-oriented NoSQL systems

    Get PDF
    International audienceNoSQL (Not Only SQL) systems are becoming popular due to known advantages such as horizontal scalability and elasticity. In this paper, we study the implementation of multidimensional data warehouses with columnoriented NoSQL systems. We define mapping rules that transform the conceptual multidimensional data model to logical column-oriented models. We consider three different logical models and we use them to instantiate data warehouses. We focus on data loading, model-to-model conversion and OLAP cuboid computation

    Knowledge and Metadata Integration for Warehousing Complex Data

    Full text link
    With the ever-growing availability of so-called complex data, especially on the Web, decision-support systems such as data warehouses must store and process data that are not only numerical or symbolic. Warehousing and analyzing such data requires the joint exploitation of metadata and domain-related knowledge, which must thereby be integrated. In this paper, we survey the types of knowledge and metadata that are needed for managing complex data, discuss the issue of knowledge and metadata integration, and propose a CWM-compliant integration solution that we incorporate into an XML complex data warehousing framework we previously designed.Comment: 6th International Conference on Information Systems Technology and its Applications (ISTA 07), Kharkiv : Ukraine (2007
    corecore