Search CORE

436 research outputs found

The data cyclotron query processing scheme

Author: Kersten M.L. (Martin)
Pereira Goncalves R.A. (Romulo Antonio)
Publication venue: 'MIT Press - Journals'
Publication date: 01/03/2010
Field of study

Distributed database systems exploit static workload characteristics to steer data fragmentation and data allocation schemes. However, the grand challenge of distributed query processing is to come up with a self-organizing architecture, which exploits all resources to manage the hot data set, minimize query response time, and maximize throughput without global co-ordination. In this paper, we introduce the Data Cyclotron architecture which addresses the challenges using turbulent data movement through a storage ring built from distributed main memory capitalizing modern remote-DMA facilities. Queries assigned to individual nodes interact with the Data Cyclotron by picking up data fragments continuously flowing around, i.e., the hot set. Each data fragment carries a level of interest (LOI) metric, which represents the cumulative query interest as the fragment passes around the ring multiple times. A fragment with a LOI below a given threshold, inversely proportional to the ring load, is pulled o

CWI's Institutional Repository

Peak Performance – Remote Memory Revisited

Author: Kersten M.L. (Martin)
Mühleisen H.F. (Hannes)
Pereira Goncalves R.A. (Romulo Antonio)
Publication venue
Publication date: 01/07/2013
Field of study

Many database systems share a need for large amounts of fast storage. However, economies of scale limit the utility of extending a single machine with an arbitrary amount of memory. The recent broad availability of the zero-copy data transfer protocol RDMA over low-latency and high throughput network connections such as InfiniBand prompts us to revisit the long-proposed usage of memory provided by remote machines. In this paper, we present a solution to make use of remote memory without manipulation of the operating system, and investigate the impact on database performance

CWI's Institutional Repository

An architecture for recycling intermediates in a column-store

Author: Ivanova M.G. (Milena)
Kersten M.L. (Martin)
Nes N.J. (Niels)
Pereira Goncalves R.A. (Romulo Antonio)
Publication venue: 'American College of Medical Physics (ACMP)'
Publication date: 01/06/2009
Field of study

Automatically recycling (intermediate) results is a grand challenge for state-of-the-art databases to improve both query response time and throughput. Tuples are loaded and streamed through a tuple-at-a-time processing pipeline avoiding materialization of intermediates as much as possible. This limits the opportunities for reuse of overlapping computations to DBA-defined materialized views and function/result cache tuning. In contrast, the operator-at-a-time execution paradigm produces fully materialized results in each step of the query plan. To avoid resource contention, these intermediates are evicted as soon as possible. In this paper we study an architecture that harvests the by-products of the operator-at-a-time paradigm in a column store system using a lightweight mechanism, the recycler. The key challenge then becomes selection of the policies to admit intermediates to the resource pool, their retention period, and the eviction strategy when facing resource limitations. The proposed recycling architecture has been implemented in an open-source system. An experimental analysis against the TPC-H ad-hoc decision support benchmark and a complex, real-world application (SkyServer) demonstrates its effectiveness in terms of self-organizing behavior and its significant performance gains. The results indicate the potentials of recycling intermediates and charters a route for further development of database kernels

CWI's Institutional Repository

Spinning Relations: High-Speed Networks for Distributed Join Processing

Author: Frey P.W. (Philip)
Kersten M.L. (Martin)
Pereira Goncalves R.A. (Romulo Antonio)
Teubner J. (Jens)
Publication venue: 'American College of Medical Physics (ACMP)'
Publication date: 01/01/2009
Field of study

By leveraging modern networking hardware (RDMA-enabled network cards), we can shift priorities in distributed database processing significantly. Complex and sophisticated mechanisms to avoid network traffic can be replaced by a scheme that takes advantag

CWI's Institutional Repository

An architecture for recycling intermediates in a column-store

Author: Ivanova M.G. (Milena)
Kersten M.L. (Martin)
Nes N.J. (Niels)
Pereira Goncalves R.A. (Romulo Antonio)
Publication venue: A.C.M.
Publication date: 01/12/2010
Field of study

Automatic recycling intermediate results to improve both query response time and throughput is a grand c

CWI's Institutional Repository

A column-store meets the point clouds

Author: Ivanova M.G. (Milena)
Kersten M.L. (Martin)
Martinez-Rubi O. (Oscar)
Pereira Goncalves R.A. (Romulo Antonio)
Publication venue
Publication date: 01/07/2014
Field of study

Dealing with LIDAR data in the context of database management systems calls for a re-assessment of their functionality, performance, and storage/processing limitations. The territory for efficient and scalable processing of LIDAR repositories using GIS enabled database systems is still largely unexplored. Bringing together hard core database management experts and GIS application developers is a sine qua non to advance the state of the art. In particular to assess the relative merits of both traditional row-based database engines and the modern column-oriented database engines

CWI's Institutional Repository

GIS Navigation Boosted by Column Stores

Author: Alvanaki F. (Foteini)
Ivanova M.G. (Milena)
Kersten M.L. (Martin)
Kyzirakos K. (Konstantinos)
Pereira Goncalves R.A. (Romulo Antonio)
Publication venue: 'Springer Fachmedien Wiesbaden GmbH'
Publication date: 01/01/2015
Field of study

CWI's Institutional Repository

Column-store support for RDF data management: not all swans are white

Author: Kersten M.L. (Martin)
Manegold S. (Stefan)
Nes N.J. (Niels)
Pereira Goncalves R.A. (Romulo Antonio)
Sidirourgos E. (Eleftherios)
Publication venue
Publication date: 01/09/2008
Field of study

This paper reports on the results of an independent evaluation of the techniques presented in the VLDB 2007 paper "Scalable Semantic Web Data Management Using Vertical Partitioning", authored by D. Abadi, A. Marcus, S. R. Madden, and K. Hollenbach. We revisit the proposed benchmark and examine both the data and query space coverage. The benchmark is extended to cover a larger portion of the query space in a canonical way. Repeatability of the experiments is assessed using the code base obtained from the authors. Inspired by the proposed vertically-partitioned storage solution for RDF data and the performance figures using a column-store, we conduct a complementary analy- sis of state-of-the-art RDF storage solutions. To this end, we employ MonetDB/SQL, a fully-functional open source column-store, and a well-known --- for its performance --- commercial row-store DBMS.We implement two relational RDF storage solutions – triple-store and vertically-partitioned --- in both systems. This allows us to expand the scope of with the performance characterization along both dimensions --- triple-store vs. vertically-partitioned and row-store vs. column-store --- individually, before analyzing their combined effects. A detailed report of the experimental test-bed, as well as an in-depth analysis of the parameters involved, clarify the scope of the solution originally presented and position the results in a broader context by covering more systems

CWI's Institutional Repository

Reactive phenotypes after acute and chronic NK‐cell activation

Author: ALMEIDA J.
FONSECA S.
GONCALVES M.
LIMA M
MOURA J.
ORFAO A.
PINTO RIBEIRO A.C.
QUEIROS M.L.
SANTOS A.H.
TEIXEIRA M.A.
Publication venue: Wichtig Editore
Publication date: 01/07/2004
Field of study

J Biol Regul Homeost Agents. 2004 Jul-Dec;18(3-4):331-4. Reactive phenotypes after acute and chronic NK-cell activation. Lima M, Almeida J, Teixeira MA, Santos AH, Queirós ML, Fonseca S, Moura J, Gonçalves M, Orfão A, Pinto Ribeiro AC. Service of Clinical Hematology, Laboratory of Cytometry, Hospital Geral de Santo António, Porto, Portugal. [email protected] Abstract Several phenotypic changes have been shown to occur after NK-cell stimulation, involving molecules that have been proved to regulate NK-cell migration into tissues and NK-cell activation and proliferation as well as target cell recognition and killing. Here, we review the reactive phenotypes observed in vivo after acute and chronic NK-cell activation. PMID: 15786700 [PubMed - indexed for MEDLINE

Repositório Científico do Centro Hospitalar do Porto

Benchmarking and improving point cloud data management in MonetDB

Author: Alvanaki F. (Foteini)
Ivanova M.G. (Milena)
Kersten M.L. (Martin)
Martinez-Rubi O. (Oscar)
Oosterom P. van
Pereira Goncalves R.A. (Romulo Antonio)
Tijssen T.
Publication venue: 'American College of Medical Physics (ACMP)'
Publication date: 01/07/2014
Field of study

The popularity, availability and sizes of point cloud data sets are increasing, thus raising interesting data management and processing challenges. Various software solutions are available for the management of point cloud data. A benchmark for point cloud data management systems was defined and it was executed for several solutions. In this paper we focus on the solutions based on the column-store MonetDB, the generic out-of-the-box approach is compared with two alternative approaches that exploit the spatial coherence of the data to improve the data access and to minimize the storage requirement

CWI's Institutional Repository