Querying Very Large Multi-dimensional Datasets in ADR - Extended
Abstract

Chang, Chialin; Ferreira, Renato; Kurc, Tahsin; Saltz, Joel; Sussman, Alan

Querying Very Large Multi-dimensional Datasets in ADR - Extended Abstract

Authors: Chialin Chang
Renato Ferreira
Tahsin Kurc
Joel Saltz
Alan Sussman
Publication date: 26 May 1999
Publisher

Abstract

This paper addresses optimizing the execution of range queries into multi-dimensional datasets on distributed memory parallel machines within the Active Data Repository framework. ADR is an infrastructure that integrates storage, retrieval and processing of large multi-dimensional datasets on distributed memory parallel architectures with multiple disks attached to each node. We describe three potential strategies for efficient execution of such queries that employ different tiling and workload partitioning approaches. We evaluate scalability of these strategies for different application scenarios, varying both the number of processors and the input dataset size on a 128 processor IBM SP multicomputer. Also cross-referenced as UMIACS-TR-99-2

Similar works

Full text

Open in the Core reader

Download PDF

Available Versions

Digital Repository at the University of Maryland

oai:drum.lib.umd.edu:1903/1011

Last time updated on 12/11/2016