Search CORE

1 research outputs found

Compressed database structure to manage large scale data in a distributed environment

Author: Hannaford Michael
Henskens Frans
Monjurul Alom B. M.
Publication venue: INSTICC
Publication date
Field of study

Loss-less data compression is attractive in database systems as it may facilitate query performance improvement and storage reduction. Although there are many compression techniques which handle the whole database in main memory, problems arise when the amount of data increases gradually over time, and also when the data has high cardinality. Management of a rapidly evolving large volume of data in a scalable way is very challenging. This paper describes a disk based single vector large data cardinality approach, incorporating data compression in a distributed environment. The approach provides substantial storage performance improvement compared to other high performance database systems. The compressed database structure presented provides direct addressability in a distributed environment, thereby reducing retrieval latency when handling large volumes of data

University of Newcastle's Digital Repository