Here are my data files. Here are my queries. Where are my results?

Author: Ailamaki Anastasia
Alagiannis I.
Idreos Stratos
Johnson R.
Publication venue: 'American College of Medical Physics (ACMP)'
Publication date: 06/12/2010

Database management systems (DBMS) provide incredible flexibility and performance when it comes to query processing, scalability and accuracy. To fully exploit DBMS features, however, the user must define a schema, load the data, tune the system for the expected workload, and answer several questions. Should the database use a column-store, a row-store or some hybrid format? What indices should be created? All these questions make for a formidable and time-consuming hurdle, often deterring new applications or imposing high costs on existing ones. A characteristic example is that of scientific databases with huge data sets. The prohibitive initialization cost and complexity still force scientists to rely on "ancient" tools for their data management tasks, delaying scientific understanding and progress. Users and applications collect their data in flat files, which have traditionally been considered to be "outside" a DBMS. A DBMS wants control: always bring all data "inside", replicate it and format it in its own "secret" way. The problem has been recognized, and current efforts extend existing systems with abilities such as reading information from flat files and gracefully incorporating it into the processing engine. This paper proposes a new generation of systems where the only requirement from the user is a link to the raw data files. Queries can then be fired immediately, without preparation steps in between. Internally and in an abstract way, the system takes care of selectively, adaptively and incrementally providing the proper environment given the queries at hand. Only part of the data is loaded at any given time, and it is stored and accessed in the format suitable for the current workload.
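
The idea of querying raw files directly, loading only what each query touches, can be illustrated with a minimal sketch. This is not the system described in the abstract; the class `RawFileTable`, its `select` method, the `open_raw` parameter and the sample data are all hypothetical names chosen for illustration, and the sketch assumes a small numeric CSV file. It shows only the general pattern: the user supplies a link to the raw data, and columns are parsed and cached incrementally as queries ask for them.

```python
import csv
import io


class RawFileTable:
    """Hypothetical in-situ table: only a link to the raw CSV data is given up front."""

    def __init__(self, open_raw):
        # open_raw: zero-argument callable returning a fresh text stream of the raw file
        self._open_raw = open_raw
        self._columns = {}  # column name -> parsed values, filled on demand
        self.scans = 0      # number of passes made over the raw file

    def _load(self, names):
        # Parse only the columns that no earlier query has already loaded.
        missing = [n for n in names if n not in self._columns]
        if not missing:
            return
        self.scans += 1
        loaded = {n: [] for n in missing}
        with self._open_raw() as stream:
            for row in csv.DictReader(stream):
                for n in missing:
                    loaded[n].append(float(row[n]))
        self._columns.update(loaded)

    def select(self, column, where_column, predicate):
        # Return values of `column` for rows whose `where_column` satisfies `predicate`.
        self._load([column, where_column])
        values = self._columns[column]
        keys = self._columns[where_column]
        return [v for v, k in zip(values, keys) if predicate(k)]


# Sample raw data (made up for this sketch); a real file would be opened with open(path).
RAW = "id,temp,pressure\n1,20.5,1001\n2,25.0,998\n3,30.25,1005\n"
table = RawFileTable(lambda: io.StringIO(RAW))

# The first query runs immediately; only "id" and "temp" are parsed and cached.
hot = table.select("id", "temp", lambda t: t > 22)
```

In this sketch a later query over `pressure` would trigger one more pass over the raw file for that column alone, while repeated queries over `id` and `temp` are answered from the cache. A real system would also adapt the storage layout to the workload, which this example does not attempt.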

Infoscience - École polytechnique fédérale de Lausanne

CWI's Institutional Repository

Just-in-time data virtualization: lightweight data management with ViDa.

Author: Ailamaki A
Alagiannis I
Branco M
Heinis T
Karpathiotakis M
Publication date: 04/01/2015

Spiral - Imperial College Digital Repository

NoDB in action

Author: Alagiannis I.
Chaudhuri S.
Idreos S.
Jagadish H. V.
Kersten M. L.
Nandi A.
Stonebraker M.
Publication venue: 'VLDB Endowment'

Crossref

Adaptive query processing on RAW data

Author: Aad G.
Abadi D.
Abadi D. J.
Alagiannis I.
Boncz P. A.
Brun R.
DeWitt D. J.
Graefe G.
Ivanova M.
Klonatos Y.
Koch C.
Krikellas K.
Laney D.
Lattner C.
Olston C.
Pirk H.
Stonebraker M.
Wu K.
Publication venue: 'VLDB Endowment'

Crossref

Towards Large-Scale Meteorological Data Services: A Case Study

Author: I Alagiannis
M Kersten
P Baumann
R Machlin
RGG Cattell
S Manegold
TK Sellis
Publication venue: 'Springer Science and Business Media LLC'

Crossref