7,717 research outputs found
An Adaptive Genetic Algorithm with Dynamic Population Size for Optimizing Join Queries
The problem of finding the optimal join ordering executing a query to a relational database
management system is a combinatorial optimization problem, which makes deterministic exhaustive solution
search unacceptable for queries with a great number of joined relations. In this work an adaptive genetic
algorithm with dynamic population size is proposed for optimizing large join queries. The performance of the
algorithm is compared with that of several classical non-deterministic optimization algorithms. Experiments have
been performed optimizing several random queries against a randomly generated data dictionary. The proposed
adaptive genetic algorithm with probabilistic selection operator outperforms in a number of test runs the canonical
genetic algorithm with Elitist selection as well as two common random search strategies and proves to be a viable
alternative to existing non-deterministic optimization approaches
BioWorkbench: A High-Performance Framework for Managing and Analyzing Bioinformatics Experiments
Advances in sequencing techniques have led to exponential growth in
biological data, demanding the development of large-scale bioinformatics
experiments. Because these experiments are computation- and data-intensive,
they require high-performance computing (HPC) techniques and can benefit from
specialized technologies such as Scientific Workflow Management Systems (SWfMS)
and databases. In this work, we present BioWorkbench, a framework for managing
and analyzing bioinformatics experiments. This framework automatically collects
provenance data, including both performance data from workflow execution and
data from the scientific domain of the workflow application. Provenance data
can be analyzed through a web application that abstracts a set of queries to
the provenance database, simplifying access to provenance information. We
evaluate BioWorkbench using three case studies: SwiftPhylo, a phylogenetic tree
assembly workflow; SwiftGECKO, a comparative genomics workflow; and RASflow, a
RASopathy analysis workflow. We analyze each workflow from both computational
and scientific domain perspectives, by using queries to a provenance and
annotation database. Some of these queries are available as a pre-built feature
of the BioWorkbench web application. Through the provenance data, we show that
the framework is scalable and achieves high-performance, reducing up to 98% of
the case studies execution time. We also show how the application of machine
learning techniques can enrich the analysis process
Investigating Decision Support Techniques for Automating Cloud Service Selection
The compass of Cloud infrastructure services advances steadily leaving users
in the agony of choice. To be able to select the best mix of service offering
from an abundance of possibilities, users must consider complex dependencies
and heterogeneous sets of criteria. Therefore, we present a PhD thesis proposal
on investigating an intelligent decision support system for selecting Cloud
based infrastructure services (e.g. storage, network, CPU).Comment: Accepted by IEEE Cloudcom 2012 - PhD consortium trac
- …