8,346 research outputs found
Cost-aware real-time divisible loads scheduling in cloud computing
Cloud computing has become an important alternative for solving large scale resource intensive problems in science, engineering, and analytics. Resource management play an important role in improving the quality of service (QoS). This paper is concerned with the investigation of scheduling strategies for divisible loads with deadlines constraints upon heterogeneous processors in a cloud computing environment. The workload allocation approach presents in this paper is using Divisible Load Theory (DLT). It is based on the fact that the computation can be partitioned into some arbitrary sizes and each partition can be processed independently of each other. Through series of simulation against the baseline strategies, it can be found that the worker selection order in the service pool and the amount of fraction load assigned to each of them have significant effects on the total computation cost.Keywords: Cloud computing, Divisible Load Theory (DLT), Cost, Quality-of-service (QoS
QoS-aware trust establishment for cloud federation
Cloud federation enables inter-layer resource exchanges among multiple, heterogeneous cloud service providers. This article proposes a Quality of Service (QoS) aware trust model for effective resource allocation in response to the various user requests within the Clouds4Coordination (C4C) federation system. This QoS mainly comprises of nine parameters combined into three categories: (i) node profile, (ii) reliability, and (iii) competence. Numerical values for these parameters are computed every âtâ seconds for each cloud provider. All values measured over an interval Ît are further processed by the proposed model to evaluate the utility associated with a provider (referred to as a discipline in the presented case study). The decision about interacting with a discipline in a collaborative project is based on this utility value. The systems architecture, evaluation methodology, proposed model, and experimental evaluation on a practical test bed is outlined. The proposed QoS-aware trust evaluation mechanism allows selection of the most useful (based on a utility value) providers. The proposed approach can be used to support federation of cloud services across a number of different application domains
Autonomic management of virtualized resources in cloud computing
The last five years have witnessed a rapid growth of cloud computing in business, governmental and educational IT deployment. The success of cloud services depends critically on the effective management of virtualized resources. A key requirement of cloud management is the ability to dynamically match resource allocations to actual demands, To this end, we aim to design and implement a cloud resource management mechanism that manages underlying complexity, automates resource provisioning and controls client-perceived quality of service (QoS) while still achieving resource efficiency.
The design of an automatic resource management centers on two questions: when to adjust resource allocations and how much to adjust. In a cloud, applications have different definitions on capacity and cloud dynamics makes it difficult to determine a static resource to performance relationship. In this dissertation, we have proposed a generic metric that measures application capacity, designed model-independent and adaptive approaches to manage resources and built a cloud management system scalable to a cluster of machines.
To understand web system capacity, we propose to use a metric of
productivity index (PI), which is defined as the ratio of yield to
cost, to measure the system processing capability online. PI is a generic concept that can be applied to different levels to monitor system progress in order to identify if more capacity is needed. We applied the concept of PI to the problem of overload prevention in multi-tier websites. The overload predictor built on the PI metric shows more accurate and responsive overload prevention compared to conventional approaches.
To address the issue of the lack of accurate server model, we propose a model-independent fuzzy control based approach for CPU allocation. For adaptive and stable control performance, we embed the controller with self-tuning output amplification and flexible rule selection. Finally, we build a QoS provisioning framework that supports multi-objective QoS control and service differentiation. Experiments on a virtual cluster with two service classes show the effectiveness of our approach in both performance and power control.
To address the problems of complex interplay between resources and process delays in fine-grained multi-resource allocation, we consider capacity management as a decision-making problem and employ reinforcement learning (RL) to optimize the process. The optimization depends on the trial-and-error interactions with the cloud system. In order to improve the initial management performance, we propose a model-based RL algorithm. The neural network based environment model, which is learned from previous management history, generates simulated resource allocations for the RL agent. Experiment results on heterogeneous applications show that our approach makes efficient use of limited interactions and find near optimal resource configurations within 7 steps.
Finally, we present a distributed reinforcement learning approach to the cluster-wide cloud resource management. We decompose the cluster-wide resource allocation problem into sub-problems concerning individual VM resource configurations. The cluster-wide allocation is optimized if individual VMs meet their SLA with a high resource utilization. For scalability, we develop an efficient reinforcement learning approach with continuous state space. For adaptability, we use VM low-level runtime statistics to accommodate workload dynamics. Prototyped in a iBalloon system, the distributed learning approach successfully manages 128 VMs on a 16-node close correlated cluster
Recommended from our members
On Optimal and Fair Service Allocation in Mobile Cloud Computing
This paper studies the optimal and fair service allocation for a variety of
mobile applications (single or group and collaborative mobile applications) in
mobile cloud computing. We exploit the observation that using tiered clouds,
i.e. clouds at multiple levels (local and public) can increase the performance
and scalability of mobile applications. We proposed a novel framework to model
mobile applications as a location-time workflows (LTW) of tasks; here users
mobility patterns are translated to mobile service usage patterns. We show that
an optimal mapping of LTWs to tiered cloud resources considering multiple QoS
goals such application delay, device power consumption and user cost/price is
an NP-hard problem for both single and group-based applications. We propose an
efficient heuristic algorithm called MuSIC that is able to perform well (73% of
optimal, 30% better than simple strategies), and scale well to a large number
of users while ensuring high mobile application QoS. We evaluate MuSIC and the
2-tier mobile cloud approach via implementation (on real world clouds) and
extensive simulations using rich mobile applications like intensive signal
processing, video streaming and multimedia file sharing applications. Our
experimental and simulation results indicate that MuSIC supports scalable
operation (100+ concurrent users executing complex workflows) while improving
QoS. We observe about 25% lower delays and power (under fixed price
constraints) and about 35% decrease in price (considering fixed delay) in
comparison to only using the public cloud. Our studies also show that MuSIC
performs quite well under different mobility patterns, e.g. random waypoint and
Manhattan models
A Self-adaptive Agent-based System for Cloud Platforms
Cloud computing is a model for enabling on-demand network access to a shared
pool of computing resources, that can be dynamically allocated and released
with minimal effort. However, this task can be complex in highly dynamic
environments with various resources to allocate for an increasing number of
different users requirements. In this work, we propose a Cloud architecture
based on a multi-agent system exhibiting a self-adaptive behavior to address
the dynamic resource allocation. This self-adaptive system follows a MAPE-K
approach to reason and act, according to QoS, Cloud service information, and
propagated run-time information, to detect QoS degradation and make better
resource allocation decisions. We validate our proposed Cloud architecture by
simulation. Results show that it can properly allocate resources to reduce
energy consumption, while satisfying the users demanded QoS
Optimization of Elastic Cloud Brokerage Mechanisms for Future Telecommunication Service Environments
Dieser Beitrag ist mit Zustimmung des Rechteinhabers aufgrund einer (DFG geförderten) Allianz- bzw. Nationallizenz frei zugĂ€nglich.This publication is with permission of the rights owner freely accessible due to an Alliance licence and a national licence (funded by the DFG, German Research Foundation) respectively.Cloud computing mechanisms and cloud-based services are currently revolutionizing Web as well as telecommunication service platforms and service offerings. Apart from providing infrastructures, platforms and software as a service, mechanism for dynamic allocation of compute and storage resources on-demand, commonly termed as âelastic cloud computingâ account for the most important cloud computing functionalities.
Resource elasticity allows not only for efficient internal compute and storage resource consumption, but also, through so called hybrid cloud computing mechanisms, for dynamic utilization of external resources on-demand. This capability is especially useful in order to cost-efficiently cope with peakworkloads, allowing service providers to significantly reduce usually required over-provisioned service infrastructures, allowing for âpay-per-useâ cost models.
With a steadily growing number of cloud providers and with the proliferation of unified cloud computing interfaces, service providers are given free choice of flexibly selecting and utilizing cloud resources from different cloud providers. Cloud brokering systems allow for dynamic selection and utilization of cloud computing resources based on functional (e.g. QoS, SLA, energy consumption) as well as nonfunctional criteria (e.g. costs).
The presented work focuses on enhanced cloud brokering mechanisms for telecommunication service platforms, enabling quality telecommunication service assurance, still optimizing cloud resources consumption, i.e. saving costs and energy.
Furthermore this work shows that by combining cloud brokering mechanisms with standardized telecommunication service brokering mechanisms an even greater benefit for telecommunication service providers can be achieved as this enables an even better cost-efficiency since different user segments can seamlessly be served by allocating different cloud resources to them in a policy-driven manner
Energy-Efficient Management of Data Center Resources for Cloud Computing: A Vision, Architectural Elements, and Open Challenges
Cloud computing is offering utility-oriented IT services to users worldwide.
Based on a pay-as-you-go model, it enables hosting of pervasive applications
from consumer, scientific, and business domains. However, data centers hosting
Cloud applications consume huge amounts of energy, contributing to high
operational costs and carbon footprints to the environment. Therefore, we need
Green Cloud computing solutions that can not only save energy for the
environment but also reduce operational costs. This paper presents vision,
challenges, and architectural elements for energy-efficient management of Cloud
computing environments. We focus on the development of dynamic resource
provisioning and allocation algorithms that consider the synergy between
various data center infrastructures (i.e., the hardware, power units, cooling
and software), and holistically work to boost data center energy efficiency and
performance. In particular, this paper proposes (a) architectural principles
for energy-efficient management of Clouds; (b) energy-efficient resource
allocation policies and scheduling algorithms considering quality-of-service
expectations, and devices power usage characteristics; and (c) a novel software
technology for energy-efficient management of Clouds. We have validated our
approach by conducting a set of rigorous performance evaluation study using the
CloudSim toolkit. The results demonstrate that Cloud computing model has
immense potential as it offers significant performance gains as regards to
response time and cost saving under dynamic workload scenarios.Comment: 12 pages, 5 figures,Proceedings of the 2010 International Conference
on Parallel and Distributed Processing Techniques and Applications (PDPTA
2010), Las Vegas, USA, July 12-15, 201
Resource allocation and scheduling of multiple composite web services in cloud computing using cooperative coevolution genetic algorithm
In cloud computing, resource allocation and scheduling of multiple composite web services is an important and challenging problem. This is especially so in a hybrid cloud where there may be some low-cost resources available from private clouds and some high-cost resources from public clouds. Meeting this challenge involves two classical computational problems: one is assigning resources to each of the tasks in the composite web services; the other is scheduling the allocated resources when each resource may be used by multiple tasks at different points of time. In addition, Quality-of-Service (QoS) issues, such as execution time and running costs, must be considered in the resource allocation and scheduling problem. Here we present a Cooperative Coevolutionary Genetic Algorithm (CCGA) to solve the deadline-constrained resource allocation and scheduling problem for multiple composite web services. Experimental results show that our CCGA is both efficient and scalable
- âŠ