300 research outputs found

    A Framework for Approximate Optimization of BoT Application Deployment in Hybrid Cloud Environment

    Get PDF
    We adopt a systematic approach to investigate the efficiency of near-optimal deployment of large-scale CPU-intensive Bag-of-Task applications running on cloud resources with the non-proportional cost to performance ratios. Our analytical solutions perform in both known and unknown running time of the given application. It tries to optimize users' utility by choosing the most desirable tradeoff between the make-span and the total incurred expense. We propose a schema to provide a near-optimal deployment of BoT application regarding users' preferences. Our approach is to provide user with a set of Pareto-optimal solutions, and then she may select one of the possible scheduling points based on her internal utility function. Our framework can cope with uncertainty in the tasks' execution time using two methods, too. First, an estimation method based on a Monte Carlo sampling called AA algorithm is presented. It uses the minimum possible number of sampling to predict the average task running time. Second, assuming that we have access to some code analyzer, code profiling or estimation tools, a hybrid method to evaluate the accuracy of each estimation tool in certain interval times for improving resource allocation decision has been presented. We propose approximate deployment strategies that run on hybrid cloud. In essence, proposed strategies first determine either an estimated or an exact optimal schema based on the information provided from users' side and environmental parameters. Then, we exploit dynamic methods to assign tasks to resources to reach an optimal schema as close as possible by using two methods. A fast yet simple method based on First Fit Decreasing algorithm, and a more complex approach based on the approximation solution of the transformed problem into a subset sum problem. Extensive experiment results conducted on a hybrid cloud platform confirm that our framework can deliver a near optimal solution respecting user's utility function

    Atlas: Hybrid Cloud Migration Advisor for Interactive Microservices

    Full text link
    Hybrid cloud provides an attractive solution to microservices for better resource elasticity. A subset of application components can be offloaded from the on-premises cluster to the cloud, where they can readily access additional resources. However, the selection of this subset is challenging because of the large number of possible combinations. A poor choice degrades the application performance, disrupts the critical services, and increases the cost to the extent of making the use of hybrid cloud unviable. This paper presents Atlas, a hybrid cloud migration advisor. Atlas uses a data-driven approach to learn how each user-facing API utilizes different components and their network footprints to drive the migration decision. It learns to accelerate the discovery of high-quality migration plans from millions and offers recommendations with customizable trade-offs among three quality indicators: end-to-end latency of user-facing APIs representing application performance, service availability, and cloud hosting costs. Atlas continuously monitors the application even after the migration for proactive recommendations. Our evaluation shows that Atlas can achieve 21% better API performance (latency) and 11% cheaper cost with less service disruption than widely used solutions.Comment: To appear at EuroSys 202

    A Decision Support Model for Cloud Bursting

    Get PDF
    The cloud computing market divides into public (commercial) and private (self-provisioned) clouds. The concept of cloud bursting combines public and private clouds: The private cloud (internal resources) provides the computational capacity, but a part of the demand is offloaded onto public clouds. This article proposes an easy-to-apply economic decision support model for determining on the one hand the optimal size of the internal capacity for cloud bursting technology, and on the other hand the cost savings. The model uses an expected value approach that considers stochastic workload and is flexible with respect to the distribution choice. Two empirical examples demonstrate the applicability of the model

    A user-centric execution environment for <em>CineGrid</em> workloads

    Get PDF
    The abundance and heterogeneity of IT resources available, together with the ability to dynamically scale applications poses significant usability issues to users. Without understanding the performance profile of available resources users are unable to efficiently scale their applications in order to meet performance objectives. High quality media collaborations, like CineGrid, are one example of such diverse environments where users can leverage dynamic infrastructures to move and process large amounts of data. This paper describes our user-centric approach to executing high quality media processing workloads over dynamic infrastructures. Our main contribution is the CGtoolkit environment, an integrated system which aids users cope with the infrastructure complexity and large data sets specific to the digital cinema domain

    Fuzzy-Logic Based Call Admission Control in 5G Cloud Radio Access Networks with Pre-emption

    Get PDF
    YesFifth generation (5G) cellular networks will be comprised of millions of connected devices like wearable devices, Androids, iPhones, tablets and the Internet of Things (IoT) with a plethora of applications generating requests to the network. The 5G cellular networks need to cope with such sky-rocketing tra c requests from these devices to avoid network congestion. As such, cloud radio access networks (C-RAN) has been considered as a paradigm shift for 5G in which requests from mobile devices are processed in the cloud with shared baseband processing. Despite call admission control (CAC) being one of radio resource management techniques to avoid the network congestion, it has recently been overlooked by the community. The CAC technique in 5G C-RAN has a direct impact on the quality of service (QoS) for individual connections and overall system e ciency. In this paper, a novel Fuzzy-Logic based CAC scheme with pre-emption in C-RAN is proposed. In this scheme, cloud bursting technique is proposed to be used during congestion, where some delay tolerant low-priority connections are pre-empted and outsourced to a public cloud with a penalty charge. Simulation results show that the proposed scheme has low blocking probability below 5%, high throughput, low energy consumption and up to 95% of return on revenue

    A Survey on Load Balancing in Cloud Computing Using Optimization Technique

    Get PDF
    The main goal of this work is to create a system that uses the improved Ant Colony and Artificial Bee Colony (AB) algorithm to provide load balancing for the cloud computing technology. This algorithm is a combination of Ant colony algorithm and Artificial Bee colony algorithm. It will improve the existing AB algorithm. There are certain limitations in the existing algorithm. This algorithm will overcome those limitations and provide good optimal solution for effective load balancing

    A Self-Tuning procedure for resource management in InterCloud Computing

    Get PDF
    Beijing Key Laboratory on Integration and Analysis of Large-scale Stream Data, College of Computer Science, North China University of Technology, Beijing, China The file attached to this record is the author's final peer reviewed version. The Publisher's final version can be found by following the DOI link.InterCloud Computing is a new cloud paradigm designed to guarantee service quality or performance and availability of on-demand resources. InterCloud enables cloud interoperability by promoting the interworking of cloud systems from different cloud providers using standard interfacing. Resource management in InterCloud, considered as an important functional requirement, has not attracted commensurate research attention. The focus of this paper is to propose a Software Cybernetic approach, in the form of an adaptive control framework, for efficient management of shared resources in peer-to-peer InterCloud computing. This research effort adopts cooperative game theory to model resource management in InterCloud. The space of cooperative arrangements (resource sharing) between the participant cloud systems is presented by using Integer Partitioning to characterise the worst case communication complexity in peer to peer InterCloud. Essentially, this paper presents an Integer partition based anytime algorithm as an optimal cost solution to the bi-objective optimisation problem in resource management, anchored principally on practical trade-off between the desired performance (quality of service) and communication complexity of collaborating resource clouds

    Serverless Computing and Scheduling Tasks on Cloud: A Review

    Get PDF
    Recently, the emergence of Function-as-a-Service (FaaS) has gained increasing attention by researchers. FaaS, also known as serverless computing, is a new concept in cloud computing that allows the services computation that triggers the code execution as a response for certain events. In this paper, we discuss various proposals related to scheduling tasks in clouds. These proposals are categorized according to their objective functions, namely minimizing execution time, minimizing execution cost, or multi objectives (time and cost). The dependency relationships between the tasks plays a vital role in determining the efficiency of the scheduling approach. This dependency may result in resources underutilization. FaaS is expected to have a significant impact on the process of scheduling tasks. This problem can be reduced by adopting a hybrid approach that combines both the benefit of FaaS and Infrastructure-as-a-Service (IaaS). Using FaaS, we can run the small tasks remotely and focus only on scheduling the large tasks. This helps in increasing the utilization of the resources because the small tasks will not be considered during the process of scheduling. An extension of the restricted time limit by cloud vendors will allow running the complete workflow using the serverless architecture, avoiding the scheduling problem
    • …
    corecore