Search CORE

1,868 research outputs found

Transfer Learning for Improving Model Predictions in Highly Configurable Software

Author: Jamshidi Pooyan
Kawthekar Prasad
Kästner Christian
Siegmund Norbert
Velez Miguel
Publication venue
Publication date: 20/04/2017
Field of study

Modern software systems are built to be used in dynamic environments using configuration capabilities to adapt to changes and external uncertainties. In a self-adaptation context, we are often interested in reasoning about the performance of the systems under different configurations. Usually, we learn a black-box model based on real measurements to predict the performance of the system given a specific configuration. However, as modern systems become more complex, there are many configuration parameters that may interact and we end up learning an exponentially large configuration space. Naturally, this does not scale when relying on real measurements in the actual changing environment. We propose a different solution: Instead of taking the measurements from the real system, we learn the model using samples from other sources, such as simulators that approximate performance of the real system at low cost. We define a cost model that transform the traditional view of model learning into a multi-objective problem that not only takes into account model accuracy but also measurements effort as well. We evaluate our cost-aware transfer learning solution using real-world configurable software including (i) a robotic system, (ii) 3 different stream processing applications, and (iii) a NoSQL database system. The experimental results demonstrate that our approach can achieve (a) a high prediction accuracy, as well as (b) a high model reliability.Comment: To be published in the proceedings of the 12th International Symposium on Software Engineering for Adaptive and Self-Managing Systems (SEAMS'17

arXiv.org e-Print Archive

Crossref

Interplaying Cassandra NoSQL Consistency and Performance: A Benchmarking Approach

Author: A Gorbenko
AD Fekete
D Pritchett
DJ Abadi
E Brewer
F Cruz
G Haughian
H Mezni
O Tarasyuk
R Guerraoui
S Burckhardt
S Gilbert
V Farias
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 31/08/2020
Field of study

This experience report analyses performance of the Cassandra NoSQL database and studies the fundamental trade-off between data consistency and delays in distributed data storages. The primary focus is on investigating the interplay between the Cassandra performance (response time) and its consistency settings. The paper reports the results of the read and write performance benchmarking for a replicated Cassandra cluster, deployed in the Amazon EC2 Cloud. We present quantitative results showing how different consistency settings affect the Cassandra performance under different workloads. One of our main findings is that it is possible to minimize Cassandra delays and still guarantee the strong data consistency by optimal coordination of consistency settings for both read and write requests. Our experiments show that (i) strong consistency costs up to 25% of performance and (ii) the best setting for strong consistency depends on the ratio of read and write operations. Finally, we generalize our experience by proposing a benchmarking-based methodology for run-time optimization of consistency settings to achieve the maximum Cassandra performance and still guarantee the strong data consistency under mixed workloads

Crossref

Leeds Beckett Repository

Cloud Services Brokerage: A Survey and Research Roadmap

Author: Barker Adam
Thai Long
Varghese Blesson
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/06/2015
Field of study

A Cloud Services Brokerage (CSB) acts as an intermediary between cloud service providers (e.g., Amazon and Google) and cloud service end users, providing a number of value adding services. CSBs as a research topic are in there infancy. The goal of this paper is to provide a concise survey of existing CSB technologies in a variety of areas and highlight a roadmap, which details five future opportunities for research.Comment: Paper published in the 8th IEEE International Conference on Cloud Computing (CLOUD 2015

arXiv.org e-Print Archive

Crossref

University of St. Andrews - Pure

St Andrews Research Repository

Curriculum Guidelines for Undergraduate Programs in Data Science

Author: Agarwal Mahesh
Averett Maia
Baumer Benjamin
Bray Andrew
Bressoud Thomas
Bryant Lance
Cheng Lei
De Veaux Richard
Francis Amanda
Gould Robert
Kim Albert Y.
Kretchmar Matt
Lu Qin
Moskol Ann
Nolan Deborah
Pelayo Roberto
Raleigh Sean
Sethi Ricky J.
Sondjaja Mutiara
Tiruviluamala Neelesh
Uhlig Paul
Washington Talitha
Wesley Curtis
White David
Ye Ping
Publication venue: 'Annual Reviews'
Publication date: 01/01/2017
Field of study

The Park City Math Institute (PCMI) 2016 Summer Undergraduate Faculty Program met for the purpose of composing guidelines for undergraduate programs in Data Science. The group consisted of 25 undergraduate faculty from a variety of institutions in the U.S., primarily from the disciplines of mathematics, statistics and computer science. These guidelines are meant to provide some structure for institutions planning for or revising a major in Data Science

arXiv.org e-Print Archive

Smith College: Smith ScholarWorks

Security-aware elasticity for NoSQL databases in multi-cloud environments

Author: Gounaris Anastasios
Katsaros Panagiotis
Mouratidis Haralambos
Naskos Athanasios
Publication venue: 'Inderscience Publishers'
Publication date: 14/09/2017
Field of study

University of Brighton Research Portal

Composable architecture for rack scale big data computing

Author: Abali Bulent
Chang Victor
Franke Hubertus
Kesavan Mukil
Li Chung-Sheng
Parris Colin
Publication venue: 'Elsevier BV'
Publication date: 01/02/2017
Field of study

The rapid growth of cloud computing, both in terms of the spectrum and volume of cloud workloads, necessitate re-visiting the traditional rack-mountable servers based datacenter design. Next generation datacenters need to offer enhanced support for: (i) fast changing system configuration requirements due to workload constraints, (ii) timely adoption of emerging hardware technologies, and (iii) maximal sharing of systems and subsystems in order to lower costs. Disaggregated datacenters, constructed as a collection of individual resources such as CPU, memory, disks etc., and composed into workload execution units on demand, are an interesting new trend that can address the above challenges. In this paper, we demonstrated the feasibility of composable systems through building a rack scale composable system prototype using PCIe switch. Through empirical approaches, we develop assessment of the opportunities and challenges for leveraging the composable architecture for rack scale cloud datacenters with a focus on big data and NoSQL workloads. In particular, we compare and contrast the programming models that can be used to access the composable resources, and developed the implications for the network and resource provisioning and management for rack scale architecture

Southampton (e-Prints Soton)

Consumer Life Cycle and Profiling: A Data Mining Perspective

Author: Farruh Kushnazarov
Publication venue: 'IntechOpen'
Publication date: 23/04/2019
Field of study

With the development of technology and continuously increasing of the market demand, the concept to produce better merchandises is generated in the companies. Each customer wants an individual approach or exclusive product, which creates the concept: “one customer one product.” The implementation of the one-to-one approach in the current days is the main exciting task of companies. Millions of customers lead to millions of exclusive products from the manufactures’ views. It is the primary step to study the needs of customers in the market economy. The main task for a company is to know the customer and to provide their desired products and services. In order to get knowledge ahead of the customers’ wishes, a system of profiling potential customers is created accordingly. This chapter provides the review of the customer lifetime from the reach customer (claim future customer’s attention) to the loyalty customer (turn a customer into a company advocate). During the discussion about the customer lifetime, readers will get acquainted with such technologies as funnel analysis, data management platform, customer profiling, customer behavior analysis, and others. The listed technologies in a complex will be created as the one-to-one product or service with a high Return on Investment (ROI)

IntechOpen

Crossref