11,743 research outputs found
Inside Dropbox: Understanding Personal Cloud Storage Services
Personal cloud storage services are gaining popularity. With a rush of providers to enter the market and an increasing of- fer of cheap storage space, it is to be expected that cloud storage will soon generate a high amount of Internet traffic. Very little is known about the architecture and the perfor- mance of such systems, and the workload they have to face. This understanding is essential for designing efficient cloud storage systems and predicting their impact on the network. This paper presents a characterization of Dropbox, the leading solution in personal cloud storage in our datasets. By means of passive measurements, we analyze data from four vantage points in Europe, collected during 42 consecu- tive days. Our contributions are threefold: Firstly, we are the first to study Dropbox, which we show to be the most widely-used cloud storage system, already accounting for a volume equivalent to around one third of the YouTube traffic at campus networks on some days. Secondly, we characterize the workload typical users in different environments gener- ate to the system, highlighting how this reflects on network traffic. Lastly, our results show possible performance bot- tlenecks caused by both the current system architecture and the storage protocol. This is exacerbated for users connected far from control and storage data-center
Workload characterization and customer interaction at e-commerce web servers
Electronic commerce servers have a significant presence in today's Internet. Corporations want to maintain high availability, sufficient capacity, and satisfactory performance for their E-commerce Web systems, and want to provide satisfactory services to customers. Workload characterization and the analysis of customers' interactions with Web sites are the bases upon which to analyze server performance, plan system capacity, manage system resources, and personalize services at the Web site. To date, little empirical evidence has been discovered that identifies the characteristics for Web workloads of E-commerce systems and the behaviours of customers.
This thesis analyzes the Web access logs at public Web sites for three organizations: a car rental company, an IT company, and the Computer
Science department of the University of Saskatchewan. In these case studies, the
characteristics of Web workloads are explored at the request level, functionlevel, resource level, and session level; customers' interactions
with Web sites are analyzed by identifying
and characterizing session groups.
The main E-commerce Web workload characteristics and performance implications are: i) The requests for dynamic Web objects are an important
part of the workload. These requests should be characterized separately since the system processes them differently; ii) Some popular image files, which are embedded in the same Web page, are always requested together. If these files are requested and sent in a bundle, a system will greatly reduce the overheads in processing requests for these files; iii) The
percentage of requests for each Web page category tends to be stable in the workload when the time scale is large enough. This observation is helpful in forecasting workload composition; iv) the Secure Socket Layer protocol (SSL) is heavily used and most Web objects are either requested primarily through SSL or primarily not through SSL; and v) Session groups of different characteristics are identified for all logs. The analysis of session groups may be helpful in improving system performance, maximizing revenue throughput of the system, providing better services to customers, and managing and planning system resources.
A hybrid clustering algorithm, which is a combination of the minimum spanning tree method and k-means clustering algorithm, is proposed to identify session clusters. Session clusters obtained using the three session representations
Pages Requested, Navigation Pattern, and Resource Usage are similar enough so that it is possible to use different session representations interchangeably to produce similar groupings. The grouping based on one session representation is believed to be sufficient to answer questions in server performance, resource management, capacity planning and Web site personalization, which previously would have required multiple different groupings. Grouping by Pages Requested is recommended since it is the simplest and data on Web pages requested is relatively easy to obtain in HTTP logs
Enabling Micro-level Demand-Side Grid Flexiblity in Resource Constrained Environments
The increased penetration of uncertain and variable renewable energy presents
various resource and operational electric grid challenges. Micro-level
(household and small commercial) demand-side grid flexibility could be a
cost-effective strategy to integrate high penetrations of wind and solar
energy, but literature and field deployments exploring the necessary
information and communication technologies (ICTs) are scant. This paper
presents an exploratory framework for enabling information driven grid
flexibility through the Internet of Things (IoT), and a proof-of-concept
wireless sensor gateway (FlexBox) to collect the necessary parameters for
adequately monitoring and actuating the micro-level demand-side. In the summer
of 2015, thirty sensor gateways were deployed in the city of Managua
(Nicaragua) to develop a baseline for a near future small-scale demand response
pilot implementation. FlexBox field data has begun shedding light on
relationships between ambient temperature and load energy consumption, load and
building envelope energy efficiency challenges, latency communication network
challenges, and opportunities to engage existing demand-side user behavioral
patterns. Information driven grid flexibility strategies present great
opportunity to develop new technologies, system architectures, and
implementation approaches that can easily scale across regions, incomes, and
levels of development
Cloud Computing Trace Characterization and Synthetic Workload Generation
This thesis researches cloud computing workload characteristics and synthetic workload generation. A heuristic presented in the work guides the process of workload trace characterization and synthetic workload generation. Analysis of a cloud trace provides insight into client request behaviors and statistical parameters. A versatile workload generation tool creates client connections, controls request rates, defines number of jobs, produces tasks within each job, and manages task durations. The test system consists of multiple clients creating workloads and a server receiving request, all contained within a virtual machine environment. Statistical analysis verifies the synthetic workload experimental results are consistent with real workload behaviors and characteristics
- âŚ