55,856 research outputs found
ALOJA: A benchmarking and predictive platform for big data performance analysis
The main goals of the ALOJA research project from BSC-MSR, are to explore and automate the characterization of cost-effectivenessof Big Data deployments. The development of the project over its first year, has resulted in a open source benchmarking platform, an online public repository of results with over 42,000 Hadoop job runs, and web-based analytic tools to gather insights about system's cost-performance1.
This article describes the evolution of the project's focus and research
lines from over a year of continuously benchmarking Hadoop under dif-
ferent configuration and deployments options, presents results, and dis
cusses the motivation both technical and market-based of such changes.
During this time, ALOJA's target has evolved from a previous low-level
profiling of Hadoop runtime, passing through extensive benchmarking
and evaluation of a large body of results via aggregation, to currently
leveraging Predictive Analytics (PA) techniques. Modeling benchmark
executions allow us to estimate the results of new or untested configu-
rations or hardware set-ups automatically, by learning techniques from
past observations saving in benchmarking time and costs.This work is partially supported the BSC-Microsoft Research Centre, the Span-
ish Ministry of Education (TIN2012-34557), the MINECO Severo Ochoa Research program (SEV-2011-0067) and the Generalitat de Catalunya (2014-SGR-1051).Peer ReviewedPostprint (author's final draft
Cloud WorkBench - Infrastructure-as-Code Based Cloud Benchmarking
To optimally deploy their applications, users of Infrastructure-as-a-Service
clouds are required to evaluate the costs and performance of different
combinations of cloud configurations to find out which combination provides the
best service level for their specific application. Unfortunately, benchmarking
cloud services is cumbersome and error-prone. In this paper, we propose an
architecture and concrete implementation of a cloud benchmarking Web service,
which fosters the definition of reusable and representative benchmarks. In
distinction to existing work, our system is based on the notion of
Infrastructure-as-Code, which is a state of the art concept to define IT
infrastructure in a reproducible, well-defined, and testable way. We
demonstrate our system based on an illustrative case study, in which we measure
and compare the disk IO speeds of different instance and storage types in
Amazon EC2
Lean and green – a systematic review of the state of the art literature
The move towards greener operations and products has forced companies to seek alternatives to balance efficiency gains and environmental friendliness in their operations and products. The exploration of the sequential or simultaneous deployment of lean and green initiatives is the results of this balancing action. However, the lean-green topic is relatively new, and it lacks of a clear and structured research definition. Thus, this paper’s main contribution is the offering of a systematic review of the existing literature on lean and green, aimed at providing guidance on the topic, uncovering gaps and inconsistencies in the literature, and finding new paths for research. The paper identifies and structures, through a concept map, six main research streams that comprise both conceptual and empirical research conducted within the context of various organisational functions and industrial sectors. Important issues for future research are then suggested in the form of research questions. The paper’s aim is to also contribute by stimulating scholars to further study this area in depth, which will lead to a better understanding of the compatibility and impact on organisational performance of lean and green initiatives. It also holds important implications for industrialists, who can develop a deeper and richer knowledge on lean and green to help them formulate more effective strategies for their deployment
Cancer gene prioritization by integrative analysis of mRNA expression and DNA copy number data: a comparative review
A variety of genome-wide profiling techniques are available to probe
complementary aspects of genome structure and function. Integrative analysis of
heterogeneous data sources can reveal higher-level interactions that cannot be
detected based on individual observations. A standard integration task in
cancer studies is to identify altered genomic regions that induce changes in
the expression of the associated genes based on joint analysis of genome-wide
gene expression and copy number profiling measurements. In this review, we
provide a comparison among various modeling procedures for integrating
genome-wide profiling data of gene copy number and transcriptional alterations
and highlight common approaches to genomic data integration. A transparent
benchmarking procedure is introduced to quantitatively compare the cancer gene
prioritization performance of the alternative methods. The benchmarking
algorithms and data sets are available at http://intcomp.r-forge.r-project.orgComment: PDF file including supplementary material. 9 pages. Preprin
A comparative study of benchmarking approaches for non-domestic buildings: Part 1 – Top-down approach
Benchmarking plays an important role in improving energy efficiency of non-domestic buildings. A review of energy benchmarks that underpin the UK’s Display Energy Certificate (DEC) scheme have prompted necessities to explore the benefits and limitations of using various methods to derive energy benchmarks. The existing methods were reviewed and grouped into top-down and bottom-up approaches based on the granularity of the data used. In the study, two top-down methods, descriptive statistics and artificial neural networks (ANN), were explored for the purpose of benchmarking energy performances of schools. The results were used to understand the benefits of using these benchmarks for assessing energy efficiency of buildings and the limitations that affect the robustness of the derived benchmarks. Compared to the bottom-up approach, top-down approaches were found to be beneficial in gaining insight into how peers perform. The relative rather than absolute feedback on energy efficiency meant that peer pressure was a motivator for improvement. On the other hand, there were limitations with regard to the extent to which the energy efficiency of a building could be accurately assessed using the top-down benchmarks. Moreover, difficulties in acquiring adequate data were identified as a key limitation to using the top-down approach for benchmarking non-domestic buildings. The study suggested that there are benefits in rolling out of DECs to private sector buildings and that there is a need to explore more complex methods to provide more accurate indication of energy efficiency in non-domestic buildings
Integrated Environmental Process Planning for the Design & Manufacture of Automotive Components
Advanced Product Quality Planning (APQP) logic is widely used by manufacturers for
the design and manufacture of automotive components. Manufacturers are increasingly
finding difficulties to incorporate environmental considerations in the broad range of
products that they manufacture. Therefore, there is a need for a systematic method for
environmental process planning to evaluate product configurations and their associated
environmental impact. The framework and models discussed in this paper can deal with
a variety of product characteristics and environmental impacts through a selection of
Environmental Performance Indicators (EPIs) for a final product configuration. The
framework and models have been applied in a real-life application and have proven that
changes in product design or process selection can reduce the product's environmental
impact and increase process efficiency. Hence, manufacturers can use the framework
and models during the Advanced Product Quality Planning (APQP) process to
benchmark each product variation that they manufacture in a standardised manner and
realise cost saving opportunities
- …