
    SWARM Parallelism: Training Large Models Can Be Surprisingly Communication-Efficient

    Many deep learning applications benefit from using large models with billions of parameters. Training these models is notoriously expensive due to the need for specialized HPC clusters. In this work, we consider alternative setups for training large models: using cheap "preemptible" instances or pooling existing resources from multiple regions. We analyze the performance of existing model-parallel algorithms in these conditions and find configurations where training larger models becomes less communication-intensive. Based on these findings, we propose SWARM parallelism, a model-parallel training algorithm designed for poorly connected, heterogeneous and unreliable devices. SWARM creates temporary randomized pipelines between nodes that are rebalanced in case of failure. We empirically validate our findings and compare SWARM parallelism with existing large-scale training approaches. Finally, we combine our insights with compression strategies to train a large Transformer language model with 1B shared parameters (approximately 13B before sharing) on preemptible T4 GPUs with less than 200 Mb/s of network bandwidth. Comment: Accepted to the International Conference on Machine Learning (ICML) 2023; 25 pages, 8 figures.
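
    To make the pipeline mechanism concrete, the toy Python sketch below (an illustration under assumptions, not the authors' implementation) shows how a pool of preemptible workers might be split across pipeline stages, how a temporary randomized pipeline is sampled for each step, and how stages could be rebalanced after a failure; the worker names, four-stage split, and rebalancing rule are invented for illustration.

```python
import random
from collections import defaultdict

# Toy model: a pool of preemptible workers serving a 4-stage pipeline.
# Each step samples one worker per stage at random (a "temporary
# randomized pipeline"); when a worker is lost, stages are rebalanced
# by moving a worker from the best-served stage to the worst-served one.
NUM_STAGES = 4

class Swarm:
    def __init__(self, worker_ids):
        self.stage_workers = defaultdict(set)
        for i, worker in enumerate(worker_ids):
            self.stage_workers[i % NUM_STAGES].add(worker)

    def sample_pipeline(self):
        # One randomly chosen worker per stage forms this step's pipeline.
        return [random.choice(sorted(self.stage_workers[s])) for s in range(NUM_STAGES)]

    def on_failure(self, worker):
        # Drop the failed worker, then rebalance if the stages became skewed.
        for s in range(NUM_STAGES):
            self.stage_workers[s].discard(worker)
        rich = max(range(NUM_STAGES), key=lambda s: len(self.stage_workers[s]))
        poor = min(range(NUM_STAGES), key=lambda s: len(self.stage_workers[s]))
        if len(self.stage_workers[rich]) - len(self.stage_workers[poor]) > 1:
            self.stage_workers[poor].add(self.stage_workers[rich].pop())

swarm = Swarm([f"gpu{i}" for i in range(8)])
print(swarm.sample_pipeline())  # e.g. ['gpu4', 'gpu1', 'gpu6', 'gpu3']
swarm.on_failure("gpu2")        # a preempted worker leaves stage 2
print(swarm.sample_pipeline())
```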

    GeoYCSB: A Benchmark Framework for the Performance and Scalability Evaluation of Geospatial NoSQL Databases

    The proliferation of geospatial applications has tremendously increased the variety, velocity, and volume of spatial data that data stores have to manage. Traditional relational databases reveal limitations in handling such big geospatial data, mainly due to their rigid schema requirements and limited scalability. Numerous NoSQL databases have emerged and actively serve as alternative data stores for big spatial data. This study presents a framework, called GeoYCSB, developed for benchmarking NoSQL databases with geospatial workloads. To develop GeoYCSB, we extend YCSB, the de facto benchmark framework for NoSQL systems, by integrating into its design architecture the new components necessary to support geospatial workloads. GeoYCSB supports both microbenchmarks and macrobenchmarks and facilitates the use of real datasets in both. It is extensible to evaluate any NoSQL database that supports spatial queries, using geospatial workloads performed on datasets of any geometric complexity. We use GeoYCSB to benchmark two leading document stores, MongoDB and Couchbase, and present the experimental results and analysis. Finally, we demonstrate the extensibility of GeoYCSB by adding a new dataset consisting of complex geometries and using it to benchmark a system with a wide variety of geospatial queries: Apache Accumulo, a wide-column store, with the GeoMesa framework applied on top.
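
    For intuition about what a single geospatial workload operation looks like, the sketch below performs a spatial load and read against MongoDB via pymongo; it is a minimal illustration rather than GeoYCSB code, and the database, collection, and field names are assumptions.

```python
# Minimal sketch (not GeoYCSB code) of one geospatial workload operation
# against MongoDB via pymongo; database, collection, and field names
# ("benchmark", "places", "geometry") are illustrative assumptions.
from pymongo import MongoClient, GEOSPHERE

client = MongoClient("mongodb://localhost:27017")
places = client["benchmark"]["places"]
places.create_index([("geometry", GEOSPHERE)])  # 2dsphere index for spatial queries

# Load step: insert one GeoJSON point document.
places.insert_one({
    "name": "site-001",
    "geometry": {"type": "Point", "coordinates": [-122.33, 47.61]},
})

# Transaction step: a spatial read returning documents inside a query polygon.
query_polygon = {
    "type": "Polygon",
    "coordinates": [[[-123, 47], [-122, 47], [-122, 48], [-123, 48], [-123, 47]]],
}
results = list(places.find({"geometry": {"$geoWithin": {"$geometry": query_polygon}}}))
print(len(results), "documents fall inside the region")
```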

    Towards A Practical High-Assurance Systems Programming Language

    Writing correct and performant low-level systems code is a notoriously demanding job, even for experienced developers. To make matters worse, formally reasoning about its correctness properties introduces yet another level of complexity to the task, requiring considerable expertise in both systems programming and formal verification. Without appropriate tools that provide abstraction and automation, development can be extremely costly due to the sheer complexity of such systems and the nuances within them. Cogent is designed to alleviate the burden on developers when writing and verifying systems code. It is a high-level functional language with a certifying compiler, which automatically proves the correctness of the compiled code and also provides a purely functional abstraction of the low-level program to the developer. Equational reasoning techniques can then be used to prove functional correctness properties of the program on top of this abstract semantics, which is notably less laborious than directly verifying the C code. To make Cogent a more approachable and effective tool for developing real-world systems, we further strengthen the framework by extending the core language and its ecosystem. Specifically, we enrich the language to allow users to control the memory representation of algebraic data types, while retaining the automatic proof via a data layout refinement calculus. We repurpose existing tools in a novel way and develop an intuitive foreign function interface, which provides users with a seamless experience when using Cogent in conjunction with native C. We augment the Cogent ecosystem with a property-based testing framework, which helps developers better understand the impact formal verification has on their programs and enables a progressive approach to producing high-assurance systems. Finally, we explore refinement type systems, which we plan to incorporate into Cogent for more expressiveness and better integration of systems programmers with the verification process.
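
    The refinement-style property-based testing idea, checking that a lower-level implementation agrees with a pure functional specification on generated inputs, can be sketched outside Cogent as well; the example below uses Python's hypothesis library with an invented specification/implementation pair purely to illustrate the testing pattern (Cogent's own framework targets Cogent and C code).

```python
# Minimal sketch of refinement-style property-based testing in Python with
# the hypothesis library; the specification/implementation pair is invented
# for illustration only.
from hypothesis import given, strategies as st

def spec_count_nonzero(xs):
    # Pure functional specification: the number of non-zero entries.
    return sum(1 for x in xs if x != 0)

def impl_count_nonzero(xs):
    # "Low-level" implementation with explicit indexing and mutation,
    # standing in for compiled systems code.
    n = 0
    for i in range(len(xs)):
        if xs[i] != 0:
            n += 1
    return n

@given(st.lists(st.integers()))
def test_impl_refines_spec(xs):
    # Refinement property: implementation and specification agree on
    # every generated input.
    assert impl_count_nonzero(xs) == spec_count_nonzero(xs)

test_impl_refines_spec()  # hypothesis runs the property over many random lists
```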

    Evaluation Methodologies in Software Protection Research

    Man-at-the-end (MATE) attackers have full control over the system on which the attacked software runs, and try to break the confidentiality or integrity of assets embedded in the software. Both companies and malware authors want to prevent such attacks. This has driven an arms race between attackers and defenders, resulting in a plethora of different protection and analysis methods. However, it remains difficult to measure the strength of protections, because MATE attackers can reach their goals in many different ways and a universally accepted evaluation methodology does not exist. This survey systematically reviews the evaluation methodologies of papers on obfuscation, a major class of protections against MATE attacks. For 572 papers, we collected 113 aspects of their evaluation methodologies, ranging from sample set types and sizes, through sample treatment, to the measurements performed. We provide detailed insights into how the academic state of the art evaluates both the protections and the analyses of them. In summary, there is a clear need for better evaluation methodologies. We identify nine challenges for software protection evaluations, which represent threats to the validity, reproducibility, and interpretation of research results in the context of MATE attacks.

    Large-Scale Study of Temporal Shift in Health Insurance Claims

    Most machine learning models for predicting clinical outcomes are developed using historical data. Yet, even if these models are deployed in the near future, dataset shift over time may result in less than ideal performance. To capture this phenomenon, we consider a task (that is, an outcome to be predicted at a particular time point) to be non-stationary if a historical model is no longer optimal for predicting that outcome. We build an algorithm to test for temporal shift either at the population level or within a discovered sub-population. Then, we construct a meta-algorithm to perform a retrospective scan for temporal shift on a large collection of tasks. Our algorithms enable us to perform, to our knowledge, the first comprehensive evaluation of temporal shift in healthcare. We create 1,010 tasks by evaluating 242 healthcare outcomes for temporal shift from 2015 to 2020 on a health insurance claims dataset. 9.7% of the tasks show temporal shifts at the population level, and 93.0% have some sub-population affected by shifts. We dive into case studies to understand the clinical implications. Our analysis highlights the widespread prevalence of temporal shifts in healthcare. Comment: To appear as an oral spotlight and poster at the Conference on Health, Inference, and Learning (CHIL) 202
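
    The population-level check can be illustrated with the toy sketch below, which uses synthetic data and assumed modeling choices rather than the paper's actual algorithm: a task looks non-stationary if a model refit on recent data clearly outperforms the historical model on held-out recent data.

```python
# Toy sketch of a population-level temporal-shift check on synthetic data
# (assumed modeling choices, not the paper's algorithm).
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import roc_auc_score
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)

def make_cohort(n, shift):
    # The outcome's dependence on the features drifts with `shift`.
    X = rng.normal(size=(n, 5))
    logits = (1.0 + shift) * X[:, 0] - shift * X[:, 1]
    y = rng.binomial(1, 1.0 / (1.0 + np.exp(-logits)))
    return X, y

X_hist, y_hist = make_cohort(5000, shift=0.0)  # e.g. earlier claims years
X_new, y_new = make_cohort(5000, shift=1.5)    # e.g. the most recent year
X_tr, X_te, y_tr, y_te = train_test_split(X_new, y_new, random_state=0)

historical = LogisticRegression().fit(X_hist, y_hist)
refit = LogisticRegression().fit(X_tr, y_tr)

auc_hist = roc_auc_score(y_te, historical.predict_proba(X_te)[:, 1])
auc_refit = roc_auc_score(y_te, refit.predict_proba(X_te)[:, 1])
# A large gap suggests non-stationarity at the population level; the paper
# additionally uses a statistical test and a sub-population scan, omitted here.
print(f"historical AUC={auc_hist:.3f}  refit AUC={auc_refit:.3f}")
```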

    Using machine learning to predict pathogenicity of genomic variants throughout the human genome

    More than 6,000 diseases are estimated to be caused by genomic variants. This can happen in many possible ways: a variant may stop the translation of a protein, interfere with gene regulation, or alter splicing of the transcribed mRNA into an unwanted isoform. It is necessary to investigate all of these processes in order to evaluate which variant may be causal for the deleterious phenotype. A great help in this regard are variant effect scores. Implemented as machine learning classifiers, they integrate annotations from different resources to rank genomic variants in terms of pathogenicity. Developing a variant effect score requires multiple steps: annotation of the training data, feature selection, model training, benchmarking, and finally deployment for the model's application. Here, I present a generalized workflow of this process. It makes it simple to configure how information is converted into model features, enabling the rapid exploration of different annotations. The workflow further implements hyperparameter optimization, model validation, and ultimately deployment of a selected model via genome-wide scoring of genomic variants. The workflow is applied to train Combined Annotation Dependent Depletion (CADD), a variant effect model that scores SNVs and InDels genome-wide. I show that the workflow can be quickly adapted to novel annotations by porting CADD to the genome reference GRCh38. Further, I demonstrate the integration of deep-neural-network scores as features into a new CADD model, improving the annotation of RNA splicing events. Finally, I apply the workflow to train multiple variant effect models from training data based on variants selected by allele frequency. In conclusion, the developed workflow presents a flexible and scalable method to train variant effect scores. All software and developed scores are freely available from cadd.gs.washington.edu and cadd.bihealth.org.
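
    The stages of such a workflow, from annotated training data through hyperparameter search and validation to scoring new variants, can be illustrated with the toy scikit-learn sketch below; the feature names, proxy labels, and model choice are assumptions for illustration and not the released CADD pipeline.

```python
# Minimal sketch of the workflow's stages on toy data (assumed feature
# names, proxy labels, and a logistic-regression model; not CADD itself).
import numpy as np
import pandas as pd
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import roc_auc_score
from sklearn.model_selection import GridSearchCV, train_test_split

rng = np.random.default_rng(42)

# 1) "Annotated" training variants: each row carries feature annotations and
#    a proxy label (0 = proxy-neutral, 1 = proxy-deleterious).
variants = pd.DataFrame({
    "conservation": rng.normal(size=2000),
    "splice_score": rng.normal(size=2000),   # e.g. a deep-learning splice feature
    "allele_freq": rng.uniform(size=2000),
})
labels = (variants["conservation"] + rng.normal(scale=0.5, size=2000) > 0).astype(int)

# 2) Train/validation split and 3) model training with hyperparameter search.
X_tr, X_val, y_tr, y_val = train_test_split(variants, labels, random_state=0)
search = GridSearchCV(LogisticRegression(max_iter=1000), {"C": [0.01, 0.1, 1, 10]}, cv=5)
search.fit(X_tr, y_tr)

# 4) Validation of the selected model.
val_auc = roc_auc_score(y_val, search.predict_proba(X_val)[:, 1])
print(f"validation AUC: {val_auc:.3f}")

# 5) "Deployment": score a new batch of variants (genome-wide in practice).
new_variants = variants.sample(5, random_state=1)
print(search.predict_proba(new_variants)[:, 1])
```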

    Role of Digitalization in Election Voting Through Industry 4.0 Enabling Technologies

    The election voting system is one of the essential pillars of democracy, used to elect the representatives who govern a country. It involves several areas, such as detection of fake voters, prevention of illegal fake-voting activities, booth capturing, and ballot monitoring, in which Industry 4.0 can be adopted for real-time monitoring, intelligent detection, and enhanced security and transparency of voting and related data. A review of previous research shows that no studies have presented the significance of Industry 4.0 technologies for improving the electronic voting system from a sustainability standpoint. To address this gap, this study reviews the literature on Industry 4.0 technologies for election voting systems. We examine individual enabling technologies, such as blockchain, artificial intelligence (AI), cloud computing, and the Internet of Things (IoT), that have the potential to strengthen the infrastructure of the election voting system. Based on this analysis, the study discusses and recommends directions for future work, such as: IoT- and cloud-computing-based systems that automatically detect fake voters and update voter attendance after identity verification; AI-based detection of illegal and fake voting activities through vision nodes; blockchain-inspired systems for data integrity between the voter and the election commission; and robotic assistance systems for guiding voters and detecting disputes on election booth premises.
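
    The blockchain-inspired data-integrity recommendation can be illustrated with a simple hash chain over vote-event records, in which tampering with any record invalidates every later hash; the sketch below is a minimal illustration with invented record fields, not a production voting ledger.

```python
# Minimal hash-chain sketch of the "blockchain-inspired" integrity idea:
# each vote-event record is chained to the hash of the previous one, so
# modifying any record breaks verification. Record fields are illustrative.
import hashlib
import json

def build_chain(records):
    # Link each record to the hash of the previous entry.
    chained, prev_hash = [], "0" * 64
    for rec in records:
        entry = {"prev_hash": prev_hash, **rec}
        entry_hash = hashlib.sha256(json.dumps(entry, sort_keys=True).encode()).hexdigest()
        chained.append({**entry, "hash": entry_hash})
        prev_hash = entry_hash
    return chained

def verify_chain(chained):
    # Recompute every hash; any modified record breaks the chain.
    prev_hash = "0" * 64
    for entry in chained:
        body = {k: v for k, v in entry.items() if k != "hash"}
        recomputed = hashlib.sha256(json.dumps(body, sort_keys=True).encode()).hexdigest()
        if body["prev_hash"] != prev_hash or recomputed != entry["hash"]:
            return False
        prev_hash = entry["hash"]
    return True

events = [
    {"booth": "B-12", "event": "voter_verified", "ts": "2024-05-01T09:00:00"},
    {"booth": "B-12", "event": "ballot_cast", "ts": "2024-05-01T09:01:10"},
]
ledger = build_chain(events)
print(verify_chain(ledger))          # True
ledger[0]["event"] = "ballot_cast"   # tamper with the first record
print(verify_chain(ledger))          # False
```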

    The Forward Physics Facility at the High-Luminosity LHC


    Intelligent architecture to support second generation general accounting

    Dissertation presented as a partial requirement for obtaining a Master's degree in Statistics and Information Management, specialization in Information Analysis and Management. This study aimed to innovate the world of accounting software. Accountants face an enormous amount of work, much of it not productive, effective, or efficient for either the accountant or the company that provides the data required to carry out the accounting. Accounting software with various automated processes already exists, from ornamentation to profitability analysis and management reporting. There is also software that is kept up to date with accounting law, i.e., the platform changes its mechanisms according to changes in legislation. Despite this software, manual work remains, and the amount of information accountants face is still very large. It is difficult for accountants to do a 100% reliable job with so much information and data to handle. One of the most common situations in the accounting world is the miscalculation or omission of financial or non-financial data in accounting operations (income statements, balance sheets, etc.). To render accounting operations efficient, effective, productive, error-free, and 100% reliable, an intelligent architecture has been developed to support second-generation general accounting. This architectural design was developed with a view to making existing software smarter with the help of artificial intelligence. A study of key accounting concepts, AI, and the main process-automation techniques was carried out to build the model, with the aim of gathering all possible requirements for the architecture. Towards the end of the thesis, the model was validated.
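
    As one concrete example of a check such an architecture could automate, the sketch below flags balance sheets that violate the identity assets = liabilities + equity; the record structure and account names are illustrative assumptions, not the architecture described in the dissertation.

```python
# Minimal sketch of one automated consistency check: flag balance sheets
# where assets do not equal liabilities plus equity (with a rounding
# tolerance). The record structure and field names are illustrative.
def check_balance_sheet(record, tolerance=0.01):
    assets = sum(record["assets"].values())
    liabilities = sum(record["liabilities"].values())
    equity = sum(record["equity"].values())
    gap = assets - (liabilities + equity)
    return {"balanced": abs(gap) <= tolerance, "gap": round(gap, 2)}

statement = {
    "assets": {"cash": 120_000.00, "receivables": 45_500.00},
    "liabilities": {"payables": 60_000.00, "loans": 70_000.00},
    "equity": {"share_capital": 30_000.00, "retained_earnings": 5_000.00},
}
print(check_balance_sheet(statement))
# {'balanced': False, 'gap': 500.0} -> flagged for the accountant to review
```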