9,536 research outputs found
The state of SQL-on-Hadoop in the cloud
Managed Hadoop in the cloud, especially SQL-on-Hadoop, has been gaining attention recently. On Platform-as-a-Service (PaaS), analytical services like Hive and Spark come preconfigured for general-purpose and ready to use. Thus, giving companies a quick entry and on-demand deployment of ready SQL-like solutions for their big data needs. This study evaluates cloud services from an end-user perspective, comparing providers including: Microsoft Azure, Amazon Web Services, Google Cloud,
and Rackspace. The study focuses on performance, readiness, scalability, and cost-effectiveness of the different solutions at entry/test level clusters sizes. Results are based on over 15,000 Hive queries derived from the industry standard TPC-H benchmark.
The study is framed within the ALOJA research project, which features an open source benchmarking and analysis platform that has been recently extended to support SQL-on-Hadoop engines.
The ALOJA Project aims to lower the total cost of ownership (TCO) of big data deployments and study their performance characteristics for optimization.
The study benchmarks cloud providers across a diverse range instance types, and uses input data scales from 1GB to 1TB, in order to survey the popular entry-level PaaS SQL-on-Hadoop solutions, thereby establishing a common results-base upon which subsequent research can be carried out by the project. Initial results already show the main performance trends to both hardware and software configuration, pricing, similarities and architectural differences of the evaluated PaaS solutions. Whereas some
providers focus on decoupling storage and computing resources while offering network-based elastic storage, others choose to keep the local processing model from Hadoop for high performance, but reducing flexibility. Results also show the importance of application-level tuning and how keeping up-to-date hardware and software stacks can influence performance even more than replicating the on-premises model in the cloud.This work is partially supported by the Microsoft Azure for Research program, the European Research Council (ERC) under
the EUs Horizon 2020 programme (GA 639595), the Spanish Ministry of Education (TIN2015-65316-P), and the Generalitat
de Catalunya (2014-SGR-1051).Peer ReviewedPostprint (author's final draft
LA2050
LA2050 is an initiative to create a shared vision for the future of Los Angeles, and to drive and track progress toward that vision. Spearheaded by the Goldhirsh Foundation, the LA2050 Report has looked at the health of the region along well-defined indicators (Arts & Cultural Vitality, Education, Environmental Quality, Health, Housing, Income & Employment, Public Safety, and Social Connectedness), and made informed projections about where we'll be in the year 2050 if we continue on this current path
HPC Cloud for Scientific and Business Applications: Taxonomy, Vision, and Research Challenges
High Performance Computing (HPC) clouds are becoming an alternative to
on-premise clusters for executing scientific applications and business
analytics services. Most research efforts in HPC cloud aim to understand the
cost-benefit of moving resource-intensive applications from on-premise
environments to public cloud platforms. Industry trends show hybrid
environments are the natural path to get the best of the on-premise and cloud
resources---steady (and sensitive) workloads can run on on-premise resources
and peak demand can leverage remote resources in a pay-as-you-go manner.
Nevertheless, there are plenty of questions to be answered in HPC cloud, which
range from how to extract the best performance of an unknown underlying
platform to what services are essential to make its usage easier. Moreover, the
discussion on the right pricing and contractual models to fit small and large
users is relevant for the sustainability of HPC clouds. This paper brings a
survey and taxonomy of efforts in HPC cloud and a vision on what we believe is
ahead of us, including a set of research challenges that, once tackled, can
help advance businesses and scientific discoveries. This becomes particularly
relevant due to the fast increasing wave of new HPC applications coming from
big data and artificial intelligence.Comment: 29 pages, 5 figures, Published in ACM Computing Surveys (CSUR
“It’s Been a Hard Day’s Night” for Songwriters: Why the ASCAP and BMI Consent Decrees Must Undergo Reform
In order to guarantee reasonable fees for songwriters, composers, and publishers, the consent decrees must undergo critical reform to account for how music is licensed in new media. Part I of this Note will provide background on the mechanics of music licensing, both traditional and through modern mediums, in order to explain why the two largest PROs initially entered into governmental consent decrees. Part II will discuss recent judicial determinations of “reasonable” licensing rates for public performances in new media and demonstrate the discrepancy in compensation between songwriters and their sound recording counterparts, namely record companies and recording artists. Finally, Part III will argue that the solution to this problem is through consent decree reform. The decrees should be modified to allow songwriters to withdraw their digital rights in order to separately license songs in new media. A new PRO should then emerge in the market place to account solely for public performance rights in new media, leaving traditional licensing to the existing PROs. Additionally, the current judicial process for setting rates, known as the “rate court” system, should be replaced with expedited, binding arbitration. Making these important changes to the music-licensing system will work towards bridging the gap in compensation inequality between songwriters and recording artists
Compendium of Yukon climate change science: 2018 supplement
"The Compendium is intended to provide an overview of recent climate change work involving Yukon. This document is intended to supplement the 2003-2013 version of the Compendium with climate change work that has taken place during 2017 and 2018. It is comprised of various types of documents, including scientific journal articles, government publications, and synopsis summaries." --from Foreword
- …