Search CORE

1,166 research outputs found

RETHINK big: European roadmap for hardware anc networking optimizations for big data

Author: ahmad
alkhatib
chanthadavong
coleman
earl joseph
feldman
huang
manolis marazakis
mitchell waldrop
nunberg
press
prickett morgan
prickett morgan
woodie
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 15/05/2017
Field of study

This paper discusses the results of the RETHINK big Project, a 2-year Collaborative Support Action funded by the European Commission in order to write the European Roadmap for Hardware and Networking optimizations for Big Data. This industry-driven project was led by the Barcelona Supercomputing Center (BSC), and it included large industry partners, SMEs and academia. The roadmap identifies business opportunities from 89 in-depth interviews with 70 European industry stakeholders in the area of Big Data and predicts the future technologies that will disrupt the state of the art in Big Data processing in terms of hardware and networking optimizations. Moreover, it presents coordinated technology development recommendations (focused on optimizations in networking and hardware) that would be in the best interest of European Big Data companies to undertake in concert as a matter of competitive advantage.This project has received funding from the European Union’s Seventh Framework Programme for research, technological development and demonstration under grant agreement n° 619788. It has also been supported by the Spanish Government (grant SEV2015-0493 of the Severo Ochoa Program), by the Spanish Ministry of Science and Innovation (contract TIN2015-65316) and by Generalitat de Catalunya (contracts 2014-SGR-1051 and 2014-SGR-1272).Peer ReviewedPostprint (author's final draft

Crossref

UPCommons. Portal del coneixement obert de la UPC

Research and Education in Computational Science and Engineering

Over the past two decades the field of computational science and engineering (CSE) has penetrated both basic and applied research in academia, industry, and laboratories to advance discovery, optimize systems, support decision-makers, and educate the scientific and engineering workforce. Informed by centuries of theory and experiment, CSE performs computational experiments to answer questions that neither theory nor experiment alone is equipped to answer. CSE provides scientists and engineers of all persuasions with algorithmic inventions and software systems that transcend disciplines and scales. Carried on a wave of digital technology, CSE brings the power of parallelism to bear on troves of data. Mathematics-based advanced computing has become a prevalent means of discovery and innovation in essentially all areas of science, engineering, technology, and society; and the CSE community is at the core of this transformation. However, a combination of disruptive developments---including the architectural complexity of extreme-scale computing, the data revolution that engulfs the planet, and the specialization required to follow the applications to new frontiers---is redefining the scope and reach of the CSE endeavor. This report describes the rapid expansion of CSE and the challenges to sustaining its bold advances. The report also presents strategies and directions for CSE research and education for the next decade.Comment: Major revision, to appear in SIAM Revie

arXiv.org e-Print Archive

Infoscience - École polytechnique fédérale de Lausanne

A Survey on Vertical and Horizontal Scaling Platforms for Big Data Analytics

Author: Ali Ahmed Hussein
Publication venue: 'Penerbit UTHM'
Publication date: 12/09/2019
Field of study

There is no doubt that we are entering the era of big data. The challenge is on how to store, search, and analyze the huge amount of data that is being generated per second. One of the main obstacles to the big data researchers is how to find the appropriate big data analysis platform. The basic aim of this work is to present a complete investigation of all the available platforms for big data analysis in terms of vertical and horizontal scaling, and its compatible framework and applications in detail. Finally, this article will outline some research trends and other open issues in big data analytic

Journals of Universiti Tun Hussein Onn Malaysia (UTHM)

International Journal of Integrated Engineering

Technical Research Priorities for Big Data

Author: Auer Sören
Berre Arne J.
Curry Edward
Curry Edward
Despenic Marija
García Robles Ana
Hasan Souleiman
Metzger Andreas
Metzger Andreas
Ojo Adegboyega
Pazzaglia Jean-Christophe
Petkovic Milan
Roman Dumitru
Seidl Robert
ul Hassan Umair
Walshe Ray
Waterfeld Walter
Zillner Sonja
Zillner Sonja
Publication venue: Cham : Springer International Publishing
Publication date: 01/01/2021
Field of study

To drive innovation and competitiveness, organisations need to foster the development and broad adoption of data technologies, value-adding use cases and sustainable business models. Enabling an effective data ecosystem requires overcoming several technical challenges associated with the cost and complexity of management, processing, analysis and utilisation of data. This chapter details a community-driven initiative to identify and characterise the key technical research priorities for research and development in data technologies. The chapter examines the systemic and structured methodology used to gather inputs from over 200 stakeholder organisations. The result of the process identified five key technical research priorities in the areas of data management, data processing, data analytics, data visualisation and user interactions, and data protection, together with 28 sub-level challenges. The process also highlighted the important role of data standardisation, data engineering and DevOps for Big Data

Institutionelles Repositorium der Leibniz Universität Hannover

Toward High-Performance Computing and Big Data Analytics Convergence: The Case of Spark-DIY

Author: Caino Lores Silvina
Carretero Pérez Jesús
Nicolae Bogdan
Peterka Tom
Yildiz Orcun
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 28/10/2019
Field of study

Convergence between high-performance computing (HPC) and big data analytics (BDA) is currently an established research area that has spawned new opportunities for unifying the platform layer and data abstractions in these ecosystems. This work presents an architectural model that enables the interoperability of established BDA and HPC execution models, reflecting the key design features that interest both the HPC and BDA communities, and including an abstract data collection and operational model that generates a unified interface for hybrid applications. This architecture can be implemented in different ways depending on the process- and data-centric platforms of choice and the mechanisms put in place to effectively meet the requirements of the architecture. The Spark-DIY platform is introduced in the paper as a prototype implementation of the architecture proposed. It preserves the interfaces and execution environment of the popular BDA platform Apache Spark, making it compatible with any Spark-based application and tool, while providing efficient communication and kernel execution via DIY, a powerful communication pattern library built on top of MPI. Later, Spark-DIY is analyzed in terms of performance by building a representative use case from the hydrogeology domain, EnKF-HGS. This application is a clear example of how current HPC simulations are evolving toward hybrid HPC-BDA applications, integrating HPC simulations within a BDA environment.This work was supported in part by the Spanish Ministry of Economy, Industry and Competitiveness under Grant TIN2016-79637-P(toward Unification of HPC and Big Data Paradigms), in part by the Spanish Ministry of Education under Grant FPU15/00422 TrainingProgram for Academic and Teaching Staff Grant, in part by the Advanced Scientific Computing Research, Office of Science, U.S.Department of Energy, under Contract DE-AC02-06CH11357, and in part by the DOE with under Agreement DE-DC000122495,Program Manager Laura Biven

Universidad Carlos III de Madrid e-Archivo

Why High-Performance Modelling and Simulation for Big Data Applications Matters

Author: Aldinucci M.
Bracciali A.
Grelck C.
Larsson E.
Niewiadomska-Szynkiewicz E.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2019
Field of study

International Migration, Integration and Social Cohesion online publications

Why High-Performance Modelling and Simulation for Big Data Applications Matters

Author: A Abdullatif
A Al-Fuqaha
A Bracciali
A Cristina-Bicharra
A Fanti
A Glotić
A Heidari Gorji
A Inostrosa-Psijas
A Oussous
A Serra
A Sikora
A Singh
A Zamuda
A Zamuda
A Zamuda
A Zamuda
A Zamuda
A Zamuda
B Kitchenham
C Grelck
C Grelck
C Lorenzo
C Misale
C Sansom
C Zechner
CS Iliopoulos
D Griol
E Bartocci
E Bartocci
E Capobianco
E Capobianco
E Frank
E Niewiadomska-Szynkiewicz
E Niewiadomska-Szynkiewicz
EA Lee
EI Vlahogianni
F Bardozzo
F Berman
G Bernardini
G Garnett
G Vitello
H Casanova
H Kennedy
I Cotes-Ruiz
I Milne
I Park
J Dean
J Holub
J Zhang
K Rutherford
L Calviello
L Calzone
L Garg
L Garg
L Huang
L Lazzerini-Ospri
L Marti
L Nasti
M Aldinucci
M Aldinucci
M Aldinucci
M Aldinucci
M Beccuti
M Cannataro
M Cole
M Herlihy
M Jahangirian
M Karpowicz
M Patterson
MA Martínez-del-Amor
MP Karpowicz
N Akhter
N Paoletti
N Sehgal
N Totis
P Danecek
P Liò
P Liò
P Suravajhala
P Szynkiewicz
PD Healy
PL Luisi
PS Pacheco
R Calheiros
RJ Walters
S Aleem
S John Walker
S McClean
S McClean
S Shanmugam
S Vitabile
T Akidau
T Carver
T Mastelic
T White
X Song
Y Kuruma
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2019
Field of study

Modelling and Simulation (M&S) offer adequate abstractions to manage the complexity of analysing big data in scientific and engineering domains. Unfortunately, big data problems are often not easily amenable to efficient and effective use of High Performance Computing (HPC) facilities and technologies. Furthermore, M&S communities typically lack the detailed expertise required to exploit the full potential of HPC solutions while HPC specialists may not be fully aware of specific modelling and simulation requirements and applications. The COST Action IC1406 High-Performance Modelling and Simulation for Big Data Applications has created a strategic framework to foster interaction between M&S experts from various application domains on the one hand and HPC experts on the other hand to develop effective solutions for big data applications. One of the tangible outcomes of the COST Action is a collection of case studies from various computing domains. Each case study brought together both HPC and M&S experts, giving witness of the effective cross-pollination facilitated by the COST Action. In this introductory article we argue why joining forces between M&S and HPC communities is both timely in the big data era and crucial for success in many application domains. Moreover, we provide an overview on the state of the art in the various research areas concerned

Crossref

Stirling Online Research Repository (RIOXX)

Stirling Online Research Repository

International Migration, Integration and Social Cohesion online publications

UvA-DARE

Institutional Research Information System University of Turin