Search CORE

26,156 research outputs found

A Visual Analytics Framework for Reviewing Streaming Performance Data

Author: Carothers Christopher D.
Fujiwara Takanori
Kesavan Suraj P.
Li Jianping Kelvin
Ma Kwan-Liu
Mubarak Misbah
Ross Caitlin
Ross Robert B.
Publication venue
Publication date: 25/01/2020
Field of study

Understanding and tuning the performance of extreme-scale parallel computing systems demands a streaming approach due to the computational cost of applying offline algorithms to vast amounts of performance log data. Analyzing large streaming data is challenging because the rate of receiving data and limited time to comprehend data make it difficult for the analysts to sufficiently examine the data without missing important changes or patterns. To support streaming data analysis, we introduce a visual analytic framework comprising of three modules: data management, analysis, and interactive visualization. The data management module collects various computing and communication performance metrics from the monitored system using streaming data processing techniques and feeds the data to the other two modules. The analysis module automatically identifies important changes and patterns at the required latency. In particular, we introduce a set of online and progressive analysis methods for not only controlling the computational costs but also helping analysts better follow the critical aspects of the analysis results. Finally, the interactive visualization module provides the analysts with a coherent view of the changes and patterns in the continuously captured performance data. Through a multi-faceted case study on performance analysis of parallel discrete-event simulation, we demonstrate the effectiveness of our framework for identifying bottlenecks and locating outliers.Comment: This is the author's preprint version that will be published in Proceedings of IEEE Pacific Visualization Symposium, 202

arXiv.org e-Print Archive

Crossref

Reducing memory requirements for large size LBM simulations on GPUs

Author: Axner
Bernaschi
Bernaschi
Gross
He
Januszewski
Kollmannsberger
Latt
Li
Li
Malaspinas
Marié
Mohamad
Obrecht
Pohl
Qian
Rinaldi
Shet
Succi
Valero-Lara
Valero-Lara
Valero-Lara
Valero-Lara
Valero-Lara
Valero-Lara
Valero-Lara
Wellein
Wendt
Yang
Yang
Ye
Publication venue: 'Wiley'
Publication date: 01/01/2017
Field of study

The scientific community in its never-ending road of larger and more efficient computational resources is in need of more efficient implementations that can adapt efficiently on the current parallel platforms. Graphics processing units are an appropriate platform that cover some of these demands. This architecture presents a high performance with a reduced cost and an efficient power consumption. However, the memory capacity in these devices is reduced and so expensive memory transfers are necessary to deal with big problems. Today, the lattice-Boltzmann method (LBM) has positioned as an efficient approach for Computational Fluid Dynamics simulations. Despite this method is particularly amenable to be efficiently parallelized, it is in need of a considerable memory capacity, which is the consequence of a dramatic fall in performance when dealing with large simulations. In this work, we propose some initiatives to minimize such demand of memory, which allows us to execute bigger simulations on the same platform without additional memory transfers, keeping a high performance. In particular, we present 2 new implementations, LBM-Ghost and LBM-Swap, which are deeply analyzed, presenting the pros and cons of each of them.This project was funded by the Spanish Ministry of Economy and Competitiveness (MINECO): BCAM Severo Ochoa accreditation SEV-2013-0323, MTM2013-40824, Computación de Altas Prestaciones VII TIN2015-65316-P, by the Basque Excellence Research Center (BERC 2014-2017) pro- gram by the Basque Government, and by the Departament d' Innovació, Universitats i Empresa de la Generalitat de Catalunya, under project MPEXPAR: Models de Programació i Entorns d' Execució Paral·lels (2014-SGR-1051). We also thank the support of the computing facilities of Extremadura Research Centre for Advanced Technologies (CETA-CIEMAT) and NVIDIA GPU Research Center program for the provided resources, as well as the support of NVIDIA through the BSC/UPC NVIDIA GPU Center of Excellence.Peer ReviewedPostprint (author's final draft

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Crossref

UPCommons. Portal del coneixement obert de la UPC

Recommended from our members

Nuclear Physics Network Requirements Review Report

Author: Brown Benjamin
Dart Eli
Rai Gulshan
Rotman Lauren
Wefel Paul
Zurawski jason
Publication venue: eScholarship, University of California
Publication date: 05/05/2020
Field of study

eScholarship - University of California

Smart PIN: utility-based replication and delivery of multimedia content to mobile users in wireless networks

Author: Lee Seung-Bum
Muntean Gabriel-Miro
Smeaton Alan F.
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 31/03/2008
Field of study

Next generation wireless networks rely on heterogeneous connectivity technologies to support various rich media services such as personal information storage, file sharing and multimedia streaming. Due to users’ mobility and dynamic characteristics of wireless networks, data availability in collaborating devices is a critical issue. In this context Smart PIN was proposed as a personal information network which focuses on performance of delivery and cost efficiency. Smart PIN uses a novel data replication scheme based on individual and overall system utility to best balance the requirements for static data and multimedia content delivery with variable device availability due to user mobility. Simulations show improved results in comparison with other general purpose data replication schemes in terms of data availability

Crossref

Irish Universities

DCU Online Research Access Service

Big Data Caching for Networking: Moving from Cloud to Edge

Author: Baştuğ Ejder
Bennis Mehdi
Debbah Mérouane
Er Ahmet Salih
Kader Manhal Abdel
Karatepe Alper
Zeydan Engin
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 05/06/2016
Field of study

In order to cope with the relentless data tsunami in

5G

wireless networks, current approaches such as acquiring new spectrum, deploying more base stations (BSs) and increasing nodes in mobile packet core networks are becoming ineffective in terms of scalability, cost and flexibility. In this regard, context-aware

5

G networks with edge/cloud computing and exploitation of \emph{big data} analytics can yield significant gains to mobile operators. In this article, proactive content caching in

5

G wireless networks is investigated in which a big data-enabled architecture is proposed. In this practical architecture, vast amount of data is harnessed for content popularity estimation and strategic contents are cached at the BSs to achieve higher users' satisfaction and backhaul offloading. To validate the proposed solution, we consider a real-world case study where several hours of mobile data traffic is collected from a major telecom operator in Turkey and a big data-enabled analysis is carried out leveraging tools from machine learning. Based on the available information and storage capacity, numerical studies show that several gains are achieved both in terms of users' satisfaction and backhaul offloading. For example, in the case of

16

BSs with

30\%

of content ratings and

13

Gbyte of storage size (

78\%

of total library size), proactive caching yields

100\%

of users' satisfaction and offloads

98\%

of the backhaul.Comment: accepted for publication in IEEE Communications Magazine, Special Issue on Communications, Caching, and Computing for Content-Centric Mobile Network

arXiv.org e-Print Archive

HAL-CentraleSupelec

HAL-Rennes 1

Autonomous resource-aware scheduling of large-scale media workflows

Author: B. Volckaert
F.J. Seinstra
J. Yu
T. Harmer
Y.K. Kwok
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2010
Field of study

The media processing and distribution industry generally requires considerable resources to be able to execute the various tasks and workflows that constitute their business processes. The latter processes are often tied to critical constraints such as strict deadlines. A key issue herein is how to efficiently use the available computational, storage and network resources to be able to cope with the high work load. Optimizing resource usage is not only vital to scalability, but also to the level of QoS (e.g. responsiveness or prioritization) that can be provided. We designed an autonomous platform for scheduling and workflow-to-resource assignment, taking into account the different requirements and constraints. This paper presents the workflow scheduling algorithms, which consider the state and characteristics of the resources (computational, network and storage). The performance of these algorithms is presented in detail in the context of a European media processing and distribution use-case

Crossref

Ghent University Academic Bibliography