6,651 research outputs found

Multi-GPU Graph Analytics

Author: Owens John D.
Pan Yuechao
Wang Yangzihao
Wu Yuduo
Yang Carl
Publication date: 01/03/2017

We present a single-node, multi-GPU programmable graph processing library that allows programmers to easily extend single-GPU graph algorithms to achieve scalable performance on large graphs with billions of edges. Directly using the single-GPU implementations, our design only requires programmers to specify a few algorithm-dependent concerns, hiding most multi-GPU-related implementation details. We analyze the theoretical and practical limits to scalability in the context of varying graph primitives and datasets. We describe several optimizations, such as direction-optimizing traversal and a just-enough memory allocation scheme, for better performance and smaller memory consumption. Compared to previous work, we achieve best-of-class performance across operations and datasets, including excellent strong and weak scalability on most primitives as we increase the number of GPUs in the system. Comment: 12 pages. Final version submitted to IPDPS 201

arXiv.org e-Print Archive

eScholarship - University of California

Computing for Perturbative QCD - A Snowmass White Paper

Author: /Argonne
/Fermilab
/LBNL Berkeley
/SLAC
/UCLA
Bauer Christian
Bern Zvi
Boughezal Radja
Campbell John
Christensen Neil
Dixon Lance
Gehrmann Thomas
Hoeche Stefan
Kanzaki Junichi
Mitov Alexander
Nadolsky Pavel
Olness Fredrick
Peskin Michael
Petriello Frank
Pittsburgh /U.
Pozzorini Stefano
Reina Laura
Siegert Frank
Wackeroth Doreen
Walsh Jonathan
Williams Ciaran
Wobisch Markus
Zurich /U.
Publication date: 13/09/2013

We present a study on high-performance computing and large-scale distributed computing for perturbative QCD calculations. Comment: 21 pages, 5 tables

arXiv.org e-Print Archive

UNT Digital Library

Parallel Processing of Large Graphs

Author: Indyk Wojciech
Kajdanowicz Tomasz
Kazienko Przemyslaw
Publication date: 03/06/2013

More and more large data collections are gathered worldwide in various IT systems. Many of them are networked in nature and need to be processed and analysed as graph structures. Due to their size, they very often require a parallel paradigm for efficient computation. Three parallel techniques are compared in the paper: MapReduce, its map-side join extension, and Bulk Synchronous Parallel (BSP). They are implemented for two different graph problems: calculation of single-source shortest paths (SSSP) and collective classification of graph nodes by means of relational influence propagation (RIP). The methods and algorithms are applied to several network datasets differing in size and structural profile, originating from three domains: telecommunication, multimedia, and microblog. The results reveal that iterative graph processing with the BSP implementation always significantly outperforms MapReduce, by up to 10 times, especially for algorithms with many iterations and sparse communication. The MapReduce extension based on map-side join also usually shows noticeably better efficiency, although not as much as BSP. Nevertheless, MapReduce remains a good alternative for enormous networks whose data structures do not fit in local memories. Comment: Preprint submitted to Future Generation Computer Systems

arXiv.org e-Print Archive

CiteSeerX

GraphGrind: addressing load imbalance of graph partitioning

Author: Congshan Yang (3483983)
Fengmei Guo (539478)
Haibo Qiu (539479)
Hua Shao (295138)
Jianfeng Xie (539475)
Xudong Ma (539476)
Yi Yang (116183)
Yingzi Huang (539477)
Zhiwei Gao (686378)
Publication date: 01/01/2017

The incidence of HCAIs before and after antimicrobial stewardship. Incidence of VAP, CRBSI and CAUTI were defined as the number of VAP, CRBSI and CAUTI patients per 1000 ventilation days, per 1000 central venous catheter days and per 1000 urine-catheter days, respectively. (DOCX 15 kb)

FigShare

Accelerating Graph Analytics by Utilising the Memory Locality of Graph Partitioning

Author: Nikolopoulos Dimitrios S.
Sun Jiawen
Vandierendonck Hans
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 07/09/2017

A Survey on the Evolution of Stream Processing Systems

Author: Carbone Paris
Fragkoulis Marios
Kalavri Vasiliki
Katsifodimos Asterios
Publication date: 03/08/2020

Stream processing has been an active research field for more than 20 years, but it is now witnessing its prime time due to recent successful efforts by the research community and numerous worldwide open-source communities. This survey provides a comprehensive overview of fundamental aspects of stream processing systems and their evolution in the functional areas of out-of-order data management, state management, fault tolerance, high availability, load management, elasticity, and reconfiguration. We review noteworthy past research findings, outline the similarities and differences between early ('00-'10) and modern ('11-'18) streaming systems, and discuss recent trends and open problems. Comment: 34 pages, 15 figures, 5 tables

arXiv.org e-Print Archive