Search CORE

91,279 research outputs found

Workload-aware Automatic Parallelization for Multi-GPU DNN Training

Author: Choi Jungwook
Jo Youngmin
Shin Sungho
Srinivasan Vijayalakshmi
Sung Wonyong
Venkataramani Swagath
Publication venue
Publication date: 06/02/2019
Field of study

Deep neural networks (DNNs) have emerged as successful solutions for variety of artificial intelligence applications, but their very large and deep models impose high computational requirements during training. Multi-GPU parallelization is a popular option to accelerate demanding computations in DNN training, but most state-of-the-art multi-GPU deep learning frameworks not only require users to have an in-depth understanding of the implementation of the frameworks themselves, but also apply parallelization in a straight-forward way without optimizing GPU utilization. In this work, we propose a workload-aware auto-parallelization framework (WAP) for DNN training, where the work is automatically distributed to multiple GPUs based on the workload characteristics. We evaluate WAP using TensorFlow with popular DNN benchmarks (AlexNet and VGG-16), and show competitive training throughput compared with the state-of-the-art frameworks, and also demonstrate that WAP automatically optimizes GPU assignment based on the workload's compute requirements, thereby improving energy efficiency.Comment: This paper is accepted in ICASSP201

arXiv.org e-Print Archive

Crossref

SNU Open Repository and Archive

Solving for Micro- and Macro- Scale Electrostatic Configurations Using the Robin Hood Algorithm

Author: Abraham H.
Corona T. J.
Formaggio J. A.
Gluck F.
Lazic P.
Stefancic H.
Publication venue
Publication date: 01/11/2011
Field of study

We present a novel technique by which highly-segmented electrostatic configurations can be solved. The Robin Hood method is a matrix-inversion algorithm optimized for solving high density boundary element method (BEM) problems. We illustrate the capabilities of this solver by studying two distinct geometry scales: (a) the electrostatic potential of a large volume beta-detector and (b) the field enhancement present at surface of electrode nano-structures. Geometries with elements numbering in the O(10^5) are easily modeled and solved without loss of accuracy. The technique has recently been expanded so as to include dielectrics and magnetic materials.Comment: 40 pages, 20 figure

arXiv.org e-Print Archive

DSpace@MIT

Full-text Institutional Repository of the Ruđer Bošković Institute

The measurement of air supply volumes and velocities in cleanrooms

Author: Green G.
Whyte W.
Whyte W.M.
Publication venue: Veetech Ltd.
Publication date: 01/03/2010
Field of study

Air supply volumes and velocities in cleanrooms are monitored by airflow measuring hoods and anemometers but these measuring methods can be inaccurate if used incorrectly. It is demonstrated in this article that measuring hoods are accurate if the air supply passes evenly out of the hood, as occurs when the air volume is measured from a four-way diffuser or no air supply diffuser. However, when a swirl diffuser was investigated, the measuring hood gave readings more than 50% greater than the true volume. The reasons for the inaccuracy, and methods to correct it were established. Vane anemometers give inaccurate readings at the face of high-efficiency air supply filters, and it was found that the most accurate reading was found about 15 cm from the filter face. The number of readings required across the filter face to obtain an accurate average velocity was investigated, as was a scanning method using overlapping passes

Enlighten

Continuum variational and diffusion quantum Monte Carlo calculations

Author: Alfè D
Badinski A
Ceperley D M
Dewing M
Kalos M H
López Ríos P Needs R J
M D Towler
N D Drummond
Needs R J Towler M D Drummond N D López Ríos P
Nemec N
P López Ríos
R J Needs
Schmidt K E
Publication venue: 'IOP Publishing'
Publication date: 11/12/2009
Field of study

This topical review describes the methodology of continuum variational and diffusion quantum Monte Carlo calculations. These stochastic methods are based on many-body wave functions and are capable of achieving very high accuracy. The algorithms are intrinsically parallel and well-suited to petascale computers, and the computational cost scales as a polynomial of the number of particles. A guide to the systems and topics which have been investigated using these methods is given. The bulk of the article is devoted to an overview of the basic quantum Monte Carlo methods, the forms and optimisation of wave functions, performing calculations within periodic boundary conditions, using pseudopotentials, excited-state calculations, sources of calculational inaccuracy, and calculating energy differences and forces

arXiv.org e-Print Archive

Crossref

Lancaster E-Prints

Flux creep in type-II superconductors: self-organized criticality approach

Author: A. I. Larkin
A. I. Larkin
A. M. Campbell
H. J. Jensen
Ilya Papiashvili
L. I. Glazman
L. I. Glazman
P. Bak
P. G. de Gennes
R. G. Mints
Publication venue: 'American Physical Society (APS)'
Publication date: 31/10/2004
Field of study

We consider the current density distribution function of a flux creep regime in type-II superconductors by mapping the flux creep process to the dynamics of a model with a self-organized criticality. We use an extremal Robin Hood type model which evolves to Been's type critical state to treat magnetic flux penetration into a superconductor and derive an analog of the current-voltage characteristics in the flux creep region.Comment: 7 pages, 7 figure

arXiv.org e-Print Archive

Crossref