
    Efficient Memory Management for GPU-based Deep Learning Systems

    GPUs (graphics processing units) are used for many data-intensive applications, and deep learning systems are among the most important GPU workloads today. As deep learning applications adopt deeper and larger models to achieve higher accuracy, memory management becomes an important research topic for deep learning systems, given that GPUs have limited memory. Many approaches have been proposed to address this issue, e.g., model compression and memory swapping, but they either degrade model accuracy or require substantial manual intervention. In this paper, we propose two orthogonal approaches that reduce memory cost from the system perspective. Our approaches are transparent to the models and thus do not affect model accuracy. Both exploit the iterative nature of deep learning training to derive the lifetime and read/write order of all variables. With the lifetime semantics, we implement a memory pool with minimal fragmentation. Because the underlying optimization problem is NP-complete, we propose a heuristic algorithm that reduces memory usage by up to 13.3% compared with Nvidia's default memory pool, at equal time complexity. With the read/write semantics, variables that are not in use can be swapped out from GPU to CPU memory to reduce the memory footprint. We propose multiple swapping strategies that automatically decide which variables to swap and when to swap them out (and back in), reducing memory cost by up to 34.2% without communication overhead.
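
    The abstract only sketches the pool heuristic, so the following is a rough illustration rather than the paper's actual algorithm: a minimal Python sketch of lifetime-aware offset assignment, in which variables whose lifetimes never overlap may share the same address range, and a greedy first-fit pass over variables sorted by size approximates the NP-complete packing problem. All names and the toy sizes are hypothetical.

        from dataclasses import dataclass

        @dataclass
        class Var:
            name: str
            size: int    # bytes
            start: int   # first op index that touches the variable
            end: int     # last op index that touches the variable

        def plan_pool(variables):
            """Greedy first-fit-decreasing placement: variables whose lifetimes
            overlap must occupy disjoint address ranges; all others may share
            memory.  Returns ({name: offset}, resulting pool size)."""
            placed = []    # (offset, var) pairs already assigned
            offsets = {}
            for v in sorted(variables, key=lambda v: v.size, reverse=True):
                # Address ranges blocked by lifetime-overlapping variables.
                blocked = sorted((off, off + u.size) for off, u in placed
                                 if not (u.end < v.start or v.end < u.start))
                offset = 0
                for lo, hi in blocked:
                    if offset + v.size <= lo:
                        break            # v fits in the gap before this range
                    offset = max(offset, hi)
                offsets[v.name] = offset
                placed.append((offset, v))
            pool_size = max((off + v.size for off, v in placed), default=0)
            return offsets, pool_size

        # Toy lifetimes: b and c never coexist, so they share one region.
        vars_ = [Var("a", 4096, 0, 5), Var("b", 2048, 0, 2), Var("c", 2048, 3, 5)]
        print(plan_pool(vars_))  # pool shrinks to 6144 bytes instead of 8192

    In the toy run, b dies before c is born, so both land at the same offset and the pool needs only 6144 bytes rather than the 8192 a naive per-variable allocation would use.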

    Source bearing and steering-vector estimation using partially calibrated arrays

    The problem of source direction-of-arrival (DOA) estimation using a sensor array is addressed, where some of the sensors are perfectly calibrated while others are uncalibrated. An algorithm is proposed that estimates the source directions together with unknown array parameters such as sensor gains and phases, thereby performing array self-calibration. The cost function extends the maximum likelihood (ML) criteria originally developed for DOA estimation with a perfectly calibrated array. A particle swarm optimization (PSO) algorithm is used to explore the high-dimensional problem space and find the global minimum of the cost function. The PSO design combines a problem-independent kernel with newly introduced problem-specific features such as search-space mapping, particle velocity control, and particle position clipping. This architecture, together with properly selected parameters, makes the PSO highly flexible and reusable while remaining sufficiently specific and effective for the current application. Simulation results demonstrate that the proposed technique produces more accurate estimates of the source bearings and unknown array parameters at lower computational cost than other popular methods, with the root-mean-squared error (RMSE) approaching and asymptotically attaining the Cramér-Rao bound (CRB) even under unfavorable conditions.
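
    The extended ML cost and the array model are specific to the paper, but a PSO kernel with the velocity control and position clipping the abstract mentions might look like the following Python/NumPy sketch. The cost function, bounds, and all constants here are illustrative assumptions; in the paper's setting, `cost` would be the extended ML criterion over the DOAs and the unknown sensor gains and phases.

        import numpy as np

        def pso_minimize(cost, lower, upper, n_particles=40, n_iters=200,
                         w=0.72, c1=1.49, c2=1.49, seed=0):
            """Generic PSO kernel (assumed parameters, not the paper's).
            `cost` maps a parameter vector to a scalar; `lower`/`upper`
            bound the search space.  Velocity clamping and position
            clipping keep particles inside the feasible region."""
            rng = np.random.default_rng(seed)
            lower, upper = np.asarray(lower, float), np.asarray(upper, float)
            dim = lower.size
            vmax = 0.2 * (upper - lower)            # velocity control
            x = rng.uniform(lower, upper, (n_particles, dim))
            v = rng.uniform(-vmax, vmax, (n_particles, dim))
            pbest = x.copy()
            pbest_f = np.array([cost(p) for p in x])
            gbest = pbest[pbest_f.argmin()].copy()
            for _ in range(n_iters):
                r1, r2 = rng.random((2, n_particles, dim))
                v = w * v + c1 * r1 * (pbest - x) + c2 * r2 * (gbest - x)
                v = np.clip(v, -vmax, vmax)         # velocity clamping
                x = np.clip(x + v, lower, upper)    # position clipping
                f = np.array([cost(p) for p in x])
                improved = f < pbest_f
                pbest[improved], pbest_f[improved] = x[improved], f[improved]
                gbest = pbest[pbest_f.argmin()].copy()
            return gbest, pbest_f.min()

        # Toy usage: recover the minimum of a smooth two-parameter surrogate.
        gbest, fmin = pso_minimize(lambda p: np.sum((p - [0.3, -0.7])**2),
                                   lower=[-1, -1], upper=[1, 1])
        print(gbest, fmin)

    The clamping step caps each velocity component at a fraction of the search range, which is one common way to realize the "particle velocity control" the abstract describes.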

    DOA Estimation for Local Scattered CDMA Signals by Particle Swarm Optimization

    This paper deals with direction-of-arrival (DOA) estimation of local scattered code-division multiple access (CDMA) signals based on a particle swarm optimization (PSO) search. For conventional spectral-searching estimators under local scattering, the search complexity and estimation accuracy depend strictly on the number of search grids used: high-resolution, accurate DOA estimation requires a small grid size, which is time-consuming, and it is unclear how to determine the required number of search grids in advance. In this paper, a modified PSO is presented to reduce the number of search grids required by the conventional spectral-searching estimator under the effects of local scattering. Finally, several computer simulations are provided for illustration and comparison.
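
    As a rough illustration of the grid-reduction idea (the CDMA local-scattering pseudo-spectrum itself is paper-specific and is stood in for by a toy function here), one can evaluate the spectral estimator on a coarse grid and then let a small PSO refine the estimate inside the winning cell, avoiding a dense fine grid entirely:

        import numpy as np

        def spectrum(theta):
            """Stand-in for a spectral DOA estimator; the real function
            would come from the CDMA signal model with local scattering.
            A single peak is placed at 17.3 degrees for this toy."""
            return 1.0 / (1e-3 + (np.sin(np.radians(theta)) -
                                  np.sin(np.radians(17.3)))**2)

        # Step 1: coarse grid (5-degree spacing instead of a dense fine grid).
        grid = np.arange(-90.0, 90.1, 5.0)
        theta0 = grid[np.argmax(spectrum(grid))]

        # Step 2: PSO refinement inside the winning coarse cell.
        rng = np.random.default_rng(1)
        lo, hi = theta0 - 5.0, theta0 + 5.0
        x = rng.uniform(lo, hi, 20)                 # particle positions
        v = np.zeros_like(x)
        pbest, pbest_f = x.copy(), spectrum(x)
        for _ in range(100):
            g = pbest[pbest_f.argmax()]             # global best (maximizing)
            r1, r2 = rng.random((2, x.size))
            v = 0.72 * v + 1.49 * r1 * (pbest - x) + 1.49 * r2 * (g - x)
            x = np.clip(x + v, lo, hi)
            f = spectrum(x)
            better = f > pbest_f
            pbest[better], pbest_f[better] = x[better], f[better]
        print(pbest[pbest_f.argmax()])              # ~17.3 without a fine grid

    The coarse pass needs only 37 spectrum evaluations to localize the peak's cell; the PSO then resolves the sub-grid position, which is the kind of complexity saving the paper targets.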