2 research outputs found

    High Dimensional Markov Chain Monte Carlo with Multiple GPUs

    Get PDF
    ํ•™์œ„๋…ผ๋ฌธ (์„์‚ฌ)-- ์„œ์šธ๋Œ€ํ•™๊ต ๋Œ€ํ•™์› : ์ž์—ฐ๊ณผํ•™๋Œ€ํ•™ ํ†ต๊ณ„ํ•™๊ณผ, 2018. 2. ์ด์žฌ์šฉ.์ธ๊ณต์‹ ๊ฒฝ๋ง๊ณผ ๊ฐ™์€ ๋งŽ์€ ๊ณ„์‚ฐ์„ ์š”ํ•˜๋Š” ๋ชจํ˜•์ด ๋‹ค์–‘ํ•œ ๋ถ„์•ผ์—์„œ ํšจ๊ณผ์ ์ž„์ด ๋“œ๋Ÿฌ๋‚จ์— ๋”ฐ๋ผ ํ˜•๋ ฌ ์—ฐ์‚ฐ์„ ๋ณ‘๋ ฌ์ฒ˜๋ฆฌ ํ•˜๊ธฐ ์œ„ํ•ด ๊ทธ๋ž˜ํ”ฝ์นด๋“œ(GPU) ์ƒ ์—์„œ ๊ณ„์‚ฐํ•˜๋Š” ๊ฒƒ์ด ์ผ๋ฐ˜ํ™”๋˜๊ณ  ์žˆ์œผ๋ฉฐ ์ด๋ฅผ ์œ„ํ•ด ๊ณ„์‚ฐ์„ ์—ฌ๋Ÿฌ ์Šค๋ ˆ๋“œ๋กœ ๋‚˜๋ˆ„๋Š” ๋ฐฉ๋ฒ•์„ ์ฐพ๋Š” ๊ฒƒ์ด ์ค‘์š”ํ•ด์ง€๊ณ  ์žˆ๋‹ค. ๋ณธ ๋…ผ๋ฌธ์€ ๋ถ„ํ•  ๋œ ์ƒ˜ํ”Œ ๊ณต๊ฐ„์—์„œ ๋ธŒ๋ฆฌ์ง€ ์ƒ˜ํ”Œ๋ง๊ณผ ํ•ด๋ฐ€ํ† ๋‹ˆ์•ˆ ๋ชฌํ…Œ์นด๋ฅผ๋กœ๋ฅผ ๊ฒฐํ•ฉํ•˜์—ฌ ์—ฌ๋Ÿฌ GPU์— ๋ถ„์‚ฐ๋  ์ˆ˜ ์žˆ๋Š” ์ƒˆ๋กœ์šด MCMC ์•Œ๊ณ ๋ฆฌ์ฆ˜์„ ์ œ์‹œํ•œ๋‹ค. ์ด ์ ‘๊ทผ๋ฒ•์€ ๋ฒ ์ด์ง€์•ˆ ๋‰ด๋Ÿด ๋„คํŠธ์›Œํฌ (Bayesian Neural Network)์™€ ๊ฐ™์€ ํƒ€๊ฒŸ ๋ถ„ํฌ์— ๋Œ€ํ•œ MCMC ์ƒ˜ํ”Œ๋ง์„ ๋น ๋ฅด๊ฒŒ ํ•  ์ˆ˜ ์žˆ๋‹ค. ๋˜ํ•œ ๋‹ค์ค‘ ๋ชจ๋‹ฌ (Multimodality)์ด ์กด์žฌํ•  ๋•Œ ์ด ์•Œ๊ณ ๋ฆฌ์ฆ˜์€ ๋‚ฎ์€ ํ™•๋ฅ  ์˜์—ญ์—์„œ๋„ ์ƒ˜ํ”Œ๋ง์„ ํšจ์œจ์ ์œผ๋กœ ์ž˜ ํ• ์ˆ˜ ์žˆ๋Š”๊ฒƒ์œผ๋กœ ๋‚˜ํƒ€๋‚ฌ๋‹ค. ๋งˆ์ง€๋ง‰์œผ๋กœ ์ด ๋…ผ๋ฌธ์€ Adam Optimizer, ํ•ด๋ฐ€ํ† ๋‹ˆ์•ˆ ๋ชฌํ…Œ์นด๋ฅผ๋กœ์™€ ๊ฐ™์€ ๋‹ค๋ฅธ ํ•™์Šต ๋ฐฉ๋ฒ•์˜ ๋ณ€์ˆ˜ ๋ถ„ํฌ์™€ ๋ณธ ์•Œ๊ณ ๋ฆฌ์ฆ˜์˜ ๋ณ€์ˆ˜๋ถ„ํฌ๋ฅผ ๋น„๊ตํ•จ์œผ๋กœ์จ, ์ œํ•œ๋œ ํ‘œ๋ณธ ๊ณต๊ฐ„์ด ์ผ๋ฐ˜ํ™” ์˜ค์ฐจ์— ๋ฏธ์น˜๋Š” ์˜ํ–ฅ์— ๋Œ€ํ•œ ์ถ”๊ฐ€ ์—ฐ๊ตฌ๊ฐ€ ์ˆ˜ํ–‰ ๋  ์ˆ˜ ์žˆ์Œ์„ ์ œ์‹œํ•œ๋‹ค.Allocating computation over multiple threads to reduce running time has become a key to training big models such as deep neural networks because a Graphics Processing Unit (GPU), which is parallel in nature, can speed up intensive matrix operations. We present a new MCMC algorithm that can be distributed over multiple GPUs by combining bridge sampling with Hamiltonian Monte Carlo on partitioned sample spaces. We empirically show that this approach can expedite MCMC sampling for any unnormalized target distribution such as Bayesian Neural Network in a high dimensional setting. Furthermore, in the presence of multimodality, this algorithm is expected to be more efficient in mixing MCMC chains when proper partitions are chosen. Finally, by comparing the parameter distributions of different learning method, we suggest that further studies could be conducted on the effect of a constrained sample space on the generalization error.Abstract 3 Chapter 1 Introduction 9 1.1 Related Works 10 1.2 Contribution 12 Chapter 2 Hamiltonian Monte Carlo 13 2.1 Momentum proposal 15 2.2 Leap frog update 15 2.3 Metropolis Accept-Reject 16 Chapter 3 Bridged Hamiltonian Monte Carlo 18 3.1 Sampling from Partitioned Sample Space 18 3.1.1 Constrained HMC 19 3.2 Combining Samples from different Sample Space 20 3.2.1 Bridge Sampling 22 3.3 Practical Issues in Implementing Bridged Hamiltonian Monte Carlo 24 3.3.1 Numerical Overflow or Underflow 24 3.3.2 Partitioning Scheme 26 Chapter 4 Experiments 28 4.1 Bivariate Normal Mixture Model 28 4.2 Moon Data Classification 31 4.3 MNIST Data Classification 36 4.4 Result 37 Chapter 5 Discussion 42 ์ดˆ๋ก 48Maste

    On the distribution of throughput of transfer lines

    Get PDF
    Ankara : Department of Industrial Engineering and the Institute of Engineering and Sciences of Bilkent University, 1998.Thesis (Master's) -- Bilkent University, 1998.Includes bibliographical references leaves 86-107A transfer line corresponds to a manufacturing system consisting of a number of work stations in series integrated into one system by a common transfer mechanism and a control system. There is a vast literature on the transfer lines. However, little has been done on the transient analysis of these systems by making use of the higher order moments of their performance measures due to the difficulty in determining the evolution of the stochastic processes under consideration. This thesis examines the transient behavior of relatively short transfer lines and derives the distribution of the performance measures of interest. The proposed method based on the analytical derivation of the distribution of throughput is also applied to the systems with two-part types. An experiment is designed in order to compare the results of this study with the state-space representations and the simulation. They are also interpreted from the point of view of the line behavior and design issue. Furthermore, extensions are briefly discussed and directions for future research are suggested.Deler, BaharM.S
    corecore