12 research outputs found
Training Methods for Deep Neural Networks using Low Precision Dynamic Fixed-Point
νμλ
Όλ¬Έ (μμ¬)-- μμΈλνκ΅ λνμ : 곡과λν μ κΈ°Β·μ 보곡νλΆ, 2019. 2. μ΅κΈ°μ.μ¬μΈ΅μ κ²½λ§(Deep Neural NetworkDNN)μ λ°μ΄ν° μΈμ, μμ§, ν©μ± λ±μ λΆμΌμμ λμ μ νλλ₯Ό 보μ΄λ©° λ리 μ¬μ©λκ³ μλ€. μΌλ°μ μΌλ‘, μ¬μΈ΅μ κ²½λ§ νμ΅μμ μ¬μ©λλ μ°μ°μ μ ννμ μ νμ±κ³Ό λμ ννλ²μλ₯Ό νμλ‘ νκΈ°λλ¬Έμ 32λΉνΈ λ¨μΌμ λ°λ, λλ 64λΉνΈ λ°°μ λ°λμ λΆλ μμμ μκ° μ¬μ©λλ€. λ°λΌμ λΆλ μμμ μ μ¬μ©μΌλ‘ μΈν μ λ ₯ μλͺ¨μ μΉ© ν¬κΈ°, λ©λͺ¨λ¦¬ λμνμ μ€μ΄κΈ° μν΄ μ¬μΈ΅ μ κ²½λ§ νμ΅ μ°μ°μμμ λΉνΈ κΈΈμ΄λ₯Ό μ€μ΄κΈ° μν λ§μ μ°κ΅¬κ° μμλ€.
κ·Έ μ€, λμ κ³ μ μμμ (Dynamic Fixed-PointDFP)μ λΆλ μμμ κ³Ό κ³ μ μμμ μ κ²°ν©μ μΈ ννλ‘μ¨, μ¬μΈ΅μ κ²½λ§ νμ΅μ μ±κ³΅μ μΈ κ²°κ³Όλ₯Ό 보μλ μ νν μ²΄κ³ μ€ νλμ΄λ€. λμ κ³ μ μμμ μ μ¬μ©ν μ κ²½λ§ νμ΅μ λν΄ 16λΉνΈ μ°μ°μΌλ‘ κ°λ₯νλ€λ μ°κ΅¬λ€μ΄ μμμ§λ§, μ΄λ κ°μ€μΉ μ
λ°μ΄νΈλ₯Ό μ μΈνμ κ²½μ°λ‘ μ νλ κ²μ΄μλ€. κ°μ€μΉ μ
λ°μ΄νΈλ μμ κ°λ€μ μΆ©λΆν μΆμ νμ¬ κΈ°μ‘΄μ κ°μ€μΉμ λν΄μ§μΌλ‘μ¨ μ
λ°μ΄νΈ λ κ°μ€μΉ κ°μ μ κ²½λ§ νμ΅μ λ€μ μ¬μ©νλ€. λ°λΌμ μΌλ°μ μΈ μ κ²½λ§λ€μ κ°μ€μΉ μ
λ°μ΄νΈμλ λ λμ μ λ°λκ° νμνκΈ° λλ¬Έμ 32λΉνΈ λΆλ μμμ μλ₯Ό μ¬μ©νλ€. κ·Έλ¬λ λ€λ₯Έ μ°μ°λ€κ³Ό λΉνΈ κΈΈμ΄λ₯Ό λ¬λ¦¬ μ¬μ©νλ©΄ νλμ¨μ΄ μ€κ³κ° μ΄λ €μμ§ μ μλ€λ λ¬Έμ μ μ μ¬μ ν λ¨μμμλ€.
λ³Έ λ
Όλ¬Έμ κ°μ€μΉ μ
λ°μ΄νΈκΉμ§ ν¬ν¨νμ¬ μ 체μ μΈ μ κ²½λ§ νμ΅μ μ°μ°μ 16λΉνΈ λμ κ³ μ μμμ μ μ¬μ©ν μ μλ λ°©λ²λ€μ μ μνλ€. 첫 λ²μ§Έ λ°©λ²μ 'κ°μ€μΉ ν΄λ¦¬ν(Weight Clipping)'μΌλ‘μ¨, κ°μ€μΉ κ°λ€μ λ²μλ₯Ό μ ννμ¬ ν΄λΉ λ²μλ₯Ό μ΄κ³Όνλ μ λκ°μ΄ ν° κ°μ€μΉλ₯Ό μ ννκ³ μ‘°κΈ λ λ§μ κ°μ€μΉλ€μ΄ μ κ²½λ§ νμ΅μ μ μλ―Έν κ°μ μ μ§νλλ‘ νλλ‘ νλ λ°©λ²μ΄λ€. 'μ μ§μ λ°°μΉ ν¬κΈ° μ¦κ°λ²'μ μμ μ
λ°μ΄νΈ κ°μ μΈλΆ μ 보λ₯Ό 보쑴νμ¬ 16λΉνΈ κ°μ€μΉ μ
λ°μ΄νΈμ μλ¬ λ°μ°μ ν΄κ²°ν μ μλ€. λν, λ³Έ λ
Όλ¬Έμμλ μ΄ λ κ°μ§ λ°©λ²μ κ²°ν©ν¨μΌλ‘μ¨ λ λμ μ±λ₯μ νμ΅ κ°λ₯μ±μ νμΈνμλ€. μ€νμλ LeNet-5μ VGG-16 λ€νΈμν¬ μμμ CIFAR10, CIFAR100 λ°μ΄ν°μ
μ μ¬μ©νμμΌλ©°, κ°μ€μΉ μ
λ°μ΄νΈ μ°μ°μ ν¬ν¨ν μ 체μ μΈ νμ΅ μ°μ°μ 16 λΉνΈ λμ κ³ μ μμμ μλ₯Ό μ μ©νμλ€. κ·Έ κ²°κ³Ό, μ΄λ―Έμ§ λΆλ₯ μ νλ λΏλ§ μλλΌ νμ΅ μμ€λ₯ λ λΆλ μμμ μ κ·Όμ ν κ²°κ³Όλ₯Ό μ»μμΌλ‘μ¨ μ μ λ ₯ μΉ©μΌλ‘ λμ ν μ μλ μ¬μΈ΅ μ κ²½λ§μ νμ΅μ μ±κ³΅νμλ€.Deep Neural Networks (DNNs) are widely used in the areas of RMS (Recognition, Mining, and Synthesis) applications due to their high accuracies. In general, the operations in the DNN training simultaneously need wide dynamic range and high precision, thus the 32-bit single or 64-bit double precision floating-point operations are usually used. However, the hardware using floating-point requires many drawbacks. There have been many attempts to reduce the bit-widths of the DNN training operations, to reduce the power consumption, chip area, and memory bandwidth. One of the most successful attempts is the dynamic fixed-point (DFP), which is the combination of the floating-point and fixed-point formats. It has been reported that DFP can reduce the bit-widths of the training operations to 16 bits, except for parameter update operationsparameter updates for general networks need higher precision and are usually done with 32-bit floating point operations.
In this paper, we propose two methods of using 16-bit DFP for all the training operations including parameter updates. One is weight clipping method, which clips big weight values that exceed a certain upper bound to preserve enough precision for DNNs to be trained. Another one is gradual batch size increase method, which addresses the error divergence issue of the 16-bit parameter updates by preserving the details of the small update values. Lastly, we combine the two methods to further explore their potentials. We successfully apply 16-bit DFP operations on the entire training process including parameter updates, and achieve the same accuracies with LeNet-5 and VGG-16 networks using CIFAR-10 and CIFAR-100 datasets.Abstract i
Contents iii
List of Tables v
List of Figures vi
1 INTRODUCTION 1
2 Training DNN with Dynamic Fixed-Point 6
2.1 Numerical Representation Format . . . . . . . . . . . . . . . . . . . 6
2.1.1 Floating Point and Fixed Point . . . . . . . . . . . . . . . . . 6
2.1.2 Dynamic Fixed-Point . . . . . . . . . . . . . . . . . . . . . . 8
2.2 Training Result and Analysis of DNN using DFP . . . . . . . . . . . 10
3 Weight Clipping 15
3.1 Implementation of Weight Clipping . . . . . . . . . . . . . . . . . . 15
3.2 Experiment . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 17
3.2.1 Setup . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 17
3.2.2 Error and Loss Result . . . . . . . . . . . . . . . . . . . . . . 18
4 Gradual Batchsize Increase 23
4.1 Implementation of Gradual Batchsize Increase . . . . . . . . . . . . . 23
4.2 Experiment . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 25
4.2.1 Error and Loss Result . . . . . . . . . . . . . . . . . . . . . . 25
5 Combined Version of Two Method 29
6 Summary 33
Abstract (In Korean) 39Maste
Anti-neoplastic effect of selective COX-2 inhibitor on gastric cancer cells
Thesis(master`s)--μμΈλνκ΅ λνμ :μνκ³Ό λ΄κ³Όνμ 곡,2006.Maste
Effects of organic Ca supplements on Ca bioavailability and physiological functions in growing rats and ovariectomized osteoporotic model rats
νμλ
Όλ¬Έ (μμ¬)-- μμΈλνκ΅ λνμ : μνμμνκ³Ό, 2011.2. μ΄μ°μ.Maste
Development and Evaluation of a Tailored Gestational Diabetes Mellitus Management Smartphone Application
νμλ
Όλ¬Έ (μμ¬)-- μμΈλνκ΅ λνμ : κ°νΈνκ³Ό, 2014. 8. λ°νμ .λ³Έ μ°κ΅¬μ λͺ©μ μ μμ μ± λΉλ¨λ³ μκΈ°κ΄λ¦¬λ₯Ό λκΈ° μνμ¬ μμ μ± λΉλ¨λ³ κ΄λ¦¬ μμ μ€λ¬΄μ§μΉ¨μμ μμ μ± λΉλ¨λ³ κ΄λ¦¬μ νμν μ§μμ μΆμΆνκ³ μ¬μ©μ μꡬλ μ‘°μ¬λ₯Ό ν ν μ΄λ₯Ό λ°νμΌλ‘ λ§μΆ€ν κΆκ³ μμ κ°λ°νκ³ μ€λ§νΈν° μ±μ κ°λ°νλ λ° μλ€. λ³Έ μ°κ΅¬μμ κ°λ°ν μμ μ± λΉλ¨λ³ κ΄λ¦¬ μ±μ μμ€ν
κ°λ° μλͺ
μ£ΌκΈ°μ κ·Όκ±°νμ¬ λΆμ - μ€κ³ - ꡬν β νκ°μ κ³Όμ μ λ°λΌ μννμλ€.
λΆμ λ¨κ³μμλ μμ μ± λΉλ¨λ³ μ€λ¬΄ κ°μ΄λλΌμΈκ³Ό μ¬μ©μ μꡬλ μ‘°μ¬λ₯Ό ν΅ν΄ 9κ° κ΄λ¦¬μμμ μ§μκ³Ό 15κ°μ μμ€ν
κΈ°λ₯μ μΆμΆνμλ€. μΆμΆν 9κ° κ΄λ¦¬μμμμ 14κ° μΌλ°μ κ΅μ‘λ΄μ©κ³Ό 49κ° λ§μΆ€ν κΆκ³ μμ μΆμΆνμλ€. μ€κ³ λ¨κ³μμλ μ±μ κΈ°λ₯ μꡬμ¬νμ λ°νμΌλ‘ μ¬μ©μ μΈν°νμ΄μ€, μ§μ λ² μ΄μ€μ λ°μ΄ν°λ² μ΄μ€λ₯Ό μ€κ³νμλ€. κ·Έλ° λ€μ λ°μ΄ν°μ λ§μΆ€ν κΆκ³ μμ μ°κ²°νλ μκ³ λ¦¬μ¦μ κ°λ°νμλ€. ꡬν λ¨κ³μμ Eclipse κ°λ°νκ²½μμ Android Developer toolsλ₯Ό μ¬μ©νμ¬ Android 2.2λ²μ μμ 4.4λ²μ κΉμ§ μ΄μ κ°λ₯ν μ±μ Javaλ‘ κ°λ°νμλ€. νκ°λ¨κ³μμλ μ±μ μ€λ₯ νκ°μ μ¬μ©μ± νκ°, μμ© μλ νκ°λ₯Ό νμλ€. μ±μ μ€λ₯ νκ°μμλ 2λͺ
μ νκ°μκ° μλ리μ€λ₯Ό λ³΄κ³ μ μνλ κΆκ³ μκ³Ό κ° μλ리μ€μ μ μλ λ°μ΄ν°λ₯Ό μ±μ μ
λ ₯νλ©΄ μ±μ΄ μ μνλ κΆκ³ μμ λΉκ΅νμλ€. νκ° κ²°κ³Ό μ€λ₯κ° μλ κ²μΌλ‘ λνλ¬λ€. μ±μ μ¬μ©μ± νκ°μμλ μμ μ± λΉλ¨λ³ μμ°λΆ 5λͺ
μ λμμΌλ‘ μ±μ 1μ£ΌμΌκ° μ¬μ©ν λ€ System Usability Scaleμ μΈ‘μ νμλ€. μ¬μ©μ± νκ° κ²°κ³Ό νκ· 69.5μ μΌλ‘ 보ν΅μμ€μΌλ‘ λνλ¬λ€. μμ© μλ νκ°λ¨κ³μμλ μ¬μ©μ± νκ°μμ μ μλ μ견μ λ°μνμ¬ μ±μ μμ ν ν μ¬μ© μλ λ° μ¬μ© μλμ μν₯μ λ―ΈμΉλ λ΄μ¬μ λκΈ°, μ§κ°λ μ©μ΄μ±, μ§κ°λ μ μ©μ±μ νκ°νμλ€. νκ° κ²°κ³Ό 7μ λ§μ μ κ°κ° 5.5μ , 4.3μ , 4.6μ , 5.0μ μΌλ‘ λνλ¬λ€.
λ³Έ μ°κ΅¬λ μ²μμΌλ‘ μμ μ± λΉλ¨λ³ μλΆμ μ
λ ₯κ°μ λ°λ₯Έ λ§μΆ€ν κΆκ³ μμ μ 곡νλ μ€λ§νΈν° μ±μ κ°λ°ν μ°κ΅¬λ‘μ, λ³Έ μ°κ΅¬ κ³Όμ μμ μ»μ μ§μ, λ§μΆ€ν κΆκ³ μμ μμ μ± λΉλ¨λ³ κ΄λ¦¬μ λμμ΄ λ μ μμ κ²μ΄λ€.κ΅λ¬Έμ΄λ‘
β
. μλ‘
1. μ°κ΅¬μ νμμ±
2. μ°κ΅¬ λͺ©μ
3. μ©μ΄ μ μ
β
‘. λ¬Έν κ³ μ°°
1. μμ μ± λΉλ¨λ³ κ΄λ¦¬
2. λͺ¨λ°μΌ ν¬μ€μΌμ΄
3. κΈ°μ μμ©λͺ¨ν
β
’. μ°κ΅¬ λ°©λ²
1. μ§μ λ° κΈ°λ₯ μꡬμ¬ν λΆμ
2. μ€κ³
3. ꡬν
4. νκ°
5. μμ λ° λ³΄μ
6. μ±μ μμ© μλ νκ°
7. μ°κ΅¬μ μ€λ¦¬μ κ³ λ €
β
£. μ°κ΅¬ κ²°κ³Ό
1. μ§μ λ° κΈ°λ₯ μꡬμ¬ν λΆμ
2. μ€κ³
3. ꡬν
4. νκ°
5. μμ λ° λ³΄μ
6. μ±μ μμ© μλ νκ°
β
€. λ
Όμ
β
₯. κ²°λ‘ λ° μ μΈ
μ°Έκ³ λ¬Έν
λΆ λ‘
λΆλ‘ 1. μ°κ΅¬ μ€λ¦¬ μ¬μ κ²°κ³Ό ν΅λ³΄μ
λΆλ‘ 2. μ°κ΅¬ λꡬ μ¬μ© νκ°μ
λΆλ‘ 3. μꡬλ μ€λ¬Έμ§
λΆλ‘ 4. Data Dictionary
λΆλ‘ 5. μκ³ λ¦¬μ¦
λΆλ‘ 6. λ§μΆ€ν κΆκ³ μ
λΆλ‘ 7. μ± μ¬μ© μ€λͺ
μ
λΆλ‘ 8. μ± μ¬μ©μ± νκ° μ€λ¬Έμ§
λΆλ‘ 9. μμ© μλ νκ° μ€λ¬Έμ§
AbstractMaste
A Study on the Intracellular Distribution of Transaminases in the Liver of Mouse
Intracellular fractionation was carried out by ultracentrifugal
analysis in order to obtain the nuclear,
mitochondrial, and supernatant fractions in the liver
of mouse. Colorimetric analysis was performed to
estimate the distribution patterns of GOT and GPT
levels among the fractions.
Cortisone was administered intraperitoneally to
study the response of the transaminase activities
among the three intracellular fractions.
Along with study on the influence of cortisone, the
effects of dietary protein and fasting were investigated
to observe their effects of gluconeogenic activity
exerted upon the transaminase activity in the liver
of mouse, with the following conclusions.
1. The distributions of GOT and GPT are common
to all fractions, the activities in mitochondial fractions
being the highest and the nuclear the lowest.
2. GOT level is higher than GPT in every intra-cellular
fractions.
:l. Cortisone administration causes an increase in the activities of both transaminases in all intracellular
fractions, and its magnitude exerted upon GPT being
more stronger than GOT.
4. Conditions known to display gluconeogenic action,
such as high protein intake and fasting, cause increased
levels of GOT and GPT in all intracellular
fractions, the magnitude of which is far higher in
GPT than in GOT level, as observed with cortisone
administration.
5. In the conditions mentioned above, as cortisone
administration, increased level of mitochondrial GPT
is most pronounced among the three intracellular
fractions.
A brief discussion was made on the results, especially
on the significance of nuclear GOT and GPT as
well as variations of these enzymes in the intracellular
fractions resulting from gluconeoengesis
μ€νν΄μ μν λμ₯μ μΈν¬μ μ΄μ μ’ μμ μΈν¬ μ¬λ©Έ μ λ ν¨κ³Ό λ° μ₯μΌ-κ΄λ ¨ λμ₯μμ μ΅μ ν¨κ³Ό
Thesis(doctors) --μμΈλνκ΅ λνμ :μνκ³Ό(λ΄κ³Όν μ 곡),2009.2.Docto
Effect of Volume Fraction of Undissolved Cementite on the High Cycle Fatigue Properties of High Carbon Steels
Maste