5 research outputs found

    Design of an Offline Handwriting Recognition System Tested on the Bangla and Korean Scripts

    Get PDF
    This dissertation presents a flexible and robust offline handwriting recognition system which is tested on the Bangla and Korean scripts. Offline handwriting recognition is one of the most challenging and yet to be solved problems in machine learning. While a few popular scripts (like Latin) have received a lot of attention, many other widely used scripts (like Bangla) have seen very little progress. Features such as connectedness and vowels structured as diacritics make it a challenging script to recognize. A simple and robust design for offline recognition is presented which not only works reliably, but also can be used for almost any alphabetic writing system. The framework has been rigorously tested for Bangla and demonstrated how it can be transformed to apply to other scripts through experiments on the Korean script whose two-dimensional arrangement of characters makes it a challenge to recognize. The base of this design is a character spotting network which detects the location of different script elements (such as characters, diacritics) from an unsegmented word image. A transcript is formed from the detected classes based on their corresponding location information. This is the first reported lexicon-free offline recognition system for Bangla and achieves a Character Recognition Accuracy (CRA) of 94.8%. This is also one of the most flexible architectures ever presented. Recognition of Korean was achieved with a 91.2% CRA. Also, a powerful technique of autonomous tagging was developed which can drastically reduce the effort of preparing a dataset for any script. The combination of the character spotting method and the autonomous tagging brings the entire offline recognition problem very close to a singular solution. Additionally, a database named the Boise State Bangla Handwriting Dataset was developed. This is one of the richest offline datasets currently available for Bangla and this has been made publicly accessible to accelerate the research progress. Many other tools were developed and experiments were conducted to more rigorously validate this framework by evaluating the method against external datasets (CMATERdb 1.1.1, Indic Word Dataset and REID2019: Early Indian Printed Documents). Offline handwriting recognition is an extremely promising technology and the outcome of this research moves the field significantly ahead

    Segmentation-Free Korean Handwriting Recognition Using Neural Network Training

    Get PDF
    The idea of segmentation-free handwriting recognition has been introduced within the rise of deep learning. This technique is designed to recognize any script language/symbols as long as feedable training image set exists. The VGG-16 convolutional neural network model is used as a character spotting network using Faster R-CNN. Through the process of manual tagging, the location, size, and types of recognizable symbols are provided to train the network. This approach has been tested previously on text written in the Bangla script, where it has shown over 90% of accuracy overall. For Bangla, the network is trained and tested on Boise State Bangla Handwriting dataset. For Korean, the network is trained using the PE_92 Handwritten Korean character image database and shows promising results

    ๊ธฐ๊ธฐ ์ƒ์—์„œ์˜ ์‹ฌ์ธต ์‹ ๊ฒฝ๋ง ๊ฐœ์ธํ™” ๋ฐฉ๋ฒ•

    Get PDF
    ํ•™์œ„๋…ผ๋ฌธ (์„์‚ฌ)-- ์„œ์šธ๋Œ€ํ•™๊ต ๋Œ€ํ•™์› : ๊ณต๊ณผ๋Œ€ํ•™ ์ปดํ“จํ„ฐ๊ณตํ•™๋ถ€, 2019. 2. Egger, Bernhard.There exist several deep neural network (DNN) architectures suitable for embedded inference, however little work has focused on training neural networks on-device. User customization of DNNs is desirable due to the difficulty of collecting a training set representative of real world scenarios. Additionally, inter-user variation means that a general model has a limitation on its achievable accuracy. In this thesis, a DNN architecture that allows for low power on-device user customization is proposed. This approach is applied to handwritten character recognition of both the Latin and the Korean alphabets. Experiments show a 3.5-fold reduction of the prediction error after user customization for both alphabets compared to a DNN trained with general data. This architecture is additionally evaluated using a number of embedded processors demonstrating its practical application.๋‚ด์žฅํ˜• ๊ธฐ๊ธฐ์—์„œ ์‹ฌ์ธต ์‹ ๊ฒฝ๋ง์„ ์ถ”๋ก ํ•  ์ˆ˜ ์žˆ๋Š” ์•„ํ‚คํ…์ฒ˜๋“ค์€ ์กด์žฌํ•˜์ง€๋งŒ ๋‚ด์žฅํ˜• ๊ธฐ๊ธฐ์—์„œ ์‹ ๊ฒฝ๋ง์„ ํ•™์Šตํ•˜๋Š” ์—ฐ๊ตฌ๋Š” ๋ณ„๋กœ ์ด๋ค„์ง€์ง€ ์•Š์•˜๋‹ค. ์‹ค์ œ ํ™˜๊ฒฝ์„ ๋ฐ˜์˜ํ•˜๋Š” ํ•™์Šต์šฉ ๋ฐ์ดํ„ฐ ์ง‘ํ•ฉ์„ ๋ชจ์œผ๋Š” ๊ฒƒ์ด ์–ด๋ ต๊ณ  ์‚ฌ์šฉ์ž๊ฐ„์˜ ๋‹ค์–‘์„ฑ์œผ๋กœ ์ธํ•ด ์ผ๋ฐ˜์ ์œผ๋กœ ํ•™์Šต๋œ ๋ชจ๋ธ์ด ์ถฉ๋ถ„ํ•œ ์ •ํ™•๋„๋ฅผ ๊ฐ€์ง€๊ธฐ์—” ํ•œ๊ณ„๊ฐ€ ์กด์žฌํ•˜๊ธฐ ๋•Œ๋ฌธ์— ์‚ฌ์šฉ์ž ๋งž์ถคํ˜• ์‹ฌ์ธต ์‹ ๊ฒฝ๋ง์ด ํ•„์š”ํ•˜๋‹ค. ์ด ๋…ผ๋ฌธ์—์„œ๋Š” ๊ธฐ๊ธฐ์ƒ์—์„œ ์ €์ „๋ ฅ์œผ๋กœ ์‚ฌ์šฉ์ž ๋งž์ถคํ™”๊ฐ€ ๊ฐ€๋Šฅํ•œ ์‹ฌ์ธต ์‹ ๊ฒฝ๋ง ์•„ํ‚คํ…์ฒ˜๋ฅผ ์ œ์•ˆํ•œ๋‹ค. ์ด๋Ÿฌํ•œ ์ ‘๊ทผ ๋ฐฉ๋ฒ•์€ ๋ผํ‹ด์–ด์™€ ํ•œ๊ธ€์˜ ํ•„๊ธฐ์ฒด ๊ธ€์ž ์ธ์‹์— ์ ์šฉ๋œ๋‹ค. ๋ผํ‹ด์–ด์™€ ํ•œ๊ธ€์— ์‚ฌ์šฉ์ž ๋งž์ถคํ™”๋ฅผ ์ ์šฉํ•˜์—ฌ ์ผ๋ฐ˜์ ์ธ ๋ฐ์ดํ„ฐ๋กœ ํ•™์Šตํ•œ ์‹ฌ์ธต ์‹ ๊ฒฝ๋ง๋ณด๋‹ค 3.5๋ฐฐ๋‚˜ ์ž‘์€ ์˜ˆ์ธก ์˜ค๋ฅ˜์˜ ๊ฒฐ๊ณผ๋ฅผ ์–ป์—ˆ๋‹ค. ๋˜ํ•œ ์ด ์•„ํ‚คํ…์ฒ˜์˜ ์‹ค์šฉ์„ฑ์„ ๋ณด์—ฌ์ฃผ๊ธฐ ์œ„ํ•˜์—ฌ ๋‹ค์–‘ํ•œ ๋‚ด์žฅํ˜• ํ”„๋กœ์„ธ์„œ์—์„œ ์‹คํ—˜์„ ์ง„ํ–‰ํ•˜์˜€๋‹ค.Abstract i Contents iii List of Figures vii List of Tables ix Chapter 1 Introduction 1 Chapter 2 Motivation 4 Chapter 3 Background 6 3.1 Deep Neural Networks 6 3.1.1 Inference 6 3.1.2 Training 7 3.2 Convolutional Neural Networks 8 3.3 On-Device Acceleration 9 3.3.1 Hardware Accelerators 9 3.3.2 Software Optimization 10 Chapter 4 Methodology 12 4.1 Initialization 13 4.2 On-Device Training 14 Chapter 5 Implementation 16 5.1 Pre-processing 16 5.2 Latin Handwritten Character Recognition 17 5.2.1 Dataset and BIE Selection 17 5.2.2 AE Design 17 5.3 Korean Handwritten Character Recognition 21 5.3.1 Dataset and BIE Selection 21 5.3.2 AE Design 21 Chapter 6 On-Device Acceleration 26 6.1 Architecure Optimizations 27 6.2 Compiler Optimizations 29 Chapter 7 Experimental Setup 30 Chapter 8 Evaluation 33 8.1 Latin Handwritten Character Recognition 33 8.2 Korean Handwritten Character Recognition 38 8.3 On-Device Acceleration 40 Chapter 9 Related Work 44 Chapter 10 Conclusion 47 Bibliography 47 ์š”์•ฝ 55 Acknowledgements 56Maste
    corecore