9 research outputs found

    Perbandingan Akurasi Pengenalan Karakter Plat Nomor Menggunakan Tesseract Dan Data Latih Emnist

    Get PDF
    Plat nomor merupakan identitas wajib terdiri dari huruf dan angka yang ada pada kendaraan. Plat nomor dapat dimanfaatkan dalam berbagai kebutuhan seperti sistem parkir, pengawasan lalu lintas, dan pengecekan identitas ketika terjadi kecelakaan. Pengenalan karakter dapat menggunakan Optical Character Recognition (OCR) yang melakukan metode template matching pada huruf dan angka. Menggunakan Convolutional Neural Network dengan melatih data EMINST untuk melakukan pengenalan karakter. Tujuan penelitian ini sebagai perbandingan penggunaan metode OCR menggunakan Tesseract dan CNN dalam melakukan pengenalan karakter. Data yang diuji sebanyak 58 citra mobil dengan 36 kelas karakter yang terdiri dari huruf dan angka. Pengujian pengenalan karakter menggunakan CNN pada data latih EMNIST menghasilkan kinerja yang kurang baik dengan 11 citra miliki akurasi diatas 75%. Penelitian ini menghasilkan pengenalan karakter terbaik pada Tesseract-OCR menggunakan segmentasi karakter pada plat nomor dengan 44 citra memiliki akurasi diatas 75%

    Interface Development for Digitization of Documents Using OCR

    Get PDF
    The purpose of this thesis is to develop a semi-automated interface that uses Optical Character Recognition (OCR) routines to identify text-based information from a large volume of digitized drawings associated with the oil and gas industry. The identified information is presented in an appropriate interface for any necessary manual modification, with the target of improving the efficiency of maintaining large amounts of older documents. The thesis outlines the design of the interface and the implementation of Tesseract OCR engine, in combination with tailor-made functions and classes that leverage OpenCV to enhance the recognition processThe purpose of this thesis is to develop a semi-automated interface that uses Optical Character Recognition (OCR) routines to identify text-based information from a large volume of digitized drawings associated with the oil and gas industry. The identified information is presented in an appropriate interface for any necessary manual modification, with the target of improving the efficiency of maintaining large amounts of older documents. The thesis outlines the design of the interface and the implementation of Tesseract OCR engine, in combination with tailor-made functions and classes that leverage OpenCV to enhance the recognition proces

    Interface Development for Digitization of Documents Using OCR

    Get PDF
    The purpose of this thesis is to develop a semi-automated interface that uses Optical Character Recognition (OCR) routines to identify text-based information from a large volume of digitized drawings associated with the oil and gas industry. The identified information is presented in an appropriate interface for any necessary manual modifica- tion, with the target of improving the efficiency of maintaining large amounts of older documents. The thesis outlines the design of the interface and the implementation of Tesseract OCR engine, in combination with tailor-made functions and classes that lever- age OpenCV to enhance the recognition process.The purpose of this thesis is to develop a semi-automated interface that uses Optical Character Recognition (OCR) routines to identify text-based information from a large volume of digitized drawings associated with the oil and gas industry. The identified information is presented in an appropriate interface for any necessary manual modifica- tion, with the target of improving the efficiency of maintaining large amounts of older documents. The thesis outlines the design of the interface and the implementation of Tesseract OCR engine, in combination with tailor-made functions and classes that lever- age OpenCV to enhance the recognition process

    ์ผ๋ฐ˜์ ์ธ ๋ฌธ์ž ์ด๋ฏธ์ง€์˜ ์–ธ์–ด๋ถ„๋ฅ˜

    Get PDF
    ํ•™์œ„๋…ผ๋ฌธ (๋ฐ•์‚ฌ) -- ์„œ์šธ๋Œ€ํ•™๊ต ๋Œ€ํ•™์› : ์ž์—ฐ๊ณผํ•™๋Œ€ํ•™ ํ˜‘๋™๊ณผ์ • ๊ณ„์‚ฐ๊ณผํ•™์ „๊ณต, 2021. 2. ๊ฐ•๋ช…์ฃผ.As other machine learning fields, there has been a lot of progress in text detection and recognition to obtain text information contained in images since the deep learning era. When multiple languages are mixed in the im- age, the process of recognition typically goes through a detection, language classification and recognition. This dissertation aims to classify languages of image patches which are the results of text detection. As far as we know, there are no prior research exactly targeting language classification of images. So we started from basic backbone networks that are used commonly in many other general object detection fields. With a ResNeSt-based network which is based on Resnet and automated pre-processing of ground-truth data to improve classification performance, we can achieve state of the art record of this task with a public benchmark dataset.๋‹ค๋ฅธ ๊ธฐ๊ณ„ํ•™์Šต๋ถ„์•ผ์™€ ๋งˆ์ฐฌ๊ฐ€์ง€๋กœ, ์ด๋ฏธ์ง€๊ฐ€ ๋‹ด๊ณ  ์žˆ๋Š” ๋ฌธ์ž์ •๋ณด๋ฅผ ์–ป์–ด ๋‚ด๋ ค๋Š” ๋ฌธ์ž์ธ์‹ ๋ถ„์•ผ์—์„œ๋„ ๋”ฅ๋Ÿฌ๋‹ ์ดํ›„ ๋งŽ์€ ์ง„์ „์ด ์žˆ์—ˆ๋‹ค. ์ธ์‹์˜ ๊ณผ์ •์€ ํ†ต์ƒ์ ์œผ๋กœ ๋ฌธ์ž๊ฒ€์ถœ, ๋ฌธ์ž์ธ์‹์˜ ๊ณผ์ •์„ ์ฐจ๋ก€๋กœ ๊ฑฐ์น˜๋Š”๋ฐ, ๋‹ค์ˆ˜์˜ ์–ธ์–ด๊ฐ€ ํ˜ผ์žฌํ•  ๊ฒฝ์šฐ ๊ฒ€์ถœ๊ณผ ์ธ์‹ ์‚ฌ์ด์— ์–ธ์–ด๋ถ„๋ฅ˜ ๋‹จ๊ณ„๋ฅผ ํ•œ๋ฒˆ ๋” ๊ฑฐ์น˜๋Š” ๊ฒƒ์ด ๋ณดํ†ต ์ด๋‹ค. ๋ณธ์—ฐ๊ตฌ๋Š”๋ฌธ์ž๊ฒ€์ถœ์ดํ›„์˜๋‹จ๊ณ„์—์„œ์ด๋ฏธ์ง€ํŒจ์น˜๋“ค์„๊ฐ์–ธ์–ด์—๋”ฐ๋ผ ๋ถ„๋ฅ˜ํ•˜๋Š” ๊ฒƒ์„ ๋ชฉํ‘œ๋กœ ํ•œ๋‹ค. ๋ถ„๋ฅ˜์ž‘์—…๋งŒ์„ ์ „๋ฌธ์ ์œผ๋กœ ๋‹ค๋ฃฌ ์„ ํ–‰์—ฐ๊ตฌ๊ฐ€ ์—†์œผ ๋ฏ€๋กœ, ์ผ๋ฐ˜์ ์ธ ๊ฐ์ฒด๊ฒ€์ถœ์—์„œ ์“ฐ์ด๋Š” ๋„คํŠธ์›Œํฌ ์ค‘์—์„œ ์ ์ ˆํ•œ ๊ฒƒ์„ ์„ ํƒํ•˜๊ณ  ์‘์šฉํ•˜์˜€๋‹ค. ResNeSt๋ฅผ ๊ธฐ๋ฐ˜์œผ๋กœํ•œ ๋„คํŠธ์›Œํฌ์™€ ์ž๋™ํ™”๋œ ์ „์ฒ˜๋ฆฌ ๊ณผ์ •์„ ํ†ตํ•ด ๊ณต๊ฐœ๋œ ๋ฒค์น˜๋งˆํฌ ๋ฐ์ดํ„ฐ์…‹์„ ๊ธฐ์ค€์œผ๋กœ ๊ฐ€์žฅ ์ข‹์€ ๊ธฐ๋ก์„ ๋‹ฌ์„ฑํ•  ์ˆ˜ ์žˆ์—ˆ๋‹ค.Abstract i 1 Introduction 1 1.1 OpticalCharacterRecognition.................. 1 1.2 DeepLearning........................... 2 2 Backgrounds 4 2.1 Detection ............................. 4 2.2 Recognition ............................ 5 2.3 LanguageClassification...................... 6 2.4 Multi-lingualText(MLT)..................... 7 2.5 ConvolutionalNeuralNetwork(CNN) . . . . . . . . . . . . . . 7 2.6 AttentionMechanism....................... 8 2.7 RelatedWorks........................... 9 2.7.1 Detectors ......................... 9 2.7.2 Recognizers ........................ 14 2.7.3 End-to-end methods (detector + recognizer) . . . . . . 14 2.8 Dataset .............................. 15 2.8.1 ICDARMLT ....................... 15 2.8.2 Syntheticdata:Gupta.................. 17 2.8.3 COCO-Text........................ 17 3 Proposed Methods 18 3.1 BaseNetworkSelection...................... 18 3.1.1 Googlenet ......................... 18 3.1.2 ShufflenetV2 ....................... 20 3.1.3 Resnet........................... 21 3.1.4 WideResnet........................ 23 3.1.5 ResNeXt.......................... 24 3.1.6 ResNeSt(Split-Attention network) ............ 24 3.1.7 Densenet.......................... 25 3.1.8 EfficientNet ........................ 25 3.1.9 Automaticsearch:AutoSTR .............. 27 3.2 Methods.............................. 28 3.2.1 Groundtruthcleansing.................. 28 3.2.2 Divide-and-stack ..................... 32 3.2.3 Usingadditionaldata................... 33 3.2.4 OHEM........................... 34 3.2.5 Network using the number of characters . . . . . . . . 35 3.2.6 UseofR-CNNstructure ................. 36 3.2.7 Highresolutioninput................... 39 3.2.8 Handling outliers using variant of OHEM . . . . . . . . 39 3.2.9 Variable sized input images using the attention . . . . 41 3.2.10 Classbalancing ...................... 41 3.2.11 Finetuningonspecificclasses.............. 42 3.2.12 Optimizerselection.................... 42 3.3 Result ............................... 42 4 Conclusion 44 Abstract (in Korean) 49Docto

    Development of Automatic Digitization of Truck Number in Open Cast Mines Using Microcontroller

    Get PDF
    Geological condition in mines appears to be extremely complicated and there are many intelligence security problems. Production is falsely transfer by the unauthorized truck from mine pits also at loading point. It also lifted in wrong ways by malfunctioning of the truck weight in Weigh Bridge. Mining organizations are under the control of mafia and countless can be added to the mines mafia. An intelligence security system is need to monitor truck number in automatically using image acquisition method, automatic detection, recognition process, communication technology, information technology and microcontroller innovation to understand the working specification of the mining region. Tracking of the number plate from the truck is an important task, which demands intelligent solution. Intelligent surveillance in open casts mine security network using data accession is a prime task that protects the secure production of mines. So automatic truck number recognition technique is used to recognize the registration number of the truck which is used for transferring the mine production as well as track record the amount of the production. It also preserves the mines and thus improving its security. For extraction and recognition of number plate from truck image the system is uses MATLAB software tool. It is assumed that images of the truck have been captured from digital camera. The data acquisition terminal uses the PIC16F877A microcontroller as a core chip for sending data. The data are communicated through USB to TTL converter (RS232) with the main circuit to realize intelligent monitoring. To store the data in permanently it is uses EEPROM chip. Alphanumeric Characters on plate has been extracted and recognized using template images of alphanumeric characters. The proposed system performs the real time data monitoring to recognize the registration number plate of the trucks for getting required important information. It also provides to maintenance the history of data and support access contro
    corecore