Search CORE

3 research outputs found

Printed Thai Character Recognition using Shape Classification in Video Sequence along a Line

Author: Chaiwatanaphan Sirikwan
Pluempitiwiriyawej Charnchai
Wangsiripitak Somkiat
Publication venue: 'Faculty of Engineering, Chulalongkorn University'
Publication date: 31/10/2017
Field of study

This paper presents a novel method for recognition of 68 printed Thai characters in image sequences captured along a line of characters, based on their shape appearance such as the height and width, the top, bottom, and right edges, the numbers and positions of the circles (head of Thai characters) and the end points.  Since each character appears in more than one frame of the image sequence that moves along the line, an algorithm to identify the arrangement of the characters in each line is necessary for accurate recognition results.  We tested our system on image sequences with four different Thai fonts. The recognition rate is about 85.64% correct

Engineering Journal (Faculty of Engineering, Chulalongkorn University, Bangkok)

Automated Data Digitization System for Vehicle Registration Certificates Using Google Cloud Vision API

Author: Intakosum Sarun
Kongkla Prateep
Sirisathitkul Yaowarat
Thammarak Karanrat
Publication venue: 'Ital Publication'
Publication date: 01/07/2022
Field of study

This study aims to develop an automated data digitization system for the Thai vehicle registration certificate. It is the first system developed as a web service Application Programming Interface (API), which is essential for any enterprise to increase its business value. Currently, this system is available on “www.carjaidee.com”. The system involves four steps: 1) an embedded frame aligns a document to be correctly recognised in the image acquisition step; 2) sharpening and brightness filtering techniques to enhance image quality are applied in the pre-processing step; 3) the Google Cloud Vision API receives a prompt to proceed in the recognition step; 4) a specific domain dictionary to improve accuracy rate is developed for the post-processing step. This study defines 92 images for the experiment by counting the correct words and terms from the output. The findings suggest that the proposed method, which had an average accuracy of 93.28%, was significantly more accurate than the original method using only the Google Cloud Vision API. However, the system is limited because the dictionaries cannot automatically recognise a new word. In the future, we will explore solutions to this problem using natural language processing techniques. Doi: 10.28991/CEJ-2022-08-07-09 Full Text: PD

Civil Engineering Journal (C.E.J)

Comparative analysis of Tesseract and Google Cloud Vision for Thai vehicle registration certificate

Author: Intakosum Sarun
Kongkla Prateep
Sirisathitkul Yaowarat
Thammarak Karanrat
Publication venue: 'Institute of Advanced Engineering and Science'
Publication date: 01/04/2022
Field of study

Optical character recognition (OCR) is a technology to digitize a paper-based document to digital form. This research studies the extraction of the characters from a Thai vehicle registration certificate via a Google Cloud Vision API and a Tesseract OCR. The recognition performance of both OCR APIs is also examined. The 84 color image files comprised three image sizes/resolutions and five image characteristics. For suitable image type comparison, the greyscale and binary image are converted from color images. Furthermore, the three pre-processing techniques, sharpening, contrast adjustment, and brightness adjustment, are also applied to enhance the quality of image before applying the two OCR APIs. The recognition performance was evaluated in terms of accuracy and readability. The results showed that the Google Cloud Vision API works well for the Thai vehicle registration certificate with an accuracy of 84.43%, whereas the Tesseract OCR showed an accuracy of 47.02%. The highest accuracy came from the color image with 1024×768 px, 300dpi, and using sharpening and brightness adjustment as pre-processing techniques. In terms of readability, the Google Cloud Vision API has more readability than the Tesseract. The proposed conditions facilitate the possibility of the implementation for Thai vehicle registration certificate recognition system

ZENODO

Institute of Advanced Engineering and Science