7,867 research outputs found

    AUTOMATIC ASSESSMENT MARK ENTRY SYSTEM USING LOCAL BINARY PATTERN (LBP)

    Get PDF
    Offline handwritten recognition continues to be a fundamental research problem in document analysis and retrieval. The common method used in extracting handwritten mark from assessment forms is to assign a person to manually type in the marks into a spreadsheet. This method is found to be very time consuming, not cost effective and prone to human mistakes. In this project, a number recognition system is developed using local binary pattern (LBP) technique to extract and convert students’ identity numbers and handwritten marks on assessment forms into a spreadsheet. The template of the score sheet is designed as in Appendix 1 to collect sample of handwritten numbers. The training data contain three sets of LBP histograms for each digit. The recognition rate of handwritten digits using LBP is about 50% because LBP could not fully describe the structure of the digits. Instead, LBP is useful in term of arranging the digits ‘0 to 9’ from highest similarity score to the lowest similarity score as compared to sample using chi square distance. The recognition rate is greatly improved to about 95% by verifying the output of chi square distance with the salient structural features of digits

    Off-line Thai handwriting recognition in legal amount

    Get PDF
    Thai handwriting in legal amounts is a challenging problem and a new field in the area of handwriting recognition research. The focus of this thesis is to implement Thai handwriting recognition system. A preliminary data set of Thai handwriting in legal amounts is designed. The samples in the data set are characters and words of the Thai legal amounts and a set of legal amounts phrases collected from a number of native Thai volunteers. At the preprocessing and recognition process, techniques are introduced to improve the characters recognition rates. The characters are divided into two smaller subgroups by their writing levels named body and high groups. The recognition rates of both groups are increased based on their distinguished features. The writing level separation algorithms are implemented using the size and position of characters. Empirical experiments are set to test the best combination of the feature to increase the recognition rates. Traditional recognition systems are modified to give the accumulative top-3 ranked answers to cover the possible character classes. At the postprocessing process level, the lexicon matching algorithms are implemented to match the ranked characters with the legal amount words. These matched words are joined together to form possible choices of amounts. These amounts will have their syntax checked in the last stage. Several syntax violations are caused by consequence faulty character segmentation and recognition resulting from connecting or broken characters. The anomaly in handwriting caused by these characters are mainly detected by their size and shape. During the recovery process, the possible word boundary patterns can be pre-defined and used to segment the hypothesis words. These words are identified by the word recognition and the results are joined with previously matched words to form the full amounts and checked by the syntax rules again. From 154 amounts written by 10 writers, the rejection rate is 14.9 percent with the recovery processes. The recognition rate for the accepted amount is 100 percent

    Hand Written Odia Character Recognition

    Get PDF
    The world is fast moving towards digitalization. In the age of super-fast computational capabilities, everything has to be made digitalized so as to make the computer understand and thereby process the given information. Optical character recognition is a method by which the computer is made to learn, understand and interpret the languages used and written by the human beings. It provides us a whole new way by which computer can interact with human beings, in their own languages. Hence OCR has been a topic of interest for researchers all around the globe in the past decade and research paper involving OCR is increasing day by day. It is seen that efficient algorithms have increased the speed and accuracy of character recognition. A substantial amount of work has been done on foreign languages such as English , Chinese etc. but very few paper are there for Indian languages baring a few for Hindi and Bengali. Hence our research work was directed towards development of a novel algorithm for Odia character recognition. Odia is one of the eighteen languages recognized by the Indian constituency. It is also one of the oldest languages and is spoken by more than 44 million people in the state of Odisha. Recognition of this particular language is difficult because of a number of similar looking characters and the presence of complex characters. A novel technique is proposed and implemented for the feature extraction method where by a set of 81 feature vectors are extracted to uniquely identify a particular character. The recognition is based on finding the minimum error by implementing the Euclidean distance method. After the implementation of the above technique, accuracy was found to be about 70 % which is much better than many techniques earlier available

    Character recognition and information retrieval

    Full text link
    Presented are two technologies, character recognition and information retrieval, that are used for text processing. Character recognition translates text image data to a computer-coded format; information retrieval stores these data and provides efficient access to the text. The necessity of their eventual coupling is obvious. Their sequential application though (with no manual intervention) has been considered impractical at best. Our experimentation exploits these two technologies in just this way. We identify problems with their combined use, as well as show that the technologies have come to a point where they can be applied in succession

    Deep Learning-based Recognition of Devanagari Handwritten Characters

    Get PDF
    Numerous techniques have been used over many years to study handwriting recognition. There are two methods for reading handwriting, one of which is online and the other offline. Image recognition is the main part of the handwriting recognition process. Image recognition gives careful consideration to the picture's dimensions, viewing angle, and image quality. Machine learning and deep learning techniques are the two areas of focus for developers looking to increase the intelligence of computers. A person may learn to perform a task by repeatedly exercising it until they recall how to do it. His brain's neurons begin to work automatically, enabling him to carry out the task he has quickly learned. This and deep learning are fairly similar. It uses a variety of neural network designs to address a range of problems. The convolution neural network (CNN) is a very effective technique for handwriting and picture detection
    corecore