2,906 research outputs found

    Handwritten Arabic character recognition: which feature extraction method?

    Get PDF
    Recognition of Arabic handwriting characters is a difficult task due to similar appearance of some different characters. However, the selection of the method for feature extraction remains the most important step for achieving high recognition accuracy. The purpose of this paper is to compare the effectiveness of Discrete Cosine Transform and Discrete Wavelet transform to capture discriminative features of Arabic handwritten characters. A new database containing 5600 characters covering all shapes of Arabic handwriting characters has also developed for the purpose of the analysis. The coefficients of both techniques have been used for classification based on a Artificial Neural Network implementation. The results have been analysed and the finding have demonstrated that a Discrete Cosine Transform based feature extraction yields a superior recognition than its counterpart

    Comprehensive collection for Arabic characters and numbers written by hand

    Get PDF
    An Optical Character Recognition system for Arabic language should recognize Arabic handwritten words. However, it is difficult to find a freely accessible and comprehensive database of all Arabic words that can be employed for this purpose. Therefore, it is more efficient to divide the Arabic words into sub-words or characters. As there is no comprehensive Arabic handwritten character database that is accessible free of charge, interested researchers can utilize the database developed as a part of this work in recognition system training and output testing.In the present paper, a database is presented containing scanned images of 700 Arabic handwritten characters, Hindi numbers used in Arabic countries, and some special characters utilized in Arabic alphabet, along with their different positions (e.g., standalone, initial, medial and terminal), different sizes, styles and font colors. The aim is to provide sufficient samples for all character shapes for software training, resulting in greater accuracy in the recognition phase.These forms were filled by students of the Applied Sciences College, University of Technology, Baghdad, Iraq and were scanned at the 200, 300, and 600 dpi resolution. A graphical user interface (GUI) software environment is employed to make the manipulation of the created database easier, and provide many image processing functions that are allowed to be built the database easie

    Recognition of isolated handwritten Arabic characters

    Get PDF
    The challenges that face the handwritten Arabic recognition are overwhelming such as different varieties of handwriting and few public databases available. Also, teaching the non-Arabic speaker at the young age is very difficult due to the unfamiliarity of the words and meanings. So, this project is focused on building a model of a deep learning architecture with convolutional neural network (CNN) and multilayer perceptron (MLP) neural network by using python programming language. This project analyzes the performance of a public database which is Arabic Handwritten Characters Dataset (AHCD). However, training this database with CNN model has achieved a test accuracy of 95.27% while training it with MLP model achieved 72.08%. Therefore, the CNN model is suitable to be used in the application device

    Huruf: An Application for Arabic Handwritten Character Recognition Using Deep Learning

    Full text link
    Handwriting Recognition has been a field of great interest in the Artificial Intelligence domain. Due to its broad use cases in real life, research has been conducted widely on it. Prominent work has been done in this field focusing mainly on Latin characters. However, the domain of Arabic handwritten character recognition is still relatively unexplored. The inherent cursive nature of the Arabic characters and variations in writing styles across individuals makes the task even more challenging. We identified some probable reasons behind this and proposed a lightweight Convolutional Neural Network-based architecture for recognizing Arabic characters and digits. The proposed pipeline consists of a total of 18 layers containing four layers each for convolution, pooling, batch normalization, dropout, and finally one Global average pooling and a Dense layer. Furthermore, we thoroughly investigated the different choices of hyperparameters such as the choice of the optimizer, kernel initializer, activation function, etc. Evaluating the proposed architecture on the publicly available 'Arabic Handwritten Character Dataset (AHCD)' and 'Modified Arabic handwritten digits Database (MadBase)' datasets, the proposed model respectively achieved an accuracy of 96.93% and 99.35% which is comparable to the state-of-the-art and makes it a suitable solution for real-life end-level applications.Comment: Accepted in 25th ICCIT (6 pages, 4 tables, 4 figures

    Spatial and Textural Aspects for Arabic Handwritten Characters Recognition

    Get PDF
    The purpose of the present paper is the recognition of handwritten Arabic characters in their isolated form. The specificity of Arabic characters is taken into consideration, each of the proposed feature extraction method integrates one of the two aspects: spatial and textural. In the first step, a modified Bitmap Sampling method is proposed, which converts the character’s images into a binary Matrix and then constructs a Mask for each class. A matching rate is used between the input binary matrix and the masks to determinate the corresponding class. In the second step we investigate the use of an Artificial Neural Network as classifier with the binary matrices as features and then the histograms of Local Binary Patterns to capture the texture aspect of the characters. Finally, the results of these two methods are combined to take into consideration both aspects at the same time. Tested on the Arabic set of the Isolated Farsi Handwritten Character Database, the proposed method has 2.82% error rate

    Component-based Segmentation of words from handwritten Arabic text

    Get PDF
    Efficient preprocessing is very essential for automatic recognition of handwritten documents. In this paper, techniques on segmenting words in handwritten Arabic text are presented. Firstly, connected components (ccs) are extracted, and distances among different components are analyzed. The statistical distribution of this distance is then obtained to determine an optimal threshold for words segmentation. Meanwhile, an improved projection based method is also employed for baseline detection. The proposed method has been successfully tested on IFN/ENIT database consisting of 26459 Arabic words handwritten by 411 different writers, and the results were promising and very encouraging in more accurate detection of the baseline and segmentation of words for further recognition
    • …
    corecore