2,906 research outputs found
Handwritten Arabic character recognition: which feature extraction method?
Recognition of Arabic handwriting characters is a difficult task due to similar appearance of some different characters. However, the selection of the method for feature extraction remains the most important step for achieving high recognition accuracy. The purpose of this paper is to compare the effectiveness of Discrete Cosine Transform and Discrete Wavelet transform to capture discriminative features of Arabic handwritten characters. A new database containing 5600 characters covering all shapes of Arabic handwriting characters has also developed for the purpose of the analysis. The coefficients of both techniques have been used for classification based on a Artificial Neural Network implementation. The results have been analysed and the finding have demonstrated that a Discrete Cosine Transform based feature extraction yields a superior recognition than its counterpart
Comprehensive collection for Arabic characters and numbers written by hand
An Optical Character Recognition system for Arabic language should recognize Arabic handwritten words. However, it is difficult to find a freely accessible and comprehensive database of all Arabic words that can be employed for this purpose. Therefore, it is more efficient to divide the Arabic words into sub-words or characters. As there is no comprehensive Arabic handwritten character database that is accessible free of charge, interested researchers can utilize the database developed as a part of this work in recognition system training and output testing.In the present paper, a database is presented containing scanned images of 700 Arabic handwritten characters, Hindi numbers used in Arabic countries, and some special characters utilized in Arabic alphabet, along with their different positions (e.g., standalone, initial, medial and terminal), different sizes, styles and font colors. The aim is to provide sufficient samples for all character shapes for software training, resulting in greater accuracy in the recognition phase.These forms were filled by students of the Applied Sciences College, University of Technology, Baghdad, Iraq and were scanned at the 200, 300, and 600 dpi resolution. A graphical user interface (GUI) software environment is employed to make the manipulation of the created database easier, and provide many image processing functions that are allowed to be built the database easie
Recognition of isolated handwritten Arabic characters
The challenges that face the handwritten Arabic recognition are overwhelming such as different varieties of handwriting and
few public databases available. Also, teaching the non-Arabic speaker at the young age is very difficult due to the
unfamiliarity of the words and meanings. So, this project is focused on building a model of a deep learning architecture with
convolutional neural network (CNN) and multilayer perceptron (MLP) neural network by using python programming
language. This project analyzes the performance of a public database which is Arabic Handwritten Characters Dataset
(AHCD). However, training this database with CNN model has achieved a test accuracy of 95.27% while training it with MLP
model achieved 72.08%. Therefore, the CNN model is suitable to be used in the application device
Huruf: An Application for Arabic Handwritten Character Recognition Using Deep Learning
Handwriting Recognition has been a field of great interest in the Artificial
Intelligence domain. Due to its broad use cases in real life, research has been
conducted widely on it. Prominent work has been done in this field focusing
mainly on Latin characters. However, the domain of Arabic handwritten character
recognition is still relatively unexplored. The inherent cursive nature of the
Arabic characters and variations in writing styles across individuals makes the
task even more challenging. We identified some probable reasons behind this and
proposed a lightweight Convolutional Neural Network-based architecture for
recognizing Arabic characters and digits. The proposed pipeline consists of a
total of 18 layers containing four layers each for convolution, pooling, batch
normalization, dropout, and finally one Global average pooling and a Dense
layer. Furthermore, we thoroughly investigated the different choices of
hyperparameters such as the choice of the optimizer, kernel initializer,
activation function, etc. Evaluating the proposed architecture on the publicly
available 'Arabic Handwritten Character Dataset (AHCD)' and 'Modified Arabic
handwritten digits Database (MadBase)' datasets, the proposed model
respectively achieved an accuracy of 96.93% and 99.35% which is comparable to
the state-of-the-art and makes it a suitable solution for real-life end-level
applications.Comment: Accepted in 25th ICCIT (6 pages, 4 tables, 4 figures
Spatial and Textural Aspects for Arabic Handwritten Characters Recognition
The purpose of the present paper is the recognition of handwritten Arabic characters in their isolated form. The specificity of Arabic characters is taken into consideration, each of the proposed feature extraction method integrates one of the two aspects: spatial and textural. In the first step, a modified Bitmap Sampling method is proposed, which converts the character’s images into a binary Matrix and then constructs a Mask for each class. A matching rate is used between the input binary matrix and the masks to determinate the corresponding class. In the second step we investigate the use of an Artificial Neural Network as classifier with the binary matrices as features and then the histograms of Local Binary Patterns to capture the texture aspect of the characters. Finally, the results of these two methods are combined to take into consideration both aspects at the same time. Tested on the Arabic set of the Isolated Farsi Handwritten Character Database, the proposed method has 2.82% error rate
Component-based Segmentation of words from handwritten Arabic text
Efficient preprocessing is very essential for automatic recognition of handwritten documents. In this paper, techniques on segmenting words in handwritten Arabic text are presented. Firstly, connected components (ccs) are extracted, and distances among different components are analyzed. The statistical distribution of this distance is then obtained to determine an optimal threshold for words segmentation. Meanwhile, an improved projection based method is also employed for baseline detection. The proposed method has been successfully tested on IFN/ENIT database consisting of 26459 Arabic words handwritten by 411 different writers, and the results were promising and very encouraging in more accurate detection of the baseline and segmentation of words for further recognition
- …