49 research outputs found

    Handwritten Arabic Digit Recognition Using Convolutional Neural Network

    Get PDF
    In Computer vision systems, computer vision works by imitating humans in their vision way which is known as the human vision system (HVS). In HVS, humans use their eyes and brains in order to see and classify any object around them. Hence, computer vision systems imitate HSV by developing several algorithms for classifying images and objects. The main goal of this paper is to propose a model for identifying and classifying the Arabic handwritten digits with high accuracy.  The concept of deep learning via the convolutional neural network (CNN) with the ADBase database is used to achieve the goal. The training is done by having a 3*3 and 5*5 filters. Basically, while the classification phase distinct learning rates are used to train the network. The obtained results are encouraging and promising

    NEW APPROACH FOR ONLINE ARABIC MANUSCRIPT RECOGNITION BY DEEP BELIEF NETWORK

    Get PDF
    In this paper, we present a neural approach for an unconstrained Arabic manuscript recognition using the online writing signal rather than images. First, we build the database which contains 2800 characters and 4800 words collected from 20 different handwritings. Thereafter, we will perform the pretreatment, feature extraction and classification phases, respectively. The use of a classical neural network methods has been beneficial for the character recognition, but revealed some limitations for the recognition rate of Arabic words. To remedy this, we used a deep learning through the Deep Belief Network (DBN) that resulted in a 97.08% success rate of recognition for Arabic words

    Ensemble learning using multi-objective optimisation for arabic handwritten words

    Get PDF
    Arabic handwriting recognition is a dynamic and stimulating field of study within pattern recognition. This system plays quite a significant part in today's global environment. It is a widespread and computationally costly function due to cursive writing, a massive number of words, and writing style. Based on the literature, the existing features lack data supportive techniques and building geometric features. Most ensemble learning approaches are based on the assumption of linear combination, which is not valid due to differences in data types. Also, the existing approaches of classifier generation do not support decision-making for selecting the most suitable classifier, and it requires enabling multi-objective optimisation to handle these differences in data types. In this thesis, new type of feature for handwriting using Segments Interpolation (SI) to find the best fitting line in each of the windows with a model for finding the best operating point window size for SI features. Multi-Objective Ensemble Oriented (MOEO) formulated to control the classifier topology and provide feedback support for changing the classifiers' topology and weights based on the extension of Non-dominated Sorting Genetic Algorithm (NSGA-II). It is designated as the Random Subset based Parents Selection (RSPS-NSGA-II) to handle neurons and accuracy. Evaluation metrics from two perspectives classification and Multiobjective optimization. The experimental design based on two subsets of the IFN/ENIT database. The first one consists of 10 classes (C10) and 22 classes (C22). The features were tested with Support Vector Machine (SVM) and Extreme Learning Machine (ELM). This work improved due to the SI feature. SI shows a significant result with SVM with 88.53% for C22. RSPS for C10 at k=2 achieved 91% accuracy with fewer neurons than NSGA-II, and for C22 at k=10, accuracy has been increased 81% compared to NSGA-II 78%. Future work may consider introducing more features to the system, applying them to other languages, and integrating it with sequence learning for more accuracy

    ONLINE ARABIC TEXT RECOGNITION USING STATISTICAL TECHNIQUES

    Get PDF

    Threshold center-symmetric local binary convolutional neural networks for bilingual handwritten digit recognition

    Get PDF
    Arabic and English handwritten digit recognition is a challenging problem because the writing style differs from one writer to another. In middle east countries, many official forms are prepared to be written using either Arabic or English languages. However, some people fill the form using both languages (Arabic and English), which adds more challenges to recognize digits. Nowadays, deep learning approaches are considered the hot trend of new research, including Convolutional Neural Networks (CNN). CNN is used in many applications and modified to produce other models such as Local Binary Convolutional Neural Networks (LBCNN). LBCNN was created by fusing Local Binary Pattern (LBP) with CNN by reformulating LBP as a convolution layer called Local Binary Convolution (LBC). However, LBCNN suffers from the random assign 1, 0, or -1 to LBC weights, making LBCNN less robust. Nevertheless, using another LBP-based technique such as Center-Symmetric Local Binary Patterns (CS-LBP) can address such issues. In this thesis, a new model based on CS-LBP is proposed called Center-Symmetric Local Binary Convolutional Neural Networks (CS-LBCNN) that addresses the issues of LBCNN. Further, an enhanced version of CS-LBCNN is proposed called Threshold Center-Symmetric Local Binary Convolutional Neural Networks (TCSLBCNN) that addresses another issue related to the zero-thresholding function. The proposed models are compared against state-of-the-art techniques that used the MNIST and MADBase as a bilingual dataset. The proposed TCS-LBCNN model proves its ability to give a more accurate and significant classification rate than the existing LBCNN models. For the bilingual dataset, the TCS-LBCNN enhances the performance of LBCNN and CS-LBCNN, in terms of accuracy, by 0.15% and 0.03%, respectively. In addition, the comparison shows that the accuracy acquired by TCS-LBCNN is the second-highest using the MNIST and MADBase datasets

    Automatic handwriter identification using advanced machine learning

    Get PDF
    Handwriter identification a challenging problem especially for forensic investigation. This topic has received significant attention from the research community and several handwriter identification systems were developed for various applications including forensic science, document analysis and investigation of the historical documents. This work is part of an investigation to develop new tools and methods for Arabic palaeography, which is is the study of handwritten material, particularly ancient manuscripts with missing writers, dates, and/or places. In particular, the main aim of this research project is to investigate and develop new techniques and algorithms for the classification and analysis of ancient handwritten documents to support palaeographic studies. Three contributions were proposed in this research. The first is concerned with the development of a text line extraction algorithm on colour and greyscale historical manuscripts. The idea uses a modified bilateral filtering approach to adaptively smooth the images while still preserving the edges through a nonlinear combination of neighboring image values. The proposed algorithm aims to compute a median and a separating seam and has been validated to deal with both greyscale and colour historical documents using different datasets. The results obtained suggest that our proposed technique yields attractive results when compared against a few similar algorithms. The second contribution proposes to deploy a combination of Oriented Basic Image features and the concept of graphemes codebook in order to improve the recognition performances. The proposed algorithm is capable to effectively extract the most distinguishing handwriter’s patterns. The idea consists of judiciously combining a multiscale feature extraction with the concept of grapheme to allow for the extraction of several discriminating features such as handwriting curvature, direction, wrinkliness and various edge-based features. The technique was validated for identifying handwriters using both Arabic and English writings captured as scanned images using the IAM dataset for English handwriting and ICFHR 2012 dataset for Arabic handwriting. The results obtained clearly demonstrate the effectiveness of the proposed method when compared against some similar techniques. The third contribution is concerned with an offline handwriter identification approach based on the convolutional neural network technology. At the first stage, the Alex-Net architecture was employed to learn image features (handwritten scripts) and the features obtained from the fully connected layers of the model. Then, a Support vector machine classifier is deployed to classify the writing styles of the various handwriters. In this way, the test scripts can be classified by the CNN training model for further classification. The proposed approach was evaluated based on Arabic Historical datasets; Islamic Heritage Project (IHP) and Qatar National Library (QNL). The obtained results demonstrated that the proposed model achieved superior performances when compared to some similar method

    A review on deep-learning-based cyberbullying detection

    Get PDF
    Bullying is described as an undesirable behavior by others that harms an individual physically, mentally, or socially. Cyberbullying is a virtual form (e.g., textual or image) of bullying or harassment, also known as online bullying. Cyberbullying detection is a pressing need in today’s world, as the prevalence of cyberbullying is continually growing, resulting in mental health issues. Conventional machine learning models were previously used to identify cyberbullying. However, current research demonstrates that deep learning surpasses traditional machine learning algorithms in identifying cyberbullying for several reasons, including handling extensive data, efficiently classifying text and images, extracting features automatically through hidden layers, and many others. This paper reviews the existing surveys and identifies the gaps in those studies. We also present a deep-learning-based defense ecosystem for cyberbullying detection, including data representation techniques and different deep-learning-based models and frameworks. We have critically analyzed the existing DL-based cyberbullying detection techniques and identified their significant contributions and the future research directions they have presented. We have also summarized the datasets being used, including the DL architecture being used and the tasks that are accomplished for each dataset. Finally, several challenges faced by the existing researchers and the open issues to be addressed in the future have been presented
    corecore