1,797 research outputs found

    Handwriting-Based Gender Classification Using End-to-End Deep Neural Networks

    Full text link
    Handwriting-based gender classification is a well-researched problem that has been approached mainly by traditional machine learning techniques. In this paper, we propose a novel deep learning-based approach for this task. Specifically, we present a convolutional neural network (CNN), which performs automatic feature extraction from a given handwritten image, followed by classification of the writer's gender. Also, we introduce a new dataset of labeled handwritten samples, in Hebrew and English, of 405 participants. Comparing the gender classification accuracy on this dataset against human examiners, our results show that the proposed deep learning-based approach is substantially more accurate than that of humans

    A Multi-Feature Selection Approach for Gender Identification of Handwriting based on Kernel Mutual Information

    Get PDF
    This paper presents a new flexible approach to predict the gender of the writers from their handwriting samples. Handwriting features like slant, curvature, line separation, chain code, character shapes, and more, can be extracted from different methods. Therefore, the multi-feature sets are irrelevant and redundant. The conflict of the features exists in the sets, which affects the accuracy of classification and the computing cost. This paper proposes an approach, named Kernel Mutual Information (KMI), that focuses on feature selection. The KMI approach can decrease redundancies and conflicts. In addition, it extracts an optimal subset of features from the writing samples produced by male and female writers. To ensure that KMI can apply the various features, this paper describes the handwriting segmentation and handwritten text recognition technology used. The classification is carried out using a Support Vector Machine (SVM) on two databases. The first database comes from the ICDAR 2013 competition on gender prediction, which provides the samples in both Arabic and English. The other database contains the Registration-Document-Form (RDF) database in Chinese. The proposed and compared methods were evaluated on both databases. Results from the methods highlight the importance of feature selection for gender prediction from handwriting

    The self-concept of Arabic and English speaking bilingual and monolingual pupils with specific literacy difficulties

    Get PDF
    Researchers have conducted many studies to examine the academic, social and general self-concept of pupils of differing ages and in varied settings. Yet, not very much is known about the varied facets of self-concept of bilingual pupils and the monolingual who have specific literacy difficulties (SpLD). Furthermore, the influence of learning a second language on the self-concept or the motivation to learn a second language in the Arabic- English pupils in the Middle Eastern region has also not been addressed by any researchers. So, the main focus of this study was to examine the self-concept of bilingual (Arabic-English) and monolingual pupils who have specific literacy difficulties. The motivation to learn a foreign language and its impact on the pupils' English and general self-concept was also studied. This study used a mixed methodology design using a systematic survey followed by purposive case studies. Established measures were used to examine each facet of the self-concept moving from the literacy in both English and Arabic (reading, writing and spelling) to the maths self-concept and to a more general self-concept, academic self-concept and school self-concept. Furthermore, this study also examined the non-academic self-concept such as athletic self-concept and social self-concept among a group of bilingual (Arabic-English) and monolingual (Arabic) who have SpLD. The study was conducted in Oman in a bilingual private schools and monolingual state schools which included 99 pupils. A Foreign Language Learning Orientation Scale/ intrinsic – extrinsic motivation was also designed to measure the motivation to learning English as a second language. In phase two, this study examined the consistency between the pupils’ and Arabic and English teachers’ interview reports and the pupil's questionnaire for 6 bilingual pupils who had SpLD. This study compared 4 groups (monolingual SpLD, bilingual SpLD, monolingual typical literacy level and bilingual typical literacy level). The quantitative results showed differences between the four groups in terms of the self-concept. There were no differences in terms of the self-concept between the monolingual SpLD and bilingual SpLD in any facets of the self-concept. However, there were a significantly lower Arabic handwriting self concept, Arabic spelling self-concept and general school self-concept for monolingual SpLD pupils in comparison to their peers who had typical literacy level. Also bilingual pupils with SpLD showed significantly lower English reading self-concept, English spelling self-concept, and the general school self-concept than for the bilingual typical literacy pupils. The last comparison showed that there were significantly lower Arabic reading, Arabic handwriting, and Arabic spelling self-concept for the monolingual typical literacy levels in comparison to their bilingual typical literacy peers. In terms of intrinsic extrinsic motivation there were no significant differences shown between the SpLD bilingual and the bilingual typical literacy levels groups. According to the case study analysis there was a general inconsistency between the pupils’ interview and their questionnaire reports for their general, English and Arabic self-concept and the intrinsic and the extrinsic motivation for learning a foreign language. In many cases the pupils were negative about their literacy self-concept according to the questionnaire, but they perceived themselves more positively in the interview. In general, there was a tendency for both quantitative and qualitative results to indicate positive social self-concept for the bilingual and monolingual pupils who had SpLD and the 6 case studies. It was concluded that as research into self-concept of the bilingual (Arabic- English) is not well developed, more research is need in this area, especially in the Middle East using the same methods from this study. It is concluded that it is important for language assessors to consider assessing the literacy difficulties in two languages when the pupils are bilingual

    Automatic handwriter identification using advanced machine learning

    Get PDF
    Handwriter identification a challenging problem especially for forensic investigation. This topic has received significant attention from the research community and several handwriter identification systems were developed for various applications including forensic science, document analysis and investigation of the historical documents. This work is part of an investigation to develop new tools and methods for Arabic palaeography, which is is the study of handwritten material, particularly ancient manuscripts with missing writers, dates, and/or places. In particular, the main aim of this research project is to investigate and develop new techniques and algorithms for the classification and analysis of ancient handwritten documents to support palaeographic studies. Three contributions were proposed in this research. The first is concerned with the development of a text line extraction algorithm on colour and greyscale historical manuscripts. The idea uses a modified bilateral filtering approach to adaptively smooth the images while still preserving the edges through a nonlinear combination of neighboring image values. The proposed algorithm aims to compute a median and a separating seam and has been validated to deal with both greyscale and colour historical documents using different datasets. The results obtained suggest that our proposed technique yields attractive results when compared against a few similar algorithms. The second contribution proposes to deploy a combination of Oriented Basic Image features and the concept of graphemes codebook in order to improve the recognition performances. The proposed algorithm is capable to effectively extract the most distinguishing handwriter’s patterns. The idea consists of judiciously combining a multiscale feature extraction with the concept of grapheme to allow for the extraction of several discriminating features such as handwriting curvature, direction, wrinkliness and various edge-based features. The technique was validated for identifying handwriters using both Arabic and English writings captured as scanned images using the IAM dataset for English handwriting and ICFHR 2012 dataset for Arabic handwriting. The results obtained clearly demonstrate the effectiveness of the proposed method when compared against some similar techniques. The third contribution is concerned with an offline handwriter identification approach based on the convolutional neural network technology. At the first stage, the Alex-Net architecture was employed to learn image features (handwritten scripts) and the features obtained from the fully connected layers of the model. Then, a Support vector machine classifier is deployed to classify the writing styles of the various handwriters. In this way, the test scripts can be classified by the CNN training model for further classification. The proposed approach was evaluated based on Arabic Historical datasets; Islamic Heritage Project (IHP) and Qatar National Library (QNL). The obtained results demonstrated that the proposed model achieved superior performances when compared to some similar method

    Information Preserving Processing of Noisy Handwritten Document Images

    Get PDF
    Many pre-processing techniques that normalize artifacts and clean noise induce anomalies due to discretization of the document image. Important information that could be used at later stages may be lost. A proposed composite-model framework takes into account pre-printed information, user-added data, and digitization characteristics. Its benefits are demonstrated by experiments with statistically significant results. Separating pre-printed ruling lines from user-added handwriting shows how ruling lines impact people\u27s handwriting and how they can be exploited for identifying writers. Ruling line detection based on multi-line linear regression reduces the mean error of counting them from 0.10 to 0.03, 6.70 to 0.06, and 0.13 to 0.02, com- pared to an HMM-based approach on three standard test datasets, thereby reducing human correction time by 50%, 83%, and 72% on average. On 61 page images from 16 rule-form templates, the precision and recall of form cell recognition are increased by 2.7% and 3.7%, compared to a cross-matrix approach. Compensating for and exploiting ruling lines during feature extraction rather than pre-processing raises the writer identification accuracy from 61.2% to 67.7% on a 61-writer noisy Arabic dataset. Similarly, counteracting page-wise skew by subtracting it or transforming contours in a continuous coordinate system during feature extraction improves the writer identification accuracy. An implementation study of contour-hinge features reveals that utilizing the full probabilistic probability distribution function matrix improves the writer identification accuracy from 74.9% to 79.5%

    Design of an Offline Handwriting Recognition System Tested on the Bangla and Korean Scripts

    Get PDF
    This dissertation presents a flexible and robust offline handwriting recognition system which is tested on the Bangla and Korean scripts. Offline handwriting recognition is one of the most challenging and yet to be solved problems in machine learning. While a few popular scripts (like Latin) have received a lot of attention, many other widely used scripts (like Bangla) have seen very little progress. Features such as connectedness and vowels structured as diacritics make it a challenging script to recognize. A simple and robust design for offline recognition is presented which not only works reliably, but also can be used for almost any alphabetic writing system. The framework has been rigorously tested for Bangla and demonstrated how it can be transformed to apply to other scripts through experiments on the Korean script whose two-dimensional arrangement of characters makes it a challenge to recognize. The base of this design is a character spotting network which detects the location of different script elements (such as characters, diacritics) from an unsegmented word image. A transcript is formed from the detected classes based on their corresponding location information. This is the first reported lexicon-free offline recognition system for Bangla and achieves a Character Recognition Accuracy (CRA) of 94.8%. This is also one of the most flexible architectures ever presented. Recognition of Korean was achieved with a 91.2% CRA. Also, a powerful technique of autonomous tagging was developed which can drastically reduce the effort of preparing a dataset for any script. The combination of the character spotting method and the autonomous tagging brings the entire offline recognition problem very close to a singular solution. Additionally, a database named the Boise State Bangla Handwriting Dataset was developed. This is one of the richest offline datasets currently available for Bangla and this has been made publicly accessible to accelerate the research progress. Many other tools were developed and experiments were conducted to more rigorously validate this framework by evaluating the method against external datasets (CMATERdb 1.1.1, Indic Word Dataset and REID2019: Early Indian Printed Documents). Offline handwriting recognition is an extremely promising technology and the outcome of this research moves the field significantly ahead

    Transfer effects from language processing to visual attention dynamics: The impact of orthographic transparency

    Get PDF
    The consistency between letters and sounds varies across languages. These differences have been proposed to be associated with different reading mechanisms (lexical vs. phonological), processing grain sizes (coarse vs. fine) and attentional windows (whole words vs. individual letters). This study aimed to extend this idea to writing to dictation. For that purpose, we evaluated whether the use of different types of processing has a differential impact on local windowing attention: phonological (local) processing in a transparent language (Spanish) and lexical (global) processing of an opaque language (English). Spanish and English monolinguals (Experiment 1) and Spanish–English bilinguals (Experiment 2) performed a writing to dictation task followed by a global–local task. The first key performance showed a critical dissociation between languages: the response times (RTs) from the Spanish writing to dictation task was modulated by word length, whereas the RTs from the English writing to dictation task was modulated by word frequency and age of acquisition, as evidence that language transparency biases processing towards phonological or lexical strategies. In addition, after a Spanish task, participants more efficiently processed local information, which resulted in both the benefit of global congruent information and the reduced cost of incongruent global information. Additionally, the results showed that bilinguals adapt their attentional processing depending on the orthographic transparency.Doctoral Research Grant, Spanish Government FPU16/01748Feder Andalucia A-CTS111-UGR18 P20.00107Ministerio de Ciencia, Innovacion y Universidades-Fondos Feder A-SEJ-416-UGR20 PID2019-111359GB-I00/AEI/10.13039/501100011033 PGC2018-093786-B-I0

    End-Shape Analysis for Automatic Segmentation of Arabic Handwritten Texts

    Get PDF
    Word segmentation is an important task for many methods that are related to document understanding especially word spotting and word recognition. Several approaches of word segmentation have been proposed for Latin-based languages while a few of them have been introduced for Arabic texts. The fact that Arabic writing is cursive by nature and unconstrained with no clear boundaries between the words makes the processing of Arabic handwritten text a more challenging problem. In this thesis, the design and implementation of an End-Shape Letter (ESL) based segmentation system for Arabic handwritten text is presented. This incorporates four novel aspects: (i) removal of secondary components, (ii) baseline estimation, (iii) ESL recognition, and (iv) the creation of a new off-line CENPARMI ESL database. Arabic texts include small connected components, also called secondary components. Removing these components can improve the performance of several systems such as baseline estimation. Thus, a robust method to remove secondary components that takes into consideration the challenges in the Arabic handwriting is introduced. The methods reconstruct the image based on some criteria. The results of this method were subsequently compared with those of two other methods that used the same database. The results show that the proposed method is effective. Baseline estimation is a challenging task for Arabic texts since it includes ligature, overlapping, and secondary components. Therefore, we propose a learning-based approach that addresses these challenges. Our method analyzes the image and extracts baseline dependent features. Then, the baseline is estimated using a classifier. Algorithms dealing with text segmentation usually analyze the gaps between connected components. These algorithms are based on metric calculation, finding threshold, and/or gap classification. We use two well-known metrics: bounding box and convex hull to test metric-based method on Arabic handwritten texts, and to include this technique in our approach. To determine the threshold, an unsupervised learning approach, known as the Gaussian Mixture Model, is used. Our ESL-based segmentation approach extracts the final letter of a word using rule-based technique and recognizes these letters using the implemented ESL classifier. To demonstrate the benefit of text segmentation, a holistic word spotting system is implemented. For this system, a word recognition system is implemented. A series of experiments with different sets of features are conducted. The system shows promising results

    A review of Arabic text recognition dataset

    Get PDF
    Building a robust Optical Character Recognition (OCR) system for languages, such as Arabic with cursive scripts, has always been challenging. These challenges increase if the text contains diacritics of different sizes for characters and words. Apart from the complexity of the used font, these challenges must be addressed in recognizing the text of the Holy Quran. To solve these challenges, the OCR system would have to undergo different phases. Each problem would have to be addressed using different approaches, thus, researchers are studying these challenges and proposing various solutions. This has motivate this study to review Arabic OCR dataset because the dataset plays a major role in determining the nature of the OCR systems. State-of-the-art approaches in segmentation and recognition are discovered with the implementation of Recurrent Neural Networks (Long Short-Term Memory-LSTM and Gated Recurrent Unit-GRU) with the use of the Connectionist Temporal Classification (CTC). This also includes deep learning model and implementation of GRU in the Arabic domain. This paper has contribute in profiling the Arabic text recognition dataset thus determining the nature of OCR system developed and has identified research direction in building Arabic text recognition dataset
    • …
    corecore