3,022 research outputs found

    Character-Aware Neural Language Models

    Full text link
    We describe a simple neural language model that relies only on character-level inputs. Predictions are still made at the word-level. Our model employs a convolutional neural network (CNN) and a highway network over characters, whose output is given to a long short-term memory (LSTM) recurrent neural network language model (RNN-LM). On the English Penn Treebank the model is on par with the existing state-of-the-art despite having 60% fewer parameters. On languages with rich morphology (Arabic, Czech, French, German, Spanish, Russian), the model outperforms word-level/morpheme-level LSTM baselines, again with fewer parameters. The results suggest that on many languages, character inputs are sufficient for language modeling. Analysis of word representations obtained from the character composition part of the model reveals that the model is able to encode, from characters only, both semantic and orthographic information.Comment: AAAI 201

    Performance Comparison of Radial Basis Function Networks and Probabilistic Neural Networks for Telugu Character Recognition

    Get PDF
    The research on recognition of hand written scanned images of documents has witnessed several problems, some of which include recognition of almost similar characters. Therefore it received attention from the fields of image processing and pattern recognition. The system of pattern recognition comprises a two step process. The first stage is the feature extraction and the second stage is the classification. In this paper, the authors propose two classification methods, both of which are based on artificial neural networks as a means to recognize hand written characters of Telugu, a language spoken by more than 100 million people of south India(Negi et al. ,2001). In this model, the authors used Radial Basis Function (RBF) networks and Probabilistic Neural Networks (PNN) for classification. These classifiers were further evaluated using performance metrics such as accuracy, sensitivity, specificity, Positive Predictive Value (PPV), Negative Predictive Value (NPV) and F measure. This paper is a comparison of results obtained with both the methods. The values of F measure are quite satisfactory and this is a good indication of the suitability of the methods for classification of characters. The values of F-Measure for both the methods approach the value of 1, which is a good indication and out of the two, RBF is a better method than PNN

    An Efficient Hidden Markov Model for Offline Handwritten Numeral Recognition

    Full text link
    Traditionally, the performance of ocr algorithms and systems is based on the recognition of isolated characters. When a system classifies an individual character, its output is typically a character label or a reject marker that corresponds to an unrecognized character. By comparing output labels with the correct labels, the number of correct recognition, substitution errors misrecognized characters, and rejects unrecognized characters are determined. Nowadays, although recognition of printed isolated characters is performed with high accuracy, recognition of handwritten characters still remains an open problem in the research arena. The ability to identify machine printed characters in an automated or a semi automated manner has obvious applications in numerous fields. Since creating an algorithm with a one hundred percent correct recognition rate is quite probably impossible in our world of noise and different font styles, it is important to design character recognition algorithms with these failures in mind so that when mistakes are inevitably made, they will at least be understandable and predictable to the person working with theComment: 6pages, 5 figure
    • …
    corecore