3,022 research outputs found
Character-Aware Neural Language Models
We describe a simple neural language model that relies only on
character-level inputs. Predictions are still made at the word-level. Our model
employs a convolutional neural network (CNN) and a highway network over
characters, whose output is given to a long short-term memory (LSTM) recurrent
neural network language model (RNN-LM). On the English Penn Treebank the model
is on par with the existing state-of-the-art despite having 60% fewer
parameters. On languages with rich morphology (Arabic, Czech, French, German,
Spanish, Russian), the model outperforms word-level/morpheme-level LSTM
baselines, again with fewer parameters. The results suggest that on many
languages, character inputs are sufficient for language modeling. Analysis of
word representations obtained from the character composition part of the model
reveals that the model is able to encode, from characters only, both semantic
and orthographic information.Comment: AAAI 201
Performance Comparison of Radial Basis Function Networks and Probabilistic Neural Networks for Telugu Character Recognition
The research on recognition of hand written scanned images of documents has witnessed several problems, some of which include recognition of almost similar characters. Therefore it received attention from the fields of image processing and pattern recognition. The system of pattern recognition comprises a two step process. The first stage is the feature extraction and the second stage is the classification. In this paper, the authors propose two classification methods, both of which are based on artificial neural networks as a means to recognize hand written characters of Telugu, a language spoken by more than 100 million people of south India(Negi et al. ,2001). In this model, the authors used Radial Basis Function (RBF) networks and Probabilistic Neural Networks (PNN) for classification. These classifiers were further evaluated using performance metrics such as accuracy, sensitivity, specificity, Positive Predictive Value (PPV), Negative Predictive Value (NPV) and F measure. This paper is a comparison of results obtained with both the methods. The values of F measure are quite satisfactory and this is a good indication of the suitability of the methods for classification of characters. The values of F-Measure for both the methods approach the value of 1, which is a good indication and out of the two, RBF is a better method than PNN
An Efficient Hidden Markov Model for Offline Handwritten Numeral Recognition
Traditionally, the performance of ocr algorithms and systems is based on the
recognition of isolated characters. When a system classifies an individual
character, its output is typically a character label or a reject marker that
corresponds to an unrecognized character. By comparing output labels with the
correct labels, the number of correct recognition, substitution errors
misrecognized characters, and rejects unrecognized characters are determined.
Nowadays, although recognition of printed isolated characters is performed with
high accuracy, recognition of handwritten characters still remains an open
problem in the research arena. The ability to identify machine printed
characters in an automated or a semi automated manner has obvious applications
in numerous fields. Since creating an algorithm with a one hundred percent
correct recognition rate is quite probably impossible in our world of noise and
different font styles, it is important to design character recognition
algorithms with these failures in mind so that when mistakes are inevitably
made, they will at least be understandable and predictable to the person
working with theComment: 6pages, 5 figure
- …