4,733 research outputs found

    Performance Comparison of Radial Basis Function Networks and Probabilistic Neural Networks for Telugu Character Recognition

    Get PDF
    The research on recognition of hand written scanned images of documents has witnessed several problems, some of which include recognition of almost similar characters. Therefore it received attention from the fields of image processing and pattern recognition. The system of pattern recognition comprises a two step process. The first stage is the feature extraction and the second stage is the classification. In this paper, the authors propose two classification methods, both of which are based on artificial neural networks as a means to recognize hand written characters of Telugu, a language spoken by more than 100 million people of south India(Negi et al. ,2001). In this model, the authors used Radial Basis Function (RBF) networks and Probabilistic Neural Networks (PNN) for classification. These classifiers were further evaluated using performance metrics such as accuracy, sensitivity, specificity, Positive Predictive Value (PPV), Negative Predictive Value (NPV) and F measure. This paper is a comparison of results obtained with both the methods. The values of F measure are quite satisfactory and this is a good indication of the suitability of the methods for classification of characters. The values of F-Measure for both the methods approach the value of 1, which is a good indication and out of the two, RBF is a better method than PNN

    Visual Dynamics: Stochastic Future Generation via Layered Cross Convolutional Networks

    Full text link
    We study the problem of synthesizing a number of likely future frames from a single input image. In contrast to traditional methods that have tackled this problem in a deterministic or non-parametric way, we propose to model future frames in a probabilistic manner. Our probabilistic model makes it possible for us to sample and synthesize many possible future frames from a single input image. To synthesize realistic movement of objects, we propose a novel network structure, namely a Cross Convolutional Network; this network encodes image and motion information as feature maps and convolutional kernels, respectively. In experiments, our model performs well on synthetic data, such as 2D shapes and animated game sprites, and on real-world video frames. We present analyses of the learned network representations, showing it is implicitly learning a compact encoding of object appearance and motion. We also demonstrate a few of its applications, including visual analogy-making and video extrapolation.Comment: Journal preprint of arXiv:1607.02586 (IEEE TPAMI, 2019). The first two authors contributed equally to this work. Project page: http://visualdynamics.csail.mit.ed

    SymbolDesign: A User-centered Method to Design Pen-based Interfaces and Extend the Functionality of Pointer Input Devices

    Full text link
    A method called "SymbolDesign" is proposed that can be used to design user-centered interfaces for pen-based input devices. It can also extend the functionality of pointer input devices such as the traditional computer mouse or the Camera Mouse, a camera-based computer interface. Users can create their own interfaces by choosing single-stroke movement patterns that are convenient to draw with the selected input device and by mapping them to a desired set of commands. A pattern could be the trace of a moving finger detected with the Camera Mouse or a symbol drawn with an optical pen. The core of the SymbolDesign system is a dynamically created classifier, in the current implementation an artificial neural network. The architecture of the neural network automatically adjusts according to the complexity of the classification task. In experiments, subjects used the SymbolDesign method to design and test the interfaces they created, for example, to browse the web. The experiments demonstrated good recognition accuracy and responsiveness of the user interfaces. The method provided an easily-designed and easily-used computer input mechanism for people without physical limitations, and, with some modifications, has the potential to become a computer access tool for people with severe paralysis.National Science Foundation (IIS-0093367, IIS-0308213, IIS-0329009, EIA-0202067
    • …
    corecore