1,665 research outputs found

    Shape Representation in Primate Visual Area 4 and Inferotemporal Cortex

    Get PDF
    The representation of contour shape is an essential component of object recognition, but the cortical mechanisms underlying it are incompletely understood, leaving it a fundamental open question in neuroscience. Such an understanding would be useful theoretically as well as in developing computer vision and Brain-Computer Interface applications. We ask two fundamental questions: “How is contour shape represented in cortex and how can neural models and computer vision algorithms more closely approximate this?” We begin by analyzing the statistics of contour curvature variation and develop a measure of salience based upon the arc length over which it remains within a constrained range. We create a population of V4-like cells – responsive to a particular local contour conformation located at a specific position on an object’s boundary – and demonstrate high recognition accuracies classifying handwritten digits in the MNIST database and objects in the MPEG-7 Shape Silhouette database. We compare the performance of the cells to the “shape-context” representation (Belongie et al., 2002) and achieve roughly comparable recognition accuracies using a small test set. We analyze the relative contributions of various feature sensitivities to recognition accuracy and robustness to noise. Local curvature appears to be the most informative for shape recognition. We create a population of IT-like cells, which integrate specific information about the 2-D boundary shapes of multiple contour fragments, and evaluate its performance on a set of real images as a function of the V4 cell inputs. We determine the sub-population of cells that are most effective at identifying a particular category. We classify based upon cell population response and obtain very good results. We use the Morris-Lecar neuronal model to more realistically illustrate the previously explored shape representation pathway in V4 – IT. We demonstrate recognition using spatiotemporal patterns within a winnerless competition network with FitzHugh-Nagumo model neurons. Finally, we use the Izhikevich neuronal model to produce an enhanced response in IT, correlated with recognition, via gamma synchronization in V4. Our results support the hypothesis that the response properties of V4 and IT cells, as well as our computer models of them, function as robust shape descriptors in the object recognition process

    Human Identification of Problematic Handwritten Digits for Pattern Recognition

    Get PDF
    After decades of work in pattern recognition, humans are still considered the best recognizers of images and symbols especially in unconstrained everyday applications. This has made the human visual model a major topic of interest in pattern recognition research. A number of studies have presented promising recognition models that incorporate different aspects of the human model such as selective attention, biologically plausible saliency detection and top-down recognition. On the other hand, the last hundred years of research in human eye movement behaviour has revived the ancient philosophical idea that we see in our mind’s eye. Several computational models of eye movement control were suggested that successfully predict eye movement behaviour demonstrating a close coupling between eye movements and underlying oculomotor and cognitive processes. In the present study, the author evaluates a combined approach to identifying features of interest for Pattern Recognition applications. In the data collection stage, sixty participants are asked to verbally identify fifty-four problematic and twenty prototypical handwritten digits. Both verbal responses and visual fixations are recorded for further analysis. In the analysis stage, a smaller set of ambiguous digit images is identified based on how often participants change their minds about the numeral they represent. For each digit, visual fixations are grouped based on the numeral that participants called out. Each fixation group is then combined into a single fixation heat map. Results show that by comparing and contrasting heat maps for a given digit the features deemed most disambiguating by the human model can be identified

    Fast and reliable recognition of human motion from motion trajectories using wavelet analysis

    Get PDF
    Recognition of human motion provides hints to understand human activities and gives opportunities to the development of new human-computer interface. Recent studies, however, are limited to extracting motion history image and recognizing gesture or locomotion of human body parts. Although the approach employed, i.e. the transformation of the 3D space-time (x-y-t) analysis to the 2D image analysis, is faster than analyzing 3D motion feature, it is less accurate and less robust in nature. In this paper, a fast trajectory-classification algorithm for interpreting movement of human body parts using wavelet analysis is proposed to increase the accuracy and robustness of human motion recognition. By tracking human body in real time, the motion trajectory (x-y-t) can be extracted. The motion trajectory is then broken down into wavelets that form a set of wavelet features. Classification based on the wavelet features can then be done to interpret the human motion. An online hand drawing digit recognition system was built using the proposed algorithm. Experiments show that the proposed algorithm is able to recognize digits from human movement accurately in real time.postprintThe 2004 IFIP International Conference on Artificial Intelligence Applications and Innovation, Toulouse, France, 22-27 August 2004. In Proceedings of the IFIP International Conference on Artificial Intelligence Applications and Innovation, 2004, p. 1-1
    • …
    corecore