16,086 research outputs found

    Do You See What I Mean? Visual Resolution of Linguistic Ambiguities

    Get PDF
    Understanding language goes hand in hand with the ability to integrate complex contextual information obtained via perception. In this work, we present a novel task for grounded language understanding: disambiguating a sentence given a visual scene which depicts one of the possible interpretations of that sentence. To this end, we introduce a new multimodal corpus containing ambiguous sentences, representing a wide range of syntactic, semantic and discourse ambiguities, coupled with videos that visualize the different interpretations for each sentence. We address this task by extending a vision model which determines if a sentence is depicted by a video. We demonstrate how such a model can be adjusted to recognize different interpretations of the same underlying sentence, allowing to disambiguate sentences in a unified fashion across the different ambiguity types.Comment: EMNLP 201

    A Survey on the Use of Pattern Recognition Techniques

    Get PDF
    Pattern recognition is an innate cognitive process of matching information from the environment with the information stored in memory. Core methods are successful in many areas of numerical analysis, pattern recognition and machine learning. These are methods which generate an abstracting model from given observations (objects, measurements) in a training step, which subsequently allows generalizing statements for new observations. Various approaches are used to implement a pattern recognition system. In this paper we will discuss Statistical, Structural, hybrid and Neural Network based approach

    Decision Fusion and Contextual Information for Arabic Words Recognition for Computing and Informatics

    Get PDF
    The study of multiple classifier systems has become recently an area of intensive research in pattern recognition. Also in handwriting recognition, systems combining several classifiers have been investigated. An approach for recognizing the legal amount for handwritten Arabic bank check is described in this article. The solution uses multiple information sources to recognize words. The recognition step is preformed with a parallel combination of three kinds of classifiers using holistic word structural features. The classification stage results are first normalized, and the sum combination is performed as a decision fusion scheme, after which a syntactic analyzer makes final decision on the candidate words. Using this approach, the obtained results are very interesting and promising

    Integration of traditional imaging, expert systems, and neural network techniques for enhanced recognition of handwritten information

    Get PDF
    Includes bibliographical references (p. 33-37).Research supported by the I.F.S.R.C. at M.I.T.Amar Gupta, John Riordan, Evelyn Roman
    corecore