16,086 research outputs found
Do You See What I Mean? Visual Resolution of Linguistic Ambiguities
Understanding language goes hand in hand with the ability to integrate
complex contextual information obtained via perception. In this work, we
present a novel task for grounded language understanding: disambiguating a
sentence given a visual scene which depicts one of the possible interpretations
of that sentence. To this end, we introduce a new multimodal corpus containing
ambiguous sentences, representing a wide range of syntactic, semantic and
discourse ambiguities, coupled with videos that visualize the different
interpretations for each sentence. We address this task by extending a vision
model which determines if a sentence is depicted by a video. We demonstrate how
such a model can be adjusted to recognize different interpretations of the same
underlying sentence, allowing to disambiguate sentences in a unified fashion
across the different ambiguity types.Comment: EMNLP 201
A Survey on the Use of Pattern Recognition Techniques
Pattern recognition is an innate cognitive process of matching information from the environment with the information stored in memory. Core methods are successful in many areas of numerical analysis, pattern recognition and machine learning. These are methods which generate an abstracting model from given observations (objects, measurements) in a training step, which subsequently allows generalizing statements for new observations. Various approaches are used to implement a pattern recognition system. In this paper we will discuss Statistical, Structural, hybrid and Neural Network based approach
Decision Fusion and Contextual Information for Arabic Words Recognition for Computing and Informatics
The study of multiple classifier systems has become recently an area of intensive research in pattern recognition. Also in handwriting recognition, systems combining several classifiers have been investigated. An approach for recognizing the legal amount for handwritten Arabic bank check is described in this article. The solution uses multiple information sources to recognize words. The recognition step is preformed with a parallel combination of three kinds of classifiers using holistic word structural features. The classification stage results are first normalized, and the sum combination is performed as a decision fusion scheme, after which a syntactic analyzer makes final decision on the candidate words. Using this approach, the obtained results are very interesting and promising
Integration of traditional imaging, expert systems, and neural network techniques for enhanced recognition of handwritten information
Includes bibliographical references (p. 33-37).Research supported by the I.F.S.R.C. at M.I.T.Amar Gupta, John Riordan, Evelyn Roman
- …