9 research outputs found

    A language-independent, openvocabulary system based on HMMs for recognition of ultra low resolution words

    Get PDF
    Abstract: In this paper, we introduce and evaluate a system capable of recognizing words extracted from ultra low resolution images such as those frequently embedded on web pages. The design of the system has been driven by the following constraints. First, the system has to recognize small font sizes between 6-12 points where anti-aliasing and resampling filters are applied. Such procedures add noise between adjacent characters in the words and complicate any a priori segmentation of the characters. Second, the system has to be able to recognize any words in an open vocabulary setting, potentially mixing different languages in Latin alphabet. Finally, the training procedure must be automatic, i.e. without requesting to extract, segment and label manually a large set of data. These constraints led us to an architecture based on ergodic HMMs where states are associated to the characters. We also introduce several improvements of the performance increasing the order of the emission probability estimators, including minimum and maximum width constraints on the character models and a training set consisting all possible adjacency cases of Latin characters. The proposed system is evaluated on different font sizes and families, showing good robustness for sizes down to 6 points

    A language-independent, openvocabulary system based on HMMs for recognition of ultra low resolution words

    Get PDF
    ABSTRACT In this paper, we introduce and evaluate a system capable of recognizing ultra low resolution words extracted from images such as those frequently embedded on web pages. The design of the system has been driven by the following constraints. First, the system has to recognize small font sizes where antialiasing and resampling procedures have been applied. Such procedures add noise on the patterns and complicate any a priori segmentation of the characters. Second, the system has to be able to recognize any words in an open vocabulary setting, potentially mixing different languages. Finally, the training procedure must be automatic, i.e. without requesting to extract, segment and label manually a large set of data. These constraints led us to an architecture based on ergodic HMMs where states are associated to the characters. We also introduce several improvements of the performance increasing the order of the emission probability estimators and including minimum and maximum duration constraints on the character models. The proposed system is evaluated on different font sizes and families, showing good robustness for sizes down to 6 points

    Text Detection and Recognition for Person Identification in Video

    Get PDF
    Demo: cluster 1International audienceThis article presents a demo of person search in audiovisual broadcast using only the text available in a video and in resources external to the video. We also present the different steps used to recognize characters in video for multi-modal person recognition systems. Text detection is realized using the text features (texture, color, contrast, geometry, temporal information). The text recognition itself is performed by the Google Tesseract free software. The method was successfully evaluated on a broadcast news corpus that contains 59 videos from the France 2 French TV channel

    A HMM-Based Approach to Recognize Ultra Low Resolution Anti-Aliased Words

    No full text

    A HMM-Based Approach to Recognize Ultra Low Resolution Anti-Aliased Words

    No full text
    Abstract. In this paper, we present a HMM based system that is used to recognize ultra low resolution text such as those frequently embedded in images available on the web. We propose a system that takes specifically the challenges of recognizing text in ultra low resolution images into account. In addition to this, we show in this paper that word models can be advantageously built connecting together sub-HMM-character models and inter-character state. Finally we report on the promising performance of the system using HMM topologies which have been improved to take into account the presupposed minimum length of each character

    Target recognition techniques for multifunction phased array radar

    Get PDF
    This thesis, submitted for the degree of Doctor of Philosophy at University College London, is a discussion and analysis of combined stepped-frequency and pulse-Doppler target recognition methods which enable a multifunction phased array radar designed for automatic surveillance and multi-target tracking to offer a Non Cooperative Target Recognition (NCTR) capability. The primary challenge is to investigate the feasibility of NCTR via the use of high range resolution profiles. Given stepped frequency waveforms effectively trade time for enhanced bandwidth, and thus resolution, attention is paid to the design of a compromise between resolution and dwell time. A secondary challenge is to investigate the additional benefits to overall target classification when the number of coherent pulses within an NCTR wavefrom is expanded to enable the extraction of spectral features which can help to differentiate particular classes of target. As with increased range resolution, the price for this extra information is a further increase in dwell time. The response to the primary and secondary challenges described above has involved the development of a number of novel techniques, which are summarized below: ā€¢ Design and execution of a series of experiments to further the understanding of multifunction phased array Radar NCTR techniques ā€¢ Development of a ā€˜Hybridā€™ stepped frequency technique which enables a significant extension of range profiles without the proportional trade in resolution as experienced with ā€˜Classicalā€™ techniques ā€¢ Development of an ā€˜end to endā€™ NCTR processing and visualization pipeline ā€¢ Use of ā€˜Doppler fractionā€™ spectral features to enable aircraft target classification via propulsion mechanism. Combination of Doppler fraction and physical length features to enable broad aircraft type classification. ā€¢ Optimization of NCTR method classification performance as a function of feature and waveform parameters. ā€¢ Generic waveform design tools to enable delivery of time costly NCTR waveforms within operational constraints. The thesis is largely based upon an analysis of experimental results obtained using the multifunction phased array radar MESAR2, based at BAE Systems on the Isle of Wight. The NCTR mode of MESAR2 consists of the transmission and reception of successive multi-pulse coherent bursts upon each target being tracked. Each burst is stepped in frequency resulting in an overall bandwidth sufficient to provide sub-metre range resolution. A sequence of experiments, (static trials, moving point target trials and full aircraft trials) are described and an analysis of the robustness of target length and Doppler spectra feature measurements from NCTR mode data recordings is presented. A recorded data archive of 1498 NCTR looks upon 17 different trials aircraft using five different varieties of stepped frequency waveform is used to determine classification performance as a function of various signal processing parameters and extent (numbers of pulses) of the data used. From analysis of the trials data, recommendations are made with regards to the design of an NCTR mode for an operational system that uses stepped frequency techniques by design choice

    Untangling hotel industryā€™s inefficiency: An SFA approach applied to a renowned Portuguese hotel chain

    Get PDF
    The present paper explores the technical efficiency of four hotels from Teixeira Duarte Group - a renowned Portuguese hotel chain. An efficiency ranking is established from these four hotel units located in Portugal using Stochastic Frontier Analysis. This methodology allows to discriminate between measurement error and systematic inefficiencies in the estimation process enabling to investigate the main inefficiency causes. Several suggestions concerning efficiency improvement are undertaken for each hotel studied.info:eu-repo/semantics/publishedVersio
    corecore