9 research outputs found
A language-independent, openvocabulary system based on HMMs for recognition of ultra low resolution words
Abstract: In this paper, we introduce and evaluate a system capable of recognizing words extracted from ultra low resolution images such as those frequently embedded on web pages. The design of the system has been driven by the following constraints. First, the system has to recognize small font sizes between 6-12 points where anti-aliasing and resampling filters are applied. Such procedures add noise between adjacent characters in the words and complicate any a priori segmentation of the characters. Second, the system has to be able to recognize any words in an open vocabulary setting, potentially mixing different languages in Latin alphabet. Finally, the training procedure must be automatic, i.e. without requesting to extract, segment and label manually a large set of data. These constraints led us to an architecture based on ergodic HMMs where states are associated to the characters. We also introduce several improvements of the performance increasing the order of the emission probability estimators, including minimum and maximum width constraints on the character models and a training set consisting all possible adjacency cases of Latin characters. The proposed system is evaluated on different font sizes and families, showing good robustness for sizes down to 6 points
A language-independent, openvocabulary system based on HMMs for recognition of ultra low resolution words
ABSTRACT In this paper, we introduce and evaluate a system capable of recognizing ultra low resolution words extracted from images such as those frequently embedded on web pages. The design of the system has been driven by the following constraints. First, the system has to recognize small font sizes where antialiasing and resampling procedures have been applied. Such procedures add noise on the patterns and complicate any a priori segmentation of the characters. Second, the system has to be able to recognize any words in an open vocabulary setting, potentially mixing different languages. Finally, the training procedure must be automatic, i.e. without requesting to extract, segment and label manually a large set of data. These constraints led us to an architecture based on ergodic HMMs where states are associated to the characters. We also introduce several improvements of the performance increasing the order of the emission probability estimators and including minimum and maximum duration constraints on the character models. The proposed system is evaluated on different font sizes and families, showing good robustness for sizes down to 6 points
Text Detection and Recognition for Person Identification in Video
Demo: cluster 1International audienceThis article presents a demo of person search in audiovisual broadcast using only the text available in a video and in resources external to the video. We also present the different steps used to recognize characters in video for multi-modal person recognition systems. Text detection is realized using the text features (texture, color, contrast, geometry, temporal information). The text recognition itself is performed by the Google Tesseract free software. The method was successfully evaluated on a broadcast news corpus that contains 59 videos from the France 2 French TV channel
A HMM-Based Approach to Recognize Ultra Low Resolution Anti-Aliased Words
Abstract. In this paper, we present a HMM based system that is used to recognize ultra low resolution text such as those frequently embedded in images available on the web. We propose a system that takes specifically the challenges of recognizing text in ultra low resolution images into account. In addition to this, we show in this paper that word models can be advantageously built connecting together sub-HMM-character models and inter-character state. Finally we report on the promising performance of the system using HMM topologies which have been improved to take into account the presupposed minimum length of each character
Target recognition techniques for multifunction phased array radar
This thesis, submitted for the degree of Doctor of Philosophy at University College London, is a
discussion and analysis of combined stepped-frequency and pulse-Doppler target recognition methods
which enable a multifunction phased array radar designed for automatic surveillance and multi-target
tracking to offer a Non Cooperative Target Recognition (NCTR) capability. The primary challenge
is to investigate the feasibility of NCTR via the use of high range resolution profiles. Given stepped
frequency waveforms effectively trade time for enhanced bandwidth, and thus resolution, attention is
paid to the design of a compromise between resolution and dwell time. A secondary challenge is to
investigate the additional benefits to overall target classification when the number of coherent pulses
within an NCTR wavefrom is expanded to enable the extraction of spectral features which can help
to differentiate particular classes of target. As with increased range resolution, the price for this extra
information is a further increase in dwell time. The response to the primary and secondary challenges
described above has involved the development of a number of novel techniques, which are summarized
below:
ā¢ Design and execution of a series of experiments to further the understanding of multifunction
phased array Radar NCTR techniques
ā¢ Development of a āHybridā stepped frequency technique which enables a significant extension
of range profiles without the proportional trade in resolution as experienced with āClassicalā
techniques
ā¢ Development of an āend to endā NCTR processing and visualization pipeline
ā¢ Use of āDoppler fractionā spectral features to enable aircraft target classification via propulsion
mechanism. Combination of Doppler fraction and physical length features to enable broad
aircraft type classification.
ā¢ Optimization of NCTR method classification performance as a function of feature and waveform
parameters.
ā¢ Generic waveform design tools to enable delivery of time costly NCTR waveforms within operational
constraints.
The thesis is largely based upon an analysis of experimental results obtained using the multifunction
phased array radar MESAR2, based at BAE Systems on the Isle of Wight. The NCTR
mode of MESAR2 consists of the transmission and reception of successive multi-pulse coherent bursts
upon each target being tracked. Each burst is stepped in frequency resulting in an overall bandwidth
sufficient to provide sub-metre range resolution. A sequence of experiments, (static trials, moving
point target trials and full aircraft trials) are described and an analysis of the robustness of target
length and Doppler spectra feature measurements from NCTR mode data recordings is presented. A
recorded data archive of 1498 NCTR looks upon 17 different trials aircraft using five different varieties
of stepped frequency waveform is used to determine classification performance as a function of
various signal processing parameters and extent (numbers of pulses) of the data used. From analysis
of the trials data, recommendations are made with regards to the design of an NCTR mode for an
operational system that uses stepped frequency techniques by design choice
Untangling hotel industryās inefficiency: An SFA approach applied to a renowned Portuguese hotel chain
The present paper explores the technical efficiency of four hotels from Teixeira Duarte Group - a renowned Portuguese hotel chain. An efficiency ranking is established from these four hotel units located in Portugal using Stochastic Frontier Analysis. This methodology allows to discriminate between measurement error and systematic inefficiencies in the estimation process enabling to investigate the main inefficiency causes. Several suggestions concerning efficiency improvement are undertaken for each hotel studied.info:eu-repo/semantics/publishedVersio