A statistical multiresolution approach for face recognition using structural hidden Markov models

Author: Amira A
Bouchaffra D
Nicholl P
Perrott R H
Publication venue: 'Hindawi Limited'
Publication date: 01/01/2007

This paper introduces a novel methodology that combines the multiresolution feature of the discrete wavelet transform (DWT) with the local interactions of the facial structures expressed through the structural hidden Markov model (SHMM). A range of wavelet filters, such as Haar, biorthogonal 9/7, and Coiflet, as well as Gabor, have been implemented in order to search for the best performance. SHMMs perform a thorough probabilistic analysis of any sequential pattern by revealing both its inner and outer structures simultaneously. Unlike traditional HMMs, SHMMs do not assume state-conditional independence of the visible observation sequence. This is achieved via the concept of local structures introduced by the SHMMs. Therefore, the long-range dependency problem inherent to traditional HMMs has been drastically reduced. SHMMs have not previously been applied to the problem of face identification. The results reported in this application have shown that SHMM outperforms the traditional hidden Markov model with a 73% increase in accuracy.
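
The multiresolution idea mentioned in the abstract above can be illustrated with a minimal sketch of the Haar transform, the simplest of the wavelet filters it lists. This is not the authors' pipeline: it works on a 1D sequence rather than a face image, and the function names (`haar_step`, `haar_multires`, `haar_inverse_step`) are hypothetical names chosen for this illustration.

```python
import math

def haar_step(signal):
    """One level of the orthonormal Haar DWT on an even-length sequence."""
    if len(signal) % 2 != 0:
        raise ValueError("signal length must be even")
    s = math.sqrt(2.0)
    # Pairwise scaled sums give the coarse approximation,
    # pairwise scaled differences give the detail coefficients.
    approx = [(signal[i] + signal[i + 1]) / s for i in range(0, len(signal), 2)]
    detail = [(signal[i] - signal[i + 1]) / s for i in range(0, len(signal), 2)]
    return approx, detail

def haar_multires(signal, levels):
    """Return (coarsest approximation, list of details from finest to coarsest)."""
    approx = list(signal)
    details = []
    for _ in range(levels):
        approx, detail = haar_step(approx)
        details.append(detail)
    return approx, details

def haar_inverse_step(approx, detail):
    """Invert one Haar level, recovering the finer-resolution sequence."""
    s = math.sqrt(2.0)
    out = []
    for a, d in zip(approx, detail):
        out.append((a + d) / s)
        out.append((a - d) / s)
    return out

signal = [4, 6, 10, 12, 8, 6, 5, 5]
approx, details = haar_multires(signal, levels=2)
# approx is approximately [16.0, 12.0]: a two-sample summary of the signal.
```

Each level halves the resolution while keeping the discarded detail separately, so the original sequence can be rebuilt by applying `haar_inverse_step` from the coarsest level back to the finest.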

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

Brunel University Research Archive

Fusion of SPOT5 multispectral and Ikonos panchromatic images

Author: Alonso Reyes R.
Fernandez S.
Melgar M.
Ranchin Thierry
Thomas Claire
Wald Lucien
Publication venue: Millpress
Publication date: 25/05/2004

The supply of images with high spectral and high spatial resolution has grown in the last decade. It is now possible to obtain data from different sources with different spatial and spectral resolutions. The field of data fusion of remotely sensed data has also grown very fast in recent years. In this paper, an algorithm for merging SPOT 5 images and Ikonos images is proposed. This algorithm is based on the ARSIS concept and presents an implementation for a ratio of spatial resolution equal to 10. The ARSIS concept is first detailed. Then, the way of defining a new implementation based on this concept is presented, showing how to define new implementations and to develop new solutions based on this concept. The proposed algorithm is developed, describing the different steps for building a fused product from SPOT 5 multispectral data at 10 m and from Ikonos panchromatic data at 1 m. Some other methods are proposed. The quality of the different methods is evaluated using a set of quantitative quality parameters. The visual quality of the products is evaluated by a set of interpreters. Conclusions are drawn on the quality of the proposed products.

HAL-MINES ParisTech

Analysis, estimation and control for perturbed and singular systems for systems subject to discrete events.

Author
Publication venue: Laboratory for Information and Decision Systems, Massachusetts Institute of Technology
Publication date: 01/01/1991

"The principle investigator for this effort is Professor Alan S. Willsky, and Professor George C. Verghese is co-principal investigator."--P. [3].Includes bibliographical references (p. [20]-[25]).Final technical report for grant AFOSR-88-0032.Supported by the AFOSR. AFOSR-88-003

DSpace@MIT

Probabilistic modeling and statistical inference for random fields and space-time processes

Author
Publication venue: Massachusetts Institute of Technology, Laboratory for Information and Decision Systems
Publication date: 01/01/1995

Author from publisher's list. Cover title. Final report for ONR Grant N00014-91-J-100

DSpace@MIT

AN OVERVIEW OF IMAGE SEGMENTATION ALGORITHMS

Author: A.KHAIRE PUSHPAJIT
THAKUR NILESHSINGH V.
Publication venue: Institute for Project Management Pvt. Ltd
Publication date: 24/08/2020

Image segmentation remains a challenging problem even after four decades of research. Research on image segmentation is currently conducted at three levels: development of image segmentation methods, evaluation of segmentation algorithms and their performance, and study of these evaluation methods. Hundreds of techniques have been proposed for segmentation of natural images, noisy images, medical images, etc. Currently, most researchers evaluate segmentation algorithms using ground-truth evaluation on Berkeley segmentation database (BSD) images. In this paper, an overview of various segmentation algorithms is presented. The discussion is mainly based on the soft computing approaches used for segmentation of noise-free and noisy images and the parameters used for evaluating these algorithms. The techniques covered include the Markov Random Field (MRF) model, Neural Network, Clustering, Particle Swarm Optimization, the Fuzzy Logic approach, and different combinations of these soft techniques.

Interscience Research Network

Audio-visual football video analysis, from structure detection to attention analysis

Author: Ren Reede
Publication venue
Publication date: 01/01/2008

Sports video is an important video genre. Content-based sports video analysis attracts great interest from both industry and academic fields. A sports video is characterised by repetitive temporal structures, relatively plain contents, and strong spatio-temporal variations, such as quick camera switches and swift local motions. It is necessary to develop specific techniques for content-based sports video analysis to utilise these characteristics. For an efficient and effective sports video analysis system, there are three fundamental questions: (1) what are key stories for sports videos; (2) what attracts viewers’ interest; and (3) how to identify game highlights. This thesis is developed around these questions. We approached these questions from two different perspectives and in turn three research contributions are presented, namely, replay detection, attack temporal structure decomposition, and attention-based highlight identification. Replay segments convey the most important contents in sports videos. It is an efficient approach to collect game highlights by detecting replay segments. However, replay is an artefact of editing, which improves with advances in video editing tools. The composition of replay is complex, which includes logo transitions, slow motions, viewpoint switches and normal speed video clips. Since logo transition clips are pervasive in game collections of FIFA World Cup 2002, FIFA World Cup 2006 and UEFA Championship 2006, we take logo transition detection as an effective replacement of replay detection. A two-pass system was developed, including a five-layer AdaBoost classifier and a logo template matching throughout an entire video. The five-layer AdaBoost utilises shot duration, average game pitch ratio, average motion, sequential colour histogram and shot frequency between two neighbouring logo transitions, to filter out logo transition candidates. Subsequently, a logo template is constructed and employed to find all transition logo sequences.
The precision and recall of this system in replay detection are 100% in a five-game evaluation collection. An attack structure is a team competition for a score. Hence, this structure is a conceptually fundamental unit of a football video as well as other sports videos. We review the literature of content-based temporal structures, such as play-break structure, and develop a three-step system for automatic attack structure decomposition. Four content-based shot classes, namely, play, focus, replay and break, were identified by low level visual features. A four-state hidden Markov model was trained to simulate transition processes among these shot classes. Since attack structures are the longest repetitive temporal unit in a sports video, a suffix tree is proposed to find the longest repetitive substring in the label sequence of shot class transitions. The occurrences of this substring are regarded as a kernel of an attack hidden Markov process. Therefore, the decomposition of attack structure becomes a boundary likelihood comparison between two Markov chains. Highlights are what attract notice. Attention is a psychological measurement of “notice”. A brief survey of attention psychological background, attention estimation from visual and auditory cues, and multiple modality attention fusion is presented. We propose two attention models for sports video analysis, namely, the role-based attention model and the multiresolution autoregressive framework. The role-based attention model is based on the perception structure during watching video. This model removes reflection bias among modality salient signals and combines these signals by reflectors. The multiresolution autoregressive framework (MAR) treats salient signals as a group of smooth random processes, which follow a similar trend but are filled with noise. This framework tries to estimate a noise-less signal from these coarse noisy observations by a multiple resolution analysis.
Related algorithms are developed, such as event segmentation on a MAR tree and real time event detection. The experiments show that these attention-based approaches can find goal events with high precision. Moreover, results of MAR-based highlight detection on the final game of FIFA 2002 and 2006 are highly similar to professionally labelled highlights by BBC and FIFA.
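
The abstract above describes finding the longest repetitive substring in a sequence of shot-class labels. The thesis proposes a suffix tree for this; the sketch below swaps in a simpler sorted-suffix comparison that gives the same answer on short sequences, at a higher asymptotic cost. The single-letter label encoding (P = play, F = focus, R = replay, B = break), the function name, and the choice to allow overlapping occurrences are assumptions made for illustration only.

```python
def longest_repeated_substring(labels):
    """Return the longest substring that occurs at least twice in `labels`.

    Occurrences may overlap (an assumption of this sketch). Sorting the
    suffixes puts the pair sharing the longest common prefix next to each
    other, so comparing neighbours in sorted order is sufficient.
    """
    n = len(labels)
    suffix_starts = sorted(range(n), key=lambda i: labels[i:])
    best = ""
    for a, b in zip(suffix_starts, suffix_starts[1:]):
        # Length of the common prefix of the two neighbouring suffixes.
        k = 0
        while a + k < n and b + k < n and labels[a + k] == labels[b + k]:
            k += 1
        if k > len(best):
            best = labels[a:a + k]
    return best

# Hypothetical shot-class label sequence for one video segment.
shot_labels = "PFRBPFRBPB"
kernel = longest_repeated_substring(shot_labels)
```

For sequences long enough that sorting full suffixes becomes expensive, a real suffix tree or suffix array with an LCP table would be the appropriate replacement.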

Glasgow Theses Service

CiteSeerX

OpenGrey Repository