Search CORE

178 research outputs found

Compact color texture descriptor based on rank transform and product ordering in the RGB color space

Author: Bianconi F
Fernandez A
IEEE
Lima D
Smeraldi F
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 02/01/2018
Field of study

Robust recognition and segmentation of human actions using HMMs with missing observations

Author: Bui Hung H.
Peursum Patrick
Venkatesh Svetha
West Geoff
Publication venue: 'Hindawi Limited'
Publication date: 01/01/2005
Field of study

This paper describes the integration of missing observation data with hidden Markov models to create a framework that is able to segment and classify individual actions from a stream of human motion using an incomplete 3D human pose estimation. Based on this framework, a model is trained to automatically segment and classify an activity sequence into its constituent subactions during inferencing. This is achieved by introducing action labels into the observation vector and setting these labels as missing data during inferencing, thus forcing the system to infer the probability of each action label. Additionally, missing data provides recognition-level support for occlusions and imperfect silhouette segmentation, permitting the use of a fast (real-time) pose estimation that delegates the burden of handling undetected limbs onto the action recognition system. Findings show that the use of missing data to segment activities is an accurate and elegant approach. Furthermore, action recognition can be accurate even when almost half of the pose feature data is missing due to occlusions, since not all of the pose data is important all of the time

Deakin Research Online

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

Improving End-to-End Text Image Translation From the Auxiliary Text Translation Task

Author: Han Xu
Ma Cong
Tu Mei
Wu Linghui
Zhang Yaping
Zhao Yang
Zhou Yu
Publication venue
Publication date: 07/10/2022
Field of study

End-to-end text image translation (TIT), which aims at translating the source language embedded in images to the target language, has attracted intensive attention in recent research. However, data sparsity limits the performance of end-to-end text image translation. Multi-task learning is a non-trivial way to alleviate this problem via exploring knowledge from complementary related tasks. In this paper, we propose a novel text translation enhanced text image translation, which trains the end-to-end model with text translation as an auxiliary task. By sharing model parameters and multi-task training, our model is able to take full advantage of easily-available large-scale text parallel corpus. Extensive experimental results show our proposed method outperforms existing end-to-end methods, and the joint multi-task learning with both text translation and recognition tasks achieves better results, proving translation and recognition auxiliary tasks are complementary.Comment: Accepted at the 26TH International Conference on Pattern Recognition (ICPR 2022

arXiv.org e-Print Archive

Optimal Features Subset Selection and Classification for Iris Recognition

Author
Publication venue: Springer
Publication date
Field of study

Springer - Publisher Connector

Video enhancement using adaptive spatio-temporal connective filter and piecewise mapping

Author: A Polesel
C Tomasi
D Barash
D-C Chang
E Abreu
EP Bennett
EP Bennett
EW Dijkstra
F Durand
G Pok
G Zhu
J Portilla
JC Brailean
JJ Francis
JK Aggarwal
K Jostschulte
KH Goh
M Bister
MB Alp
P Perona
R Garnett
R van den Boomgaard
RC Gonzalez
RC Hardie
S Jackson
S Peng
S Roth
S Schulte
SH Lee
V Kober
W-Y Han
Z Chen
Publication venue: 'Hindawi Limited'
Publication date: 01/01/2008
Field of study

This paper presents a novel video enhancement system based on an adaptive spatio-temporal connective (ASTC) noise filter and an adaptive piecewise mapping function (APMF). For ill-exposed videos or those with much noise, we first introduce a novel local image statistic to identify impulse noise pixels, and then incorporate it into the classical bilateral filter to form ASTC, aiming to reduce the mixture of the most two common types of noises - Gaussian and impulse noises in spatial and temporal directions. After noise removal, we enhance the video contrast with APMF based on the statistical information of frame segmentation results. The experiment results demonstrate that, for diverse low-quality videos corrupted by mixed noise, underexposure, overexposure, or any mixture of the above, the proposed system can automatically produce satisfactory results

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

University of Bedfordshire Repository

Toward An Efficient Fingerprint Classification

Author: Ali Ismail Awad
Kensuke Baba
Publication venue: 'IntechOpen'
Publication date: 04/04/2011
Field of study

IntechOpen

EURASIP Journal on Applied Signal Processing 2005:13, 2110–2126 c ○ 2005 Hindawi Publishing Corporation Robust Recognition and Segmentation of Human Actions Using HMMs with Missing Observations

Author: Hung H. Bui
Patrick Peursum
Svetha Venkatesh
Publication venue
Publication date
Field of study

This paper describes the integration of missing observation data with hidden Markov models to create a framework that is able to segment and classify individual actions from a stream of human motion using an incomplete 3D human pose estimation. Based on this framework, a model is trained to automatically segment and classify an activity sequence into its constituent subactions during inferencing. This is achieved by introducing action labels into the observation vector and setting these labels as missing data during inferencing, thus forcing the system to infer the probability of each action label. Additionally, missing data provides recognitionlevel support for occlusions and imperfect silhouette segmentation, permitting the use of a fast (real-time) pose estimation that delegates the burden of handling undetected limbs onto the action recognition system. Findings show that the use of missing data to segment activities is an accurate and elegant approach. Furthermore, action recognition can be accurate even when almost half of the pose feature data is missing due to occlusions, since not all of the pose data is important all of the time

CiteSeerX