3 research outputs found

A unified framework for spatiotemporal salient region detection

Author: A Bruhn
A Bur
A Yeh
Bo Wu
C Guo
C Jung
D Gao
F Meng
H Li
H Seo
J Li
J You
K Fukuchi
K Liu
L Itti
L Xu
Liaoyuan Zeng
Linfeng Xu
M Cerf
M Cheng
O Le Meur
R Achanta
R Kountchev
S Frintrop
T Brox
T Judd
T Liu
V Courboulay
V Mahadevan
W Cheng
W Freeman
W Kim
W Luo
X Hou
Y Zhai
Yan Wang
Zhengning Wang
Publication venue: 'Springer Science and Business Media LLC'

Removing label ambiguity in learning-based visual saliency estimation

Author: Gao Wen
Li Jia
Xu Dong
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2011

Visual saliency is a useful cue for identifying visually important image and video content in many multimedia applications. A feasible approach to visual saliency estimation is to learn a “feature-saliency” mapping model from user data obtained through manual labeling or eye-tracking devices. However, label ambiguities may arise when the user data are inaccurate or inadequate. To handle such noisy training data, we propose a multi-instance learning-to-rank approach for visual saliency estimation. In our approach, the correlations between image patches are incorporated into an ordinal regression framework. By iteratively refining a ranking model and relabeling the image patches with respect to their mutual correlations, the label ambiguities can be effectively removed from the training data. Consequently, the ranking model can estimate visual saliency effectively, popping out real targets and suppressing real distractors. Extensive experiments on two public image data sets show that our approach substantially outperforms 11 state-of-the-art methods in visual saliency estimation.

DR-NTU (Digital Repository of NTU)

Multimodal Computational Attention for Scene Understanding

Author: Schauerte Boris
Publication venue: KIT-Bibliothek, Karlsruhe
Publication date: 01/01/2014

Robotic systems have limited computational capacities. Hence, computational attention models are important for focusing on specific stimuli and allowing for complex cognitive processing. For this purpose, we developed auditory and visual attention models that enable robotic platforms to efficiently explore and analyze natural scenes. To allow for attention guidance in human-robot interaction, we use machine learning to integrate the influence of verbal and non-verbal social signals into our models.

KITopen