Learning Multimodal Latent Attributes

Fu, Y; Gong, S; Hospedales, TM; Xiang, T

research

Learning Multimodal Latent Attributes

Authors: Y Fu
S Gong
TM Hospedales
T Xiang
Publication date: 1 February 2014
Publisher: 'Institute of Electrical and Electronics Engineers (IEEE)'
Doi

Abstract

Abstract—The rapid development of social media sharing has created a huge demand for automatic media classification and annotation techniques. Attribute learning has emerged as a promising paradigm for bridging the semantic gap and addressing data sparsity via transferring attribute knowledge in object recognition and relatively simple action classification. In this paper, we address the task of attribute learning for understanding multimedia data with sparse and incomplete labels. In particular we focus on videos of social group activities, which are particularly challenging and topical examples of this task because of their multi-modal content and complex and unstructured nature relative to the density of annotations. To solve this problem, we (1) introduce a concept of semi-latent attribute space, expressing user-defined and latent attributes in a unified framework, and (2) propose a novel scalable probabilistic topic model for learning multi-modal semi-latent attributes, which dramatically reduces requirements for an exhaustive accurate attribute ontology and expensive annotation effort. We show that our framework is able to exploit latent attributes to outperform contemporary approaches for addressing a variety of realistic multimedia sparse data learning tasks including: multi-task learning, learning with label noise, N-shot transfer learning and importantly zero-shot learning

Similar works

Full text

Open in the Core reader

Download PDF

Available Versions

Supporting member

Queen Mary Research Online

oai:qmro.qmul.ac.uk:123456789/...

Last time updated on 05/04/2016

CiteSeerX

oai:CiteSeerX.psu:10.1.1.638.3...

Last time updated on 29/10/2017