Search CORE

16,734 research outputs found

Fusing image representations for classification using support vector machines

Author: Cherifi Hocine
Demirkesen Can
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 23/11/2009
Field of study

In order to improve classification accuracy different image representations are usually combined. This can be done by using two different fusing schemes. In feature level fusion schemes, image representations are combined before the classification process. In classifier fusion, the decisions taken separately based on individual representations are fused to make a decision. In this paper the main methods derived for both strategies are evaluated. Our experimental results show that classifier fusion performs better. Specifically Bayes belief integration is the best performing strategy for image classification task.Comment: Image and Vision Computing New Zealand, 2009. IVCNZ '09. 24th International Conference, Wellington : Nouvelle-Z\'elande (2009

arXiv.org e-Print Archive

HAL-uB

Crossref

Automatic annotation of tennis games: An integration of audio, vision, and learning

Author: Fei Yan
Josef Kittler
David Windridge
William Christmas
Krystian Mikolajczyk
Stephen Cox
Qiang Huang
Kijak
Kolonias
Huang
Yu
Ekinci
Zhu
Yan
Christmas
Yan
Kijak
Coldefy
Zhu
Lai
Hartley
Kittler
Huang
Tsochantaridis
Joachims
Altun
Taskar
Ng
Publication venue: 'Elsevier BV'
Publication date: 01/01/1999
Field of study

Fully automatic annotation of tennis game using broadcast video is a task with a great potential but with enormous challenges. In this paper we describe our approach to this task, which integrates computer vision, machine listening, and machine learning. At the low level processing, we improve upon our previously proposed state-of-the-art tennis ball tracking algorithm and employ audio signal processing techniques to detect key events and construct features for classifying the events. At high level analysis, we model event classification as a sequence labelling problem, and investigate four machine learning techniques using simulated event sequences. Finally, we evaluate our proposed approach on three real world tennis games, and discuss the interplay between audio, vision and learning. To the best of our knowledge, our system is the only one that can annotate tennis game at such a detailed level

Crossref

Middlesex University Research Repository

Institutional Repository Universiteit Antwerpen

University of East Anglia digital repository

Surrey Research Insight

Tropmed Central Antwerp

Deep Affordance-grounded Sensorimotor Object Recognition

Author: Daras Petros
Papadopoulos Georgios Th.
Potamianos Gerasimos
Thermos Spyridon
Publication venue
Publication date: 10/04/2017
Field of study

It is well-established by cognitive neuroscience that human perception of objects constitutes a complex process, where object appearance information is combined with evidence about the so-called object "affordances", namely the types of actions that humans typically perform when interacting with them. This fact has recently motivated the "sensorimotor" approach to the challenging task of automatic object recognition, where both information sources are fused to improve robustness. In this work, the aforementioned paradigm is adopted, surpassing current limitations of sensorimotor object recognition research. Specifically, the deep learning paradigm is introduced to the problem for the first time, developing a number of novel neuro-biologically and neuro-physiologically inspired architectures that utilize state-of-the-art neural networks for fusing the available information sources in multiple ways. The proposed methods are evaluated using a large RGB-D corpus, which is specifically collected for the task of sensorimotor object recognition and is made publicly available. Experimental results demonstrate the utility of affordance information to object recognition, achieving an up to 29% relative error reduction by its inclusion.Comment: 9 pages, 7 figures, dataset link included, accepted to CVPR 201

arXiv.org e-Print Archive

Crossref

PAC-Bayesian Majority Vote for Late Classifier Fusion

Author: Ayache Stéphane
Habrard Amaury
Morvant Emilie
Publication venue
Publication date: 01/01/2012
Field of study

A lot of attention has been devoted to multimedia indexing over the past few years. In the literature, we often consider two kinds of fusion schemes: The early fusion and the late fusion. In this paper we focus on late classifier fusion, where one combines the scores of each modality at the decision level. To tackle this problem, we investigate a recent and elegant well-founded quadratic program named MinCq coming from the Machine Learning PAC-Bayes theory. MinCq looks for the weighted combination, over a set of real-valued functions seen as voters, leading to the lowest misclassification rate, while making use of the voters' diversity. We provide evidence that this method is naturally adapted to late fusion procedure. We propose an extension of MinCq by adding an order- preserving pairwise loss for ranking, helping to improve Mean Averaged Precision measure. We confirm the good behavior of the MinCq-based fusion approaches with experiments on a real image benchmark.Comment: 7 pages, Research repor

arXiv.org e-Print Archive

HAL-UJM

HAL AMU