Deep reinforcement active learning for human-in-the-loop person re-identification
Most existing person re-identification (Re-ID) approaches achieve superior results based on the assumption that a large amount of pre-labelled data is available and can be used for training all at once. However, this assumption does not hold for most real-world deployments of the Re-ID task. In this work, we propose an alternative reinforcement learning based human-in-the-loop model which removes the need for pre-labelling and keeps the model updated with progressively collected data. The goal is to minimize human annotation effort while maximizing Re-ID performance. The model works in an iteratively updating framework that refines the RL policy and the CNN parameters alternately. In particular, we formulate a Deep Reinforcement Active Learning (DRAL) method to guide an agent (a model in a reinforcement learning process) in selecting training samples on-the-fly for labelling by a human user/annotator. The reinforcement learning reward is the uncertainty value of each human-selected sample. The binary feedback (positive or negative) provided by the human annotator is used to select the samples that fine-tune a pre-trained CNN Re-ID model. Extensive experiments demonstrate the superiority of our DRAL method for deep reinforcement learning based human-in-the-loop person Re-ID when compared to existing unsupervised and transfer learning models as well as active learning models.
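The selection loop described in the abstract can be sketched roughly as follows. This is an illustrative approximation, not the authors' code: the "agent" is reduced to a greedy nearest-neighbour policy over precomputed embeddings, the uncertainty is a simple nearest-neighbour margin, and the binary human feedback is simulated from ground-truth identities; the selected, labelled pairs would then be used to fine-tune the CNN Re-ID model.

```python
# Hypothetical sketch of a DRAL-style human-in-the-loop selection round (not the authors' code).
import numpy as np

rng = np.random.default_rng(0)

def uncertainty(sample_feat, gallery_feats):
    """Uncertainty as the negative margin between the two nearest gallery neighbours."""
    d = np.sort(np.linalg.norm(gallery_feats - sample_feat, axis=1))[:2]
    return -(d[1] - d[0])  # small margin -> high uncertainty -> high reward

def dral_round(query_feat, gallery_feats, gallery_ids, query_id, budget=3):
    """One annotation round: the agent picks `budget` gallery samples for a query,
    the (simulated) annotator answers positive/negative, and the reward is the
    uncertainty of each selected sample."""
    labelled, rewards = [], []
    remaining = list(range(len(gallery_feats)))
    for _ in range(budget):
        # Greedy policy stand-in: pick the remaining sample closest to the query.
        d = np.linalg.norm(gallery_feats[remaining] - query_feat, axis=1)
        pick = remaining.pop(int(np.argmin(d)))
        feedback = int(gallery_ids[pick] == query_id)      # binary human feedback (simulated)
        rewards.append(uncertainty(gallery_feats[pick], gallery_feats))
        labelled.append((pick, feedback))
    return labelled, rewards  # labelled pairs would then fine-tune the CNN Re-ID model

# Toy data: 64-dim embeddings for 20 gallery images of 5 identities.
gallery_ids = np.repeat(np.arange(5), 4)
gallery_feats = rng.normal(size=(20, 64)) + gallery_ids[:, None]
print(dral_round(rng.normal(size=64) + 2, gallery_feats, gallery_ids, query_id=2))
```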
ARTHuS: Adaptive real-time human segmentation in sports through online distillation
Semantic segmentation is a useful tool for global scene understanding in many areas, including sports, but has inherent difficulties, such as the need for pixel-wise annotated training data and the absence of well-performing real-time universal algorithms. To alleviate these issues, we sacrifice universality by developing a general method, named ARTHuS, that produces adaptive real-time game-specific networks for human segmentation in sports videos, without requiring any manual annotation. This is done through an online knowledge distillation process, in which a fast student network is trained to mimic the output of an existing slow but effective universal teacher network, while being periodically updated to adjust to the latest play conditions. As a result, ARTHuS makes it possible to build highly effective real-time human segmentation networks that evolve throughout the match and that sometimes outperform their teacher. The usefulness of producing adaptive game-specific networks and their excellent performance are demonstrated quantitatively and qualitatively for soccer and basketball games.
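A minimal sketch of the online distillation idea, under stated assumptions: both networks are tiny placeholder CNNs rather than real segmentation models, the video stream is random frames, and the deployed copy of the student is refreshed at a fixed interval; this is an illustration of the training scheme, not the ARTHuS implementation.

```python
# Minimal online-distillation sketch (illustrative, not the authors' implementation).
import torch
import torch.nn as nn
import torch.nn.functional as F

def tiny_segnet(channels):
    return nn.Sequential(
        nn.Conv2d(3, channels, 3, padding=1), nn.ReLU(),
        nn.Conv2d(channels, 1, 3, padding=1),   # 1-channel human/background logits
    )

teacher = tiny_segnet(32).eval()   # slow but accurate universal network (placeholder)
student = tiny_segnet(8)           # fast game-specific network, trained online
deployed = tiny_segnet(8)          # copy used for real-time inference
opt = torch.optim.Adam(student.parameters(), lr=1e-3)

stream = (torch.rand(1, 3, 64, 64) for _ in range(100))   # stand-in for the video stream
for t, frame in enumerate(stream):
    with torch.no_grad():
        soft_target = torch.sigmoid(teacher(frame))        # teacher mask as supervision
    loss = F.binary_cross_entropy_with_logits(student(frame), soft_target)
    opt.zero_grad(); loss.backward(); opt.step()
    if (t + 1) % 25 == 0:                                  # periodic update of the deployed network
        deployed.load_state_dict(student.state_dict())
```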
CNN-based morphological decomposition of X-ray images for details and defects contrast enhancement
This paper introduces a new learning-based framework for X-ray images that relies on a morphological decomposition of the signal into two main components, separating images into local textures and piecewise smooth (cartoon) parts. The piecewise smooth component corresponds to the spatial variation of the average density of the objects, whereas the local texture component captures the singularities of the inspected objects. Our method builds on two convolutional neural network (CNN) branches to decompose an input image into its two morphological components. This CNN is trained with synthetic data, generated by randomly picking piecewise smooth and singular patterns from a parametric dictionary, while enforcing that the sum of the CNN branch outputs approximates the identity mapping. We demonstrate the relevance of the decomposition by enhancing the local texture component relative to the piecewise smooth one. The enhanced images compare favorably to those obtained with existing methods designed to visualize High Dynamic Range (HDR) images, such as tone-mapping algorithms.
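The two-branch training scheme can be sketched as below. This is a hedged illustration under assumed simplifications: each branch is a small stand-in CNN, the "parametric dictionary" is replaced by trivially generated blurred blobs (cartoon) and high-frequency noise (texture), and the loss combines per-component supervision with the constraint that the two branch outputs sum back to the input.

```python
# Hedged sketch of a two-branch cartoon/texture decomposition network (not the paper's model).
import torch
import torch.nn as nn
import torch.nn.functional as F

def branch():
    return nn.Sequential(
        nn.Conv2d(1, 16, 3, padding=1), nn.ReLU(),
        nn.Conv2d(16, 1, 3, padding=1),
    )

cartoon_branch, texture_branch = branch(), branch()
opt = torch.optim.Adam(
    list(cartoon_branch.parameters()) + list(texture_branch.parameters()), lr=1e-3)

def synthetic_batch(n=8, size=64):
    """Stand-in for dictionary sampling: blurred blobs (cartoon) + high-frequency noise (texture)."""
    cartoon = F.avg_pool2d(torch.rand(n, 1, size, size), 9, stride=1, padding=4)
    texture = 0.1 * torch.randn(n, 1, size, size)
    return cartoon, texture

for step in range(200):
    cartoon, texture = synthetic_batch()
    x = cartoon + texture                               # observed X-ray-like image
    c_hat, t_hat = cartoon_branch(x), texture_branch(x)
    loss = (F.mse_loss(c_hat, cartoon)                  # match each morphological component
            + F.mse_loss(t_hat, texture)
            + F.mse_loss(c_hat + t_hat, x))             # branches must sum back to the input
    opt.zero_grad(); loss.backward(); opt.step()

# A contrast-enhanced image can then be formed as c_hat + alpha * t_hat with alpha > 1.
```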
