364 research outputs found
Read, Watch, and Move: Reinforcement Learning for Temporally Grounding Natural Language Descriptions in Videos
The task of video grounding, which temporally localizes a natural language
description in a video, plays an important role in understanding videos.
Existing studies have adopted strategies of sliding window over the entire
video or exhaustively ranking all possible clip-sentence pairs in a
pre-segmented video, which inevitably suffer from exhaustively enumerated
candidates. To alleviate this problem, we formulate this task as a problem of
sequential decision making by learning an agent which regulates the temporal
grounding boundaries progressively based on its policy. Specifically, we
propose a reinforcement learning based framework improved by multi-task
learning and it shows steady performance gains by considering additional
supervised boundary information during training. Our proposed framework
achieves state-of-the-art performance on ActivityNet'18 DenseCaption dataset
and Charades-STA dataset while observing only 10 or less clips per video.Comment: AAAI 201
Multi-kernel Correntropy Regression: Robustness, Optimality, and Application on Magnetometer Calibration
This paper investigates the robustness and optimality of the multi-kernel
correntropy (MKC) on linear regression. We first derive an upper error bound
for a scalar regression problem in the presence of arbitrarily large outliers
and reveal that the kernel bandwidth should be neither too small nor too big in
the sense of the lowest upper error bound. Meanwhile, we find that the proposed
MKC is related to a specific heavy-tail distribution, and the level of the
heavy tail is controlled by the kernel bandwidth solely. Interestingly, this
distribution becomes the Gaussian distribution when the bandwidth is set to be
infinite, which allows one to tackle both Gaussian and non-Gaussian problems.
We propose an expectation-maximization (EM) algorithm to estimate the parameter
vectors and explore the kernel bandwidths alternatively. The results show that
our algorithm is equivalent to the traditional linear regression under Gaussian
noise and outperforms the conventional method under heavy-tailed noise. Both
numerical simulations and experiments on a magnetometer calibration application
verify the effectiveness of the proposed method
- …