128 research outputs found
SKFlow: Learning Optical Flow with Super Kernels
Optical flow estimation is a classical yet challenging task in computer
vision. One of the essential factors in accurately predicting optical flow is
to alleviate occlusions between frames. However, it is still a thorny problem
for current top-performing optical flow estimation methods due to insufficient
local evidence to model occluded areas. In this paper, we propose the Super
Kernel Flow Network (SKFlow), a CNN architecture to ameliorate the impacts of
occlusions on optical flow estimation. SKFlow benefits from the super kernels
which bring enlarged receptive fields to complement the absent matching
information and recover the occluded motions. We present efficient super kernel
designs by utilizing conical connections and hybrid depth-wise convolutions.
Extensive experiments demonstrate the effectiveness of SKFlow on multiple
benchmarks, especially in the occluded areas. Without pre-trained backbones on
ImageNet and with a modest increase in computation, SKFlow achieves compelling
performance and ranks among currently published methods on the
Sintel benchmark. On the challenging Sintel clean and final passes (test),
SKFlow surpasses the best-published result in the unmatched areas ( and
) by and . The code is available at
\href{https://github.com/littlespray/SKFlow}{https://github.com/littlespray/SKFlow}.Comment: Accepted to NeurIPS 202
SSD-GAN: Measuring the Realness in the Spatial and Spectral Domains
This paper observes that there is an issue of high frequencies missing in the
discriminator of standard GAN, and we reveal it stems from downsampling layers
employed in the network architecture. This issue makes the generator lack the
incentive from the discriminator to learn high-frequency content of data,
resulting in a significant spectrum discrepancy between generated images and
real images. Since the Fourier transform is a bijective mapping, we argue that
reducing this spectrum discrepancy would boost the performance of GANs. To this
end, we introduce SSD-GAN, an enhancement of GANs to alleviate the spectral
information loss in the discriminator. Specifically, we propose to embed a
frequency-aware classifier into the discriminator to measure the realness of
the input in both the spatial and spectral domains. With the enhanced
discriminator, the generator of SSD-GAN is encouraged to learn high-frequency
content of real data and generate exact details. The proposed method is general
and can be easily integrated into most existing GANs framework without
excessive cost. The effectiveness of SSD-GAN is validated on various network
architectures, objective functions, and datasets. Code will be available at
https://github.com/cyq373/SSD-GAN.Comment: Accepted to AAAI 2021. Code: https://github.com/cyq373/SSD-GA
- …