Search CORE

134 research outputs found

Learning continuity for image and video recognition

Author: Zhao J.
Publication venue
Publication date: 01/01/2022
Field of study

International Migration, Integration and Social Cohesion online publications

Discovery of Visual Semantics by Unsupervised and Self-Supervised Representation Learning

Author: Larsson Gustav
Publication venue
Publication date: 19/08/2017
Field of study

The success of deep learning in computer vision is rooted in the ability of deep networks to scale up model complexity as demanded by challenging visual tasks. As complexity is increased, so is the need for large amounts of labeled data to train the model. This is associated with a costly human annotation effort. To address this concern, with the long-term goal of leveraging the abundance of cheap unlabeled data, we explore methods of unsupervised "pre-training." In particular, we propose to use self-supervised automatic image colorization. We show that traditional methods for unsupervised learning, such as layer-wise clustering or autoencoders, remain inferior to supervised pre-training. In search for an alternative, we develop a fully automatic image colorization method. Our method sets a new state-of-the-art in revitalizing old black-and-white photography, without requiring human effort or expertise. Additionally, it gives us a method for self-supervised representation learning. In order for the model to appropriately re-color a grayscale object, it must first be able to identify it. This ability, learned entirely self-supervised, can be used to improve other visual tasks, such as classification and semantic segmentation. As a future direction for self-supervision, we investigate if multiple proxy tasks can be combined to improve generalization. This turns out to be a challenging open problem. We hope that our contributions to this endeavor will provide a foundation for future efforts in making self-supervision compete with supervised pre-training.Comment: Ph.D. thesi

arXiv.org e-Print Archive

Knowledge UChicago

Learning continuity for image and video recognition

Author: Zhao J.
Publication venue
Publication date: 01/01/2022
Field of study

International Migration, Integration and Social Cohesion online publications

Deep Geometrized Cartoon Line Inbetweening

Author: Ding Henghui
Gu Tianpei
Liu Ziwei
Loy Chen Change
Siyao Li
Xiao Weiye
Publication venue
Publication date: 28/09/2023
Field of study

We aim to address a significant but understudied problem in the anime industry, namely the inbetweening of cartoon line drawings. Inbetweening involves generating intermediate frames between two black-and-white line drawings and is a time-consuming and expensive process that can benefit from automation. However, existing frame interpolation methods that rely on matching and warping whole raster images are unsuitable for line inbetweening and often produce blurring artifacts that damage the intricate line structures. To preserve the precision and detail of the line drawings, we propose a new approach, AnimeInbet, which geometrizes raster line drawings into graphs of endpoints and reframes the inbetweening task as a graph fusion problem with vertex repositioning. Our method can effectively capture the sparsity and unique structure of line drawings while preserving the details during inbetweening. This is made possible via our novel modules, i.e., vertex geometric embedding, a vertex correspondence Transformer, an effective mechanism for vertex repositioning and a visibility predictor. To train our method, we introduce MixamoLine240, a new dataset of line drawings with ground truth vectorization and matching labels. Our experiments demonstrate that AnimeInbet synthesizes high-quality, clean, and complete intermediate line drawings, outperforming existing methods quantitatively and qualitatively, especially in cases with large motions. Data and code are available at https://github.com/lisiyao21/AnimeInbet.Comment: ICCV 202

arXiv.org e-Print Archive

A Variational Aggregation Framework for Patch-Based Optical Flow Estimation

Author: A Ayvaci
A Bruhn
A Bugeau
A Chambolle
A Criminisi
AN Stein
B Horn
C Barnes
C Fermüller
Charles Kervrann
CR Vogel
D Cremers
D Fortun
D Sun
Denis Fortun
DJ Fleet
E Mémin
F Heitz
F Pierre
H Nagel
H Zimmer
J Barron
J Bigun
J Kybic
J Odobez
J Salmon
J Yang
JY Bouguet
K He
L Xu
M Black
M Mohamed
M Mozerov
MJ Black
N Komodakis
N Papadakis
P Arias
Patrick Bouthemy
PM Jodoin
S Baker
S Boyd
S Ince
T Brox
T Corpetti
TF Chan
W Dong
W Enkelmann
Y Chen
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref