7 research outputs found
Font Representation Learning via Paired-glyph Matching
Fonts can convey profound meanings of words in various forms of glyphs.
Without typography knowledge, manually selecting an appropriate font or
designing a new font is a tedious and painful task. To allow users to explore
vast font styles and create new font styles, font retrieval and font style
transfer methods have been proposed. These tasks increase the need for learning
high-quality font representations. Therefore, we propose a novel font
representation learning scheme to embed font styles into the latent space. For
the discriminative representation of a font from others, we propose a
paired-glyph matching-based font representation learning model that attracts
the representations of glyphs in the same font to one another, but pushes away
those of other fonts. Through evaluations on font retrieval with query glyphs
on new fonts, we show our font representation learning scheme achieves better
generalization performance than the existing font representation learning
techniques. Finally on the downstream font style transfer and generation tasks,
we confirm the benefits of transfer learning with the proposed method. The
source code is available at https://github.com/junhocho/paired-glyph-matching.Comment: Accepted to BMVC202
Symmetric Graph Convolutional Autoencoder for Unsupervised Graph Representation Learning
We propose a symmetric graph convolutional autoencoder which produces a
low-dimensional latent representation from a graph. In contrast to the existing
graph autoencoders with asymmetric decoder parts, the proposed autoencoder has
a newly designed decoder which builds a completely symmetric autoencoder form.
For the reconstruction of node features, the decoder is designed based on
Laplacian sharpening as the counterpart of Laplacian smoothing of the encoder,
which allows utilizing the graph structure in the whole processes of the
proposed autoencoder architecture. In order to prevent the numerical
instability of the network caused by the Laplacian sharpening introduction, we
further propose a new numerically stable form of the Laplacian sharpening by
incorporating the signed graphs. In addition, a new cost function which finds a
latent representation and a latent affinity matrix simultaneously is devised to
boost the performance of image clustering tasks. The experimental results on
clustering, link prediction and visualization tasks strongly support that the
proposed model is stable and outperforms various state-of-the-art algorithms.Comment: 10 pages, 3 figures, ICCV 2019 accepte
High-speed High-performance Visual Tracker via Correlation Filter with Compressed Deep Feature
This paper introduces a context-aware correlation filter based tracker to achieve both high speed and high performance. We achieve high speed via deep feature compression based on a context-aware scheme utilizing multiple expert auto-encoders. To achieve high performance with the compressed feature map, we introduce extrinsic denoising processes and a new orthogonality loss term for pre-training and fine-tuning of the expert auto-encoders. In experiments, the proposed tracker is verified to achieve a comparable performance to state-of-the-art with running at over 100 fps.N
Symmetric Graph Convolutional Autoencoder for Unsupervised Graph Representation Learning
We propose a symmetric graph convolutional autoencoder which produces a low-dimensional latent representation from a graph. In contrast to the existing graph autoencoders with asymmetric decoder parts, the proposed autoencoder has a newly designed decoder which builds a completely symmetric autoencoder form. For the reconstruction of node features, the decoder is designed based on Laplacian sharpening as the counterpart of Laplacian smoothing of the encoder, which allows utilizing the graph structure in the whole processes of the proposed autoencoder architecture. In order to prevent the numerical instability of the network caused by the Laplacian sharpening introduction, we further propose a new numerically stable form of the Laplacian sharpening by incorporating the signed graphs. In addition, a new cost function which finds a latent representation and a latent affinity matrix simultaneously is devised to boost the performance of image clustering tasks. The experimental results on clustering, link prediction and visualization tasks strongly support that the proposed model is stable and outperforms various state-of-the-art algorithms.N
Motion-aware ensemble of three-mode trackers for unmanned aerial vehicles
To tackle problems arising from unexpected camera motions in unmanned aerial vehicles (UAVs), we propose a three-mode ensemble tracker where each mode specializes in distinctive situations. The proposed ensemble tracker is composed of appearance-based tracking mode, homography-based tracking mode, and momentum-based tracking mode. The appearance-based tracking mode tracks a moving object well when the UAV is nearly stopped, whereas the homography-based tracking mode shows good tracking performance under smooth UAV or object motion. The momentum-based tracking mode copes with large or abrupt motion of either the UAV or the object. We evaluate the proposed tracking scheme on a widely-used UAV123 benchmark dataset. The proposed motion-aware ensemble shows a 5.3% improvement in average precision compared to the baseline correlation filter tracker, which effectively employs deep features while achieving a tracking speed of at least 80fps in our experimental settings. In addition, the proposed method outperforms existing real-time correlation filter trackers.N