Search CORE

12,709 research outputs found

Detach and Adapt: Learning Cross-Domain Disentangled Deep Representation

Author: Chiu Wei-Chen
Fu Tzu-Chien
Liu Yen-Cheng
Wang Sheng-De
Wang Yu-Chiang Frank
Yeh Yu-Ying
Publication venue
Publication date: 01/05/2018
Field of study

While representation learning aims to derive interpretable features for describing visual data, representation disentanglement further results in such features so that particular image attributes can be identified and manipulated. However, one cannot easily address this task without observing ground truth annotation for the training data. To address this problem, we propose a novel deep learning model of Cross-Domain Representation Disentangler (CDRD). By observing fully annotated source-domain data and unlabeled target-domain data of interest, our model bridges the information across data domains and transfers the attribute information accordingly. Thus, cross-domain joint feature disentanglement and adaptation can be jointly performed. In the experiments, we provide qualitative results to verify our disentanglement capability. Moreover, we further confirm that our model can be applied for solving classification tasks of unsupervised domain adaptation, and performs favorably against state-of-the-art image disentanglement and translation methods.Comment: CVPR 2018 Spotligh

arXiv.org e-Print Archive

Action-Conditional Video Prediction using Deep Networks in Atari Games

Author: Guo Xiaoxiao
Lee Honglak
Lewis Richard
Oh Junhyuk
Singh Satinder
Publication venue
Publication date: 21/12/2015
Field of study

Motivated by vision-based reinforcement learning (RL) problems, in particular Atari games from the recent benchmark Aracade Learning Environment (ALE), we consider spatio-temporal prediction problems where future (image-)frames are dependent on control variables or actions as well as previous frames. While not composed of natural scenes, frames in Atari games are high-dimensional in size, can involve tens of objects with one or more objects being controlled by the actions directly and many other objects being influenced indirectly, can involve entry and departure of objects, and can involve deep partial observability. We propose and evaluate two deep neural network architectures that consist of encoding, action-conditional transformation, and decoding layers based on convolutional neural networks and recurrent neural networks. Experimental results show that the proposed architectures are able to generate visually-realistic frames that are also useful for control over approximately 100-step action-conditional futures in some games. To the best of our knowledge, this paper is the first to make and evaluate long-term predictions on high-dimensional video conditioned by control inputs.Comment: Published at NIPS 2015 (Advances in Neural Information Processing Systems 28

arXiv.org e-Print Archive

CiteSeerX

BlockDrop: Dynamic Inference Paths in Residual Networks

Author: Davis Larry S.
Feris Rogerio
Grauman Kristen
Kumar Abhishek
Nagarajan Tushar
Rennie Steven
Wu Zuxuan
Publication venue
Publication date: 28/01/2019
Field of study

Very deep convolutional neural networks offer excellent recognition results, yet their computational expense limits their impact for many real-world applications. We introduce BlockDrop, an approach that learns to dynamically choose which layers of a deep network to execute during inference so as to best reduce total computation without degrading prediction accuracy. Exploiting the robustness of Residual Networks (ResNets) to layer dropping, our framework selects on-the-fly which residual blocks to evaluate for a given novel image. In particular, given a pretrained ResNet, we train a policy network in an associative reinforcement learning setting for the dual reward of utilizing a minimal number of blocks while preserving recognition accuracy. We conduct extensive experiments on CIFAR and ImageNet. The results provide strong quantitative and qualitative evidence that these learned policies not only accelerate inference but also encode meaningful visual information. Built upon a ResNet-101 model, our method achieves a speedup of 20\% on average, going as high as 36\% for some images, while maintaining the same 76.4\% top-1 accuracy on ImageNet.Comment: CVPR 201

arXiv.org e-Print Archive

An Exploratory Study of Forces and Frictions affecting Large-Scale Model-Driven Development

Author: A. Forward
B. Anda
B. Dobing
F.P. Brooks
J. Hutchinson
J. Hutchinson
J.L. Bentley
J.M. Corbin
P. Mohagheghi
S. Adolph
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2012
Field of study

In this paper, we investigate model-driven engineering, reporting on an exploratory case-study conducted at a large automotive company. The study consisted of interviews with 20 engineers and managers working in different roles. We found that, in the context of a large organization, contextual forces dominate the cognitive issues of using model-driven technology. The four forces we identified that are likely independent of the particular abstractions chosen as the basis of software development are the need for diffing in software product lines, the needs for problem-specific languages and types, the need for live modeling in exploratory activities, and the need for point-to-point traceability between artifacts. We also identified triggers of accidental complexity, which we refer to as points of friction introduced by languages and tools. Examples of the friction points identified are insufficient support for model diffing, point-to-point traceability, and model changes at runtime.Comment: To appear in proceedings of MODELS 2012, LNCS Springe

arXiv.org e-Print Archive

CiteSeerX

From single steps to mass migration: the problem of scale in the movement ecology of the Serengeti wildebeest

Author: Couzin Iain D.
Hopcraft J. Grant C.
Levin Simon A.
Morrison Thomas A.
Torney Colin J.
Publication venue: 'The Royal Society'
Publication date: 04/01/2018
Field of study

A central question in ecology is how to link processes that occur over different scales. The daily interactions of individual organisms ultimately determine community dynamics, population fluctuations and the functioning of entire ecosystems. Observations of these multiscale ecological processes are constrained by various technological, biological or logistical issues, and there are often vast discrepancies between the scale at which observation is possible and the scale of the question of interest. Animal movement is characterized by processes that act over multiple spatial and temporal scales. Second-by-second decisions accumulate to produce annual movement patterns. Individuals influence, and are influenced by, collective movement decisions, which then govern the spatial distribution of populations and the connectivity of meta-populations. While the field of movement ecology is experiencing unprecedented growth in the availability of movement data, there remain challenges in integrating observations with questions of ecological interest. In this article, we present the major challenges of addressing these issues within the context of the Serengeti wildebeest migration, a keystone ecological phenomena that crosses multiple scales of space, time and biological complexity. This article is part of the theme issue ’Collective movement ecology’

Enlighten: Research Data (University of Glasgow)

Enlighten