Search CORE

7 research outputs found

Image to Image Translation for Domain Adaptation

Author: Kim Kyungnam
Kolouri Soheil
Kriegman David
Murez Zak
Ramamoorthi Ravi
Publication venue
Publication date: 01/12/2017
Field of study

We propose a general framework for unsupervised domain adaptation, which allows deep neural networks trained on a source domain to be tested on a different target domain without requiring any training annotations in the target domain. This is achieved by adding extra networks and losses that help regularize the features extracted by the backbone encoder network. To this end we propose the novel use of the recently proposed unpaired image-toimage translation framework to constrain the features extracted by the encoder network. Specifically, we require that the features extracted are able to reconstruct the images in both domains. In addition we require that the distribution of features extracted from images in the two domains are indistinguishable. Many recent works can be seen as specific cases of our general framework. We apply our method for domain adaptation between MNIST, USPS, and SVHN datasets, and Amazon, Webcam and DSLR Office datasets in classification tasks, and also between GTA5 and Cityscapes datasets for a segmentation task. We demonstrate state of the art performance on each of these datasets

arXiv.org e-Print Archive

Crossref

Linking vision and motion for self-supervised object-centric perception

Author: Badrinarayanan Vijay
Burgess Christopher P.
Kendall Alex
Murez Zak
Shotton Jamie
Stocking Kaylene C.
Tomlin Claire
Publication venue
Publication date: 14/07/2023
Field of study

Object-centric representations enable autonomous driving algorithms to reason about interactions between many independent agents and scene features. Traditionally these representations have been obtained via supervised learning, but this decouples perception from the downstream driving task and could harm generalization. In this work we adapt a self-supervised object-centric vision model to perform object decomposition using only RGB video and the pose of the vehicle as inputs. We demonstrate that our method obtains promising results on the Waymo Open perception dataset. While object mask quality lags behind supervised methods or alternatives that use more privileged information, we find that our model is capable of learning a representation that fuses multiple camera viewpoints over time and successfully tracks many vehicles and pedestrians in the dataset. Code for our model is available at https://github.com/wayveai/SOCS.Comment: Presented at the CVPR 2023 Vision-Centric Autonomous Driving worksho

arXiv.org e-Print Archive

Photometric Stereo in a Scattering Medium

Author: David J. Kriegman
Ravi Ramamoorthi
Tali Treibitz
Zak Murez
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date
Field of study

Crossref

FIERY: Future Instance Prediction in Bird’s-Eye View from Surround Monocular Cameras

Author: Badrinarayanan Vijay
Cipolla Roberto
Dudas Sofia
Hawke Jeffrey
Hu Anthony
Kendall Alex
Mohan Nikhil
Murez Zak
Publication venue
Publication date: 18/10/2021
Field of study

Driving requires interacting with road agents and predicting their future behaviour in order to navigate safely. We present FIERY: a probabilistic future prediction model in bird’s-eye view from monocular cameras. Our model predicts future instance segmentation and motion of dynamic agents that can be transformed into non-parametric future trajectories. Our approach combines the perception, sensor fusion and prediction components of a traditional autonomous driving stack by estimating bird’s-eye-view prediction directly from surround RGB monocular camera inputs. FIERY learns to model the inherent stochastic nature of the future solely from camera driving data in an end-to-end manner, without relying on HD maps, and predicts multimodal future trajectories. We show that our model outperforms previous prediction baselines on the NuScenes and Lyft datasets. The code and trained models are available at https://github.com/wayveai/fiery.Toshiba Europe, grant G100453

arXiv.org e-Print Archive

Apollo (Cambridge)

Recommended from our members

Model-Based Imitation Learning for Urban Driving

Author: Cipolla Roberto
Corrado Gianluca
Griffiths Nicolas
Gurau Corina
Hu Anthony
Kendall Alex
Murez Zak
Shotton Jamie
Yeo Hudson
Publication venue: Department of Engineering
Publication date: 03/01/2023
Field of study

An accurate model of the environment and the dynamic agents acting in it offers great potential for improving motion planning. We present MILE: a Model-based Imitation LEarning approach to jointly learn a model of the world and a policy for autonomous driving. Our method leverages 3D geometry as an inductive bias and learns a highly compact latent space directly from high-resolution videos of expert demonstrations. Our model is trained on an offline corpus of urban driving data, without any online interaction with the environment. MILE improves upon prior state-of-the-art by 31% in driving score on the CARLA simulator when deployed in a completely new town and new weather conditions. Our model can predict diverse and plausible states and actions, that can be interpretably decoded to bird's-eye view semantic segmentation. Further, we demonstrate that it can execute complex driving manoeuvres from plans entirely predicted in imagination. Our approach is the first camera-only method that models static scene, dynamic scene, and ego-behaviour in an urban driving environment. The code and model weights are available at https://github.com/wayveai/mile.Toshiba Europe grant G10045

Apollo (Cambridge)

Depth and Image Restoration from Light Field in a Scattering Medium

Author: Cui T(崔童)
Kriegman David
Murez Zak
Ramamoorthi Ravi
Tian JD(田建东)
Zhang Z(张箴)
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2017
Field of study

Traditional imaging methods and computer vision algorithms are often ineffective when images are acquired in scattering media, such as underwater, fog, and biological tissue. Here, we explore the use of light field imaging and algorithms for image restoration and depth estimation that address the image degradation from the medium. Towards this end, we make the following three contributions. First, we present a new single image restoration algorithm which removes backscatter and attenuation from images better than existing methods do, and apply it to each view in the light field. Second, we combine a novel transmission based depth cue with existing correspondence and defocus cues to improve light field depth estimation. In densely scattering media, our transmission depth cue is critical for depth estimation since the images have low signal to noise ratios which significantly degrades the performance of the correspondence and defocus cues. Finally, we propose shearing and refocusing multiple views of the light field to recover a single image of higher quality than what is possible from a single view. We demonstrate the benefits of our method through extensive experimental results in a water tank

Institutional Repository of Institute of Automation, CAS