Play and Learn: Using Video Games to Train Computer Vision Models

Little, James J.; Schmidt, Mark; Shafaei, Alireza

research

Play and Learn: Using Video Games to Train Computer Vision Models

Authors: James J. Little
Mark Schmidt
Alireza Shafaei
Publication date: 1 January 2016
Publisher
Doi

Abstract

Video games are a compelling source of annotated data as they can readily provide fine-grained groundtruth for diverse tasks. However, it is not clear whether the synthetically generated data has enough resemblance to the real-world images to improve the performance of computer vision models in practice. We present experiments assessing the effectiveness on real-world data of systems trained on synthetic RGB images that are extracted from a video game. We collected over 60000 synthetic samples from a modern video game with similar conditions to the real-world CamVid and Cityscapes datasets. We provide several experiments to demonstrate that the synthetically generated RGB images can be used to improve the performance of deep neural networks on both image segmentation and depth estimation. These results show that a convolutional network trained on synthetic data achieves a similar test error to a network that is trained on real-world data for dense image classification. Furthermore, the synthetically generated RGB images can provide similar or better results compared to the real-world datasets if a simple domain adaptation technique is applied. Our results suggest that collaboration with game developers for an accessible interface to gather data is potentially a fruitful direction for future work in computer vision.Comment: To appear in the British Machine Vision Conference (BMVC), September 2016. -v2: fixed a typo in the reference

Similar works

Full text

Available Versions

Crossref

info:doi/10.5244%2Fc.30.26

Last time updated on 01/04/2019