A Hierarchical and Contextual Model for Aerial Image Parsing

Author: A. Barbu
B. Yao
Jake Porway
K. S. Fu
K. Siddiqi
M. Fischler
M. Wainwright
P. Felzenszwalb
Qiongchen Wang
S.-C. Zhu
Song Chun Zhu
T. Matsuyama
Y. Keselman
Y. Ohta
Z. Tu
Publication venue: 'Springer Science and Business Media LLC'

DOTA: A Large-scale Dataset for Object Detection in Aerial Images

Author: Bai Xiang
Belongie Serge
Datcu Mihai
Ding Jian
Luo Jiebo
Pelillo Marcello
Xia Gui-Song
Zhang Liangpei
Zhu Zhen
Publication date: 01/01/2018

Object detection is an important and challenging problem in computer vision. Although the past decade has witnessed major advances in object detection in natural scenes, such successes have been slow to reach aerial imagery, not only because of the huge variation in the scale, orientation and shape of the object instances on the earth's surface, but also due to the scarcity of well-annotated datasets of objects in aerial scenes. To advance object detection research in Earth Vision, also known as Earth Observation and Remote Sensing, we introduce a large-scale Dataset for Object deTection in Aerial images (DOTA). To this end, we collect 2806 aerial images from different sensors and platforms. Each image is about 4000-by-4000 pixels in size and contains objects exhibiting a wide variety of scales, orientations, and shapes. These DOTA images are then annotated by experts in aerial image interpretation using 15 common object categories. The fully annotated DOTA images contain 188,282 instances, each of which is labeled by an arbitrary (8 d.o.f.) quadrilateral. To build a baseline for object detection in Earth Vision, we evaluate state-of-the-art object detection algorithms on DOTA. Experiments demonstrate that DOTA represents real Earth Vision applications well and is quite challenging. Comment: Accepted to CVPR 201
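The "8 d.o.f." in the abstract can be read as four corner points with two coordinates each. The following is a minimal sketch under that assumption, not the dataset's own tooling: the names `quad_area`, `enclosing_box`, and `example_quad` are hypothetical, and the vertex values are made up for illustration. It compares the area of an arbitrary quadrilateral with the area of its axis-aligned enclosing box.

```python
from typing import List, Tuple

Point = Tuple[float, float]


def quad_area(quad: List[Point]) -> float:
    """Area of a simple quadrilateral via the shoelace formula.

    Assumes the four vertices are listed in order around the boundary.
    """
    if len(quad) != 4:
        raise ValueError("a quadrilateral needs exactly four vertices")
    twice_area = 0.0
    for i in range(4):
        x1, y1 = quad[i]
        x2, y2 = quad[(i + 1) % 4]
        twice_area += x1 * y2 - x2 * y1
    return abs(twice_area) / 2.0


def enclosing_box(quad: List[Point]) -> Tuple[float, float, float, float]:
    """Axis-aligned box (xmin, ymin, xmax, ymax) that contains the quadrilateral."""
    xs = [x for x, _ in quad]
    ys = [y for _, y in quad]
    return (min(xs), min(ys), max(xs), max(ys))


# Hypothetical annotation: a square rotated by 45 degrees (4 points x 2 coords = 8 values).
example_quad: List[Point] = [(10.0, 0.0), (20.0, 10.0), (10.0, 20.0), (0.0, 10.0)]

if __name__ == "__main__":
    xmin, ymin, xmax, ymax = enclosing_box(example_quad)
    print("quadrilateral area:", quad_area(example_quad))
    print("enclosing box area:", (xmax - xmin) * (ymax - ymin))
```

For this rotated square the enclosing box covers twice the area of the quadrilateral, which hints at why an oriented annotation can describe rotated objects more tightly than a horizontal box.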

arXiv.org e-Print Archive

Institute of Transport Research:Publications

Crossref

Archivio istituzionale della ricerca - Università degli Studi di Venezia Ca' Foscari

Lifting GIS Maps into Strong Geometric Context for Scene Understanding

Author: Díaz Raúl
Fowlkes Charless C.
Lee Minhaeng
Schubert Jochen
Publication date: 08/01/2016

Contextual information can have a substantial impact on the performance of visual tasks such as semantic segmentation, object detection, and geometric estimation. Data stored in Geographic Information Systems (GIS) offers a rich source of contextual information that has been largely untapped by computer vision. We propose to leverage such information for scene understanding by combining GIS resources with large sets of unorganized photographs using Structure from Motion (SfM) techniques. We present a pipeline to quickly generate strong 3D geometric priors from 2D GIS data using SfM models aligned with minimal user input. Given an image resectioned against this model, we generate robust predictions of depth, surface normals, and semantic labels. We show that the predicted geometry is substantially more accurate than that of other single-image depth estimation methods. We then demonstrate the utility of these contextual constraints for re-scoring pedestrian detections, and use these GIS contextual features alongside object detection score maps to improve a CRF-based semantic segmentation framework, boosting accuracy over baseline models.

arXiv.org e-Print Archive

Crossref