1,170 research outputs found
AT-GIS: highly parallel spatial query processing with associative transducers
Users in many domains, including urban planning, transportation, and environmental science want to execute analytical queries over continuously updated spatial datasets. Current solutions for largescale spatial query processing either rely on extensions to RDBMS, which entails expensive loading and indexing phases when the data changes, or distributed map/reduce frameworks, running on resource-hungry compute clusters. Both solutions struggle with the sequential bottleneck of parsing complex, hierarchical spatial data formats, which frequently dominates query execution time. Our goal is to fully exploit the parallelism offered by modern multicore CPUs for parsing and query execution, thus providing the performance of a cluster with the resources of a single machine. We describe AT-GIS, a highly-parallel spatial query processing system that scales linearly to a large number of CPU cores. ATGIS integrates the parsing and querying of spatial data using a new computational abstraction called associative transducers(ATs). ATs can form a single data-parallel pipeline for computation without requiring the spatial input data to be split into logically independent blocks. Using ATs, AT-GIS can execute, in parallel, spatial query operators on the raw input data in multiple formats, without any pre-processing. On a single 64-core machine, AT-GIS provides 3× the performance of an 8-node Hadoop cluster with 192 cores for containment queries, and 10× for aggregation queries
Enhancment of dense urban digital surface models from VHR optical satellite stereo data by pre-segmentation and object detection
The generation of digital surface models (DSM) of urban areas from very high resolution (VHR) stereo satellite imagery requires advanced methods. In the classical approach of DSM generation from stereo satellite imagery, interest points are extracted and correlated between the stereo mates using an area based matching followed by a least-squares sub-pixel refinement step. After a region growing the 3D point list is triangulated to the resulting DSM. In urban areas this approach fails due to the size of the correlation window, which smoothes out the usual steep edges of buildings. Also missing correlations as for partly – in one or both of the images – occluded areas will simply be interpolated in the triangulation step. So an urban DSM generated with the classical approach results in a very smooth DSM with missing steep walls, narrow streets and courtyards. To overcome these problems algorithms from computer vision are introduced and adopted to satellite imagery. These algorithms do not work using local optimisation like the area-based matching but try to optimize a (semi-)global cost function. Analysis shows that dynamic programming approaches based on epipolar images like dynamic line warping or semiglobal matching yield the best results according to accuracy and processing time. These algorithms can also detect occlusions – areas not visible in one or both of the stereo images. Beside these also the time and memory consuming step of handling and triangulating large point lists can be omitted due to the direct operation on epipolar images and direct generation of a so called disparity image fitting exactly on the first of the stereo images. This disparity image – representing already a sort of a dense DSM – contains the distances measured in pixels in the epipolar direction (or a no-data value for a detected occlusion) for each pixel in the image. Despite the global optimization of the cost function many outliers, mismatches and erroneously detected occlusions remain, especially if only one stereo pair is available. To enhance these dense DSM – the disparity image – a pre-segmentation approach is presented in this paper. Since the disparity image is fitting exactly on the first of the two stereo partners (beforehand transformed to epipolar geometry) a direct
correlation between image pixels and derived heights (the disparities) exist. This feature of the disparity image is exploited to integrate additional knowledge from the image into the DSM. This is done by segmenting the stereo image, transferring the segmentation information to the DSM and performing a statistical analysis on each of the created DSM segments. Based on this analysis and spectral information a coarse object detection and classification can be performed and in turn the DSM can be enhanced. After the description of the proposed method some results are shown and discussed
Going beyond semantic image segmentation, towards holistic scene understanding, with associative hierarchical random fields
In this thesis we exploit the generality and expressive power of the Associative Hierarchical
Random Field (AHRF) graphical model to take its use beyond that of semantic image segmentation,
into object-classes, towards a framework for holistic scene understanding. We provide a
working definition for the holistic approach to scene understanding, which allows for the integration
of existing, disparate, applications into an unifying ensemble. We believe that modelling
such an ensemble as an AHRF is both a principled and pragmatic solution. We present a hierarchy
that shows several methods for fusing applications together with the AHRF graphical model.
Each of the three; feature, potential and energy, layers subsumes its predecessor in generality
and together give rise to many options for integration. With applications on street scenes we
demonstrate an implementation of each layer. The first layer application joins appearance and
geometric features. For our second layer we implement a things and stuff co-junction using
higher order AHRF potentials for object detectors, with the goal of answering the classic questions:
What? Where? and How many? A holistic approach to recognition-and-reconstruction
is realised within our third layer by linking two energy based formulations of both applications.
Each application is evaluated qualitatively and quantitatively. In all cases our holistic approach
shows improvement over baseline methods
Improving Global Multi-target Tracking with Local Updates
Conference dates: September 6-7 & 12, 2014We propose a scheme to explicitly detect and resolve ambiguous situations in multiple target tracking. During periods of uncertainty, our method applies multiple local single target trackers to hypothesise short term tracks. These tracks are combined with the tracks obtained by a global multi-target tracker, if they result in a reduction in the global cost function. Since tracking failures typically arise when targets become occluded, we propose a local data association scheme to maintain the target identities in these situations. We demonstrate a reduction of up to 50% in the global cost function, which in turn leads to superior performance on several challenging benchmark sequences. Additionally, we show tracking results in sports videos where poor video quality and frequent and severe occlusions between multiple players pose difficulties for state-of-the-art trackers.Anton Milan, Rikke Gade, Anthony Dick, Thomas B. Moeslund, and Ian Rei
FutureMapping 2: Gaussian Belief Propagation for Spatial AI
We argue the case for Gaussian Belief Propagation (GBP) as a strong
algorithmic framework for the distributed, generic and incremental
probabilistic estimation we need in Spatial AI as we aim at high performance
smart robots and devices which operate within the constraints of real products.
Processor hardware is changing rapidly, and GBP has the right character to take
advantage of highly distributed processing and storage while estimating global
quantities, as well as great flexibility. We present a detailed tutorial on
GBP, relating to the standard factor graph formulation used in robotics and
computer vision, and give several simulation examples with code which
demonstrate its properties
- …