3,008 research outputs found

    Efficient Irregular Wavefront Propagation Algorithms on Hybrid CPU-GPU Machines

    Full text link
    In this paper, we address the problem of efficient execution of a computation pattern, referred to here as the irregular wavefront propagation pattern (IWPP), on hybrid systems with multiple CPUs and GPUs. The IWPP is common in several image processing operations. In the IWPP, data elements in the wavefront propagate waves to their neighboring elements on a grid if a propagation condition is satisfied. Elements receiving the propagated waves become part of the wavefront. This pattern results in irregular data accesses and computations. We develop and evaluate strategies for efficient computation and propagation of wavefronts using a multi-level queue structure. This queue structure improves the utilization of fast memories in a GPU and reduces synchronization overheads. We also develop a tile-based parallelization strategy to support execution on multiple CPUs and GPUs. We evaluate our approaches on a state-of-the-art GPU accelerated machine (equipped with 3 GPUs and 2 multicore CPUs) using the IWPP implementations of two widely used image processing operations: morphological reconstruction and euclidean distance transform. Our results show significant performance improvements on GPUs. The use of multiple CPUs and GPUs cooperatively attains speedups of 50x and 85x with respect to single core CPU executions for morphological reconstruction and euclidean distance transform, respectively.Comment: 37 pages, 16 figure

    Voronoi image segmentation and its applications to geoinformatics

    Get PDF
    As various geospatial images are available for analysis, there is a strong need for an intelligent geospatial image processing method. Segmenting and districting digital images is a core process and is of great importance in many geo-related applications. We propose a flexible image segmentation framework based on generalized Voronoi diagrams through Euclidean distance transforms. We introduce a three-scan algorithm that segments images in O(N) time when N is the number of pixels. The algorithm is capable of handling generators of complex types (point, line and area), Minkowski metrics and different weights. This paper also provides applications of the proposed method in various geoinformation datasets. Illustrated examples demonstrate the usefulness and robustness of our proposed method

    Mining Point Cloud Local Structures by Kernel Correlation and Graph Pooling

    Full text link
    Unlike on images, semantic learning on 3D point clouds using a deep network is challenging due to the naturally unordered data structure. Among existing works, PointNet has achieved promising results by directly learning on point sets. However, it does not take full advantage of a point's local neighborhood that contains fine-grained structural information which turns out to be helpful towards better semantic learning. In this regard, we present two new operations to improve PointNet with a more efficient exploitation of local structures. The first one focuses on local 3D geometric structures. In analogy to a convolution kernel for images, we define a point-set kernel as a set of learnable 3D points that jointly respond to a set of neighboring data points according to their geometric affinities measured by kernel correlation, adapted from a similar technique for point cloud registration. The second one exploits local high-dimensional feature structures by recursive feature aggregation on a nearest-neighbor-graph computed from 3D positions. Experiments show that our network can efficiently capture local information and robustly achieve better performances on major datasets. Our code is available at http://www.merl.com/research/license#KCNetComment: Accepted in CVPR'18. *indicates equal contributio

    Reduction of continuous symmetries of chaotic flows by the method of slices

    Full text link
    We study continuous symmetry reduction of dynamical systems by the method of slices (method of moving frames) and show that a `slice' defined by minimizing the distance to a single generic `template' intersects the group orbit of every point in the full state space. Global symmetry reduction by a single slice is, however, not natural for a chaotic / turbulent flow; it is better to cover the reduced state space by a set of slices, one for each dynamically prominent unstable pattern. Judiciously chosen, such tessellation eliminates the singular traversals of the inflection hyperplane that comes along with each slice, an artifact of using the template's local group linearization globally. We compute the jump in the reduced state space induced by crossing the inflection hyperplane. As an illustration of the method, we reduce the SO(2) symmetry of the complex Lorenz equations.Comment: to appear in "Comm. Nonlinear Sci. and Numer. Simulat. (2011)" 12 pages, 8 figure

    Antipodally invariant metrics for fast regression-based super-resolution

    Get PDF
    Dictionary-based super-resolution (SR) algorithms usually select dictionary atoms based on the distance or similarity metrics. Although the optimal selection of the nearest neighbors is of central importance for such methods, the impact of using proper metrics for SR has been overlooked in literature, mainly due to the vast usage of Euclidean distance. In this paper, we present a very fast regression-based algorithm, which builds on the densely populated anchored neighborhoods and sublinear search structures. We perform a study of the nature of the features commonly used for SR, observing that those features usually lie in the unitary hypersphere, where every point has a diametrically opposite one, i.e., its antipode, with same module and angle, but the opposite direction. Even though, we validate the benefits of using antipodally invariant metrics, most of the binary splits use Euclidean distance, which does not handle antipodes optimally. In order to benefit from both the worlds, we propose a simple yet effective antipodally invariant transform that can be easily included in the Euclidean distance calculation. We modify the original spherical hashing algorithm with this metric in our antipodally invariant spherical hashing scheme, obtaining the same performance as a pure antipodally invariant metric. We round up our contributions with a novel feature transform that obtains a better coarse approximation of the input image thanks to iterative backprojection. The performance of our method, which we named antipodally invariant SR, improves quality (Peak Signal to Noise Ratio) and it is faster than any other state-of-the-art method.Peer ReviewedPostprint (author's final draft
    • …
    corecore