2 research outputs found

    High performance two-dimensional phase unwrapping on gpus

    No full text
    Phase unwrapping is an important procedure in digital im-age and signal processing, and has been widely used in many elds, such as optical and microwave interferometry, mag-netic resonance imaging, synthetic aperture radar, adaptive optics. Phase unwrapping is a time consuming process with large amount of calculations and complicated data depen-dency. A number of algorithms with different features have been developed to solve this problem. Among all of them, Goldstein's algorithm is one of the most widely used algo-rithms, and has been included in some standard libraries (such as MATLAB). In this paper we propose an innovative implementation of Goldstein's algorithm on GPU. Several important approaches and optimizations are proposed for the GPU algorithm. For example, by introducing a local-matching step, we were able to parallelize the branchcut step effciently, getting much better performance than ex-isting work. With a cascaded propagation model, another important operation in the algorithm, floodfill, is able to make good use of the computing power of GPU. We tested our GPU algorithm on NVIDIA C2050 and K20 GPUs, and achieved speedup of up to 781 and 896 over the CPU imple-mentation respectively. To the best of our knowledge, this is the best performance of unwrap ever achieved on GPUs. © 2014 ACM.Phase unwrapping is an important procedure in digital im-age and signal processing, and has been widely used in many elds, such as optical and microwave interferometry, mag-netic resonance imaging, synthetic aperture radar, adaptive optics. Phase unwrapping is a time consuming process with large amount of calculations and complicated data depen-dency. A number of algorithms with different features have been developed to solve this problem. Among all of them, Goldstein's algorithm is one of the most widely used algo-rithms, and has been included in some standard libraries (such as MATLAB). In this paper we propose an innovative implementation of Goldstein's algorithm on GPU. Several important approaches and optimizations are proposed for the GPU algorithm. For example, by introducing a local-matching step, we were able to parallelize the branchcut step effciently, getting much better performance than ex-isting work. With a cascaded propagation model, another important operation in the algorithm, floodfill, is able to make good use of the computing power of GPU. We tested our GPU algorithm on NVIDIA C2050 and K20 GPUs, and achieved speedup of up to 781 and 896 over the CPU imple-mentation respectively. To the best of our knowledge, this is the best performance of unwrap ever achieved on GPUs. © 2014 ACM
    corecore