9 research outputs found

    The non-parametric Parzen's window in stereo vision matching

    Get PDF
    This paper presents an approach to the local stereovision matching problem using edge segments as features with four attributes. From these attributes we compute a matching probability between pairs of features of the stereo images. A correspondence is said true when such a probability is maximum. We introduce a nonparametric strategy based on Parzen's window to estimate a probability density function (PDF) which is used to obtain the matching probability. This is the main finding of the paper. A comparative analysis of other recent matching methods is included to show that this finding can be justified theoretically. A generalization of the proposed method is made in order to give guidelines about its use with the similarity constraint and also in different environments where other features and attributes are more suitable

    Stereo matching Using Edge information and a Genetic algorithm

    Get PDF
    ๋ณธ ๋…ผ๋ฌธ์€ ๊ฒฝ๊ณ„์„  ์ •๋ณด์™€ ์œ ์ „ ์•Œ๊ณ ๋ฆฌ์ฆ˜์„ ์ด์šฉํ•œ ์Šคํ…Œ๋ ˆ์˜ค ์ •ํ•ฉ ๋ฐฉ๋ฒ•์„ ์ œ์•ˆํ•˜๊ณ  ์žˆ๋‹ค. ์ œ์•ˆํ•œ ๋ฐฉ๋ฒ•์€ ์กฐ๋ฐ€ํ•œ ๊ธฐ์ค€์˜์ƒ์˜ ๊ฒฝ๊ณ„์„ ์„ ๊ฒ€์ถœํ•˜๊ธฐ ์œ„ํ•ด LoG ์—ฐ์‚ฐ์ž๋ฅผ ์‚ฌ์šฉํ•˜๊ณ  ์ด๋ฅผ ์ด์šฉํ•ด ๋™์ผํ•œ ๋ณ€์œ„(disparity)๊ฐ’์„ ๊ฐ–๋Š” ๊ฐœ์ฒด์˜ ์˜์—ญ๊ณผ ์—ผ์ƒ‰์ฒด์˜ ๊ตฌ์กฐ๋ฅผ ์ •์˜ํ•˜์˜€๋‹ค. ๊ทธ๋ฆฌ๊ณ  ์Šคํ…Œ๋ ˆ์˜ค ์ •ํ•ฉ๋ฌธ์ œ๋ฅผ ์ตœ์ ํ™” ๋ฌธ์ œ๋กœ ๊ฐ„์ฃผํ•˜๊ณ  ์ตœ์ ํ™” ๋ฌธ์ œ ํ•ด๊ฒฐ์— ์ข‹์€ ์„ฑ๋Šฅ์„ ๋ณด์ด๋Š” ์œ ์ „ ์•Œ๊ณ ๋ฆฌ์ฆ˜์„ ์Šคํ…Œ๋ ˆ์˜ค ์ •ํ•ฉ ํ™˜๊ฒฝ์— ๋งž๊ฒŒ ๋ณ€ํ˜•ํ•˜์—ฌ ์ ์šฉํ•˜์˜€๋‹ค. ์ œ์•ˆํ•œ ๋ฐฉ๋ฒ•์€ ๊ฒฝ๊ณ„์„  ์ •๋ณด๋ฅผ ์ด์šฉํ•˜์—ฌ ๊ฐœ์ฒด์˜ ์˜์—ญ์„ ์‰ฝ๊ฒŒ ์ถ”์ •ํ•  ์ˆ˜ ์žˆ์—ˆ๊ณ  ์ด๋ฅผ ํ†ตํ•ด ๊ฒฝ๊ณ„์„  ๋ถ€๊ทผ์—์„œ ์˜ค์ •ํ•ฉ์„ ๋ฐœ์ƒ์‹œํ‚ค๋Š” ๊ธฐ์กด์˜ ์˜์—ญ๊ธฐ๋ฐ˜ ์ •ํ•ฉ์˜ ๋‹จ์ ์„ ๋ณด์™„ํ•  ์ˆ˜ ์žˆ์—ˆ๋‹ค. ๊ทธ๋ฆฌ๊ณ  ๊ฒ€์ถœํ•œ ๊ฒฝ๊ณ„์„  ์ •๋ณด๋กœ ์—ผ์ƒ‰์ฒด์˜ ๊ตฌ์กฐ๋ฅผ ์˜์ƒ์— ๋”ฐ๋ผ ์ ์‘์ ์œผ๋กœ ์ •์˜ํ•  ์ˆ˜ ์žˆ๊ธฐ ๋•Œ๋ฌธ์— ์œ ์ „ ์•Œ๊ณ ๋ฆฌ์ฆ˜์„ ์ ์šฉํ•˜๊ธฐ๊ฐ€ ์šฉ์ดํ–ˆ๋‹ค. ์ •ํ•ฉ๋น„์šฉํ•จ์ˆ˜๋ฅผ ์˜์—ญ๊ธฐ๋ฐ˜ ์ •ํ•ฉ์˜ ์ •ํ•ฉ๋น„์šฉํ•จ์ˆ˜์™€ ๋ณ€์œ„ ํ‰ํ™œ์„ฑ, ๋ณ€์œ„ ์ˆœ์„œ์„ฑ์˜ ์กฐํ•ฉ์œผ๋กœ ์ •์˜ํ•˜๊ณ  ์œ ์ „ ์•Œ๊ณ ๋ฆฌ์ฆ˜์˜ ๋ชฉ์ ํ•จ์ˆ˜๋กœ ์‚ฌ์šฉํ•˜์—ฌ ์ง„ํ™”๋ฅผ ํ†ตํ•ด ํšจ์œจ์ ์ธ ์ •ํ•ฉ์ด ๋˜๋„๋ก ํ–ˆ๋‹ค. ์ œ์•ˆํ•œ ๋ฐฉ๋ฒ•์—์„œ ๊ธฐ์กด์˜ ํŠน์ง•๊ธฐ๋ฐ˜ ์ •ํ•ฉ๋ณด๋‹ค ์ •ํ™•๋„๊ฐ€ ํ–ฅ์ƒ๋˜๊ณ , ์˜์—ญ๊ธฐ๋ฐ˜ ์ •ํ•ฉ๋ณด๋‹ค ๊ฒฝ๊ณ„์„  ๋ถ€๋ถ„์˜ ๋ณ€์œ„๊ฐ€ ๊ฐœ์„ ๋˜๋Š” ๊ฒฐ๊ณผ๋ฅผ ๋ณผ ์ˆ˜ ์žˆ์—ˆ๋‹ค. ์‹คํ—˜์—์„œ ๊ธฐ์กด์˜ ๋‹ค๋ฅธ ๋ฐฉ๋ฒ•๋“ค๊ณผ ๋น„๊ตํ•˜์—ฌ ์ œ์•ˆํ•œ ๋ฐฉ๋ฒ•์ด ์šฐ์ˆ˜ํ•จ์„ ๋ณด์˜€๋‹ค.1. ์„œ ๋ก  ................................................................ 1 2. ์Šคํ…Œ๋ ˆ์˜ค ์‹œ๊ฐ๊ณผ ์œ ์ „ ์•Œ๊ณ ๋ฆฌ์ฆ˜................................ 4 2.1. ์Šคํ…Œ๋ ˆ์˜ค ์‹œ๊ฐ.................................................. 4 2.2. ์Šคํ…Œ๋ ˆ์˜ค ์ •ํ•ฉ.................................................. 7 2.3. ์œ ์ „ ์•Œ๊ณ ๋ฆฌ์ฆ˜.................................................. 11 3. ๊ฒฝ๊ณ„์„  ์ •๋ณด์™€ ์œ ์ „ ์•Œ๊ณ ๋ฆฌ์ฆ˜์„ ์ด์šฉํ•œ ์Šคํ…Œ๋ ˆ์˜ค ์ •ํ•ฉ.. 16 3.1. ๊ฒฝ๊ณ„์„  ์ •๋ณด์— ์˜ํ•œ ์—ผ์ƒ‰์ฒด ๊ตฌ์กฐ ์ •์˜................... 16 3.2. ์œ ์ „ ์•Œ๊ณ ๋ฆฌ์ฆ˜์„ ์ด์šฉํ•œ ์Šคํ…Œ๋ ˆ์˜ค ์ •ํ•ฉ................. 20 3.2.1. ์—ผ์ƒ‰์ฒด ํ‘œํ˜„ ๋ฐ ์ง‘๋‹จ์˜ ์ดˆ๊ธฐํ™”........... 20 3.2.2. ์œ ์ „ ์—ฐ์‚ฐ์ž................................... 22 3.2.3. ๋ชฉ์  ํ•จ์ˆ˜ ๋ฐ ์ ํ•ฉ๋„........................ 28 4. ์‹คํ—˜ ๋ฐ ๊ณ ์ฐฐ.......................................................... 32 4.1. ํ•ฉ์„ฑ ์˜์ƒ ์‹คํ—˜.................................................. 33 4.2. ์‹ค์ œ ์˜์ƒ ์‹คํ—˜.................................................. 44 5. ๊ฒฐ ๋ก ................................................................... 48 ์ฐธ๊ณ  ๋ฌธํ—Œ ................................................................. 5

    Three dimensional reconstruction of the cell cytoskeleton from stereo images

    Get PDF
    Thesis (S.M.)--Massachusetts Institute of Technology, Dept. of Mechanical Engineering, 1998.Includes bibliographical references (leaves 80-83).Besides its primary application to robot vision, stereo vision also appears promising in the biomedical field. This study examines 3D reconstruction of the cell cytoskeleton. This application of stereo vision to electron micrographs extracts information about the interior structure of cells at the nanometer scale level. We propose two different types of stereo vision approaches: the line-segment and wavelet multiresolution methods. The former is primitive-based and the latter is a point-based approach. Structural information is stressed in both methods. Directional representation is employed to provide an ideal description for filament-type structures. In the line-segment method, line-segments are first extracted from directional representation and then matching is conducted between two line-segment sets of stereo images. A new search algorithm, matrix matching, is proposed to determine the matching globally. In the wavelet multiresolution method, a pyramidal architecture is presented. Bottom-up analysis is first performed to form two pyramids, containing wavelet decompositions and directional representations. Subsequently, top-down matching is carried out. Matching at a high level provides guidance and constraints to the matching at a lower level. Our reconstructed results reveal 3D structure and the relationships of filaments which are otherwise hard to see in the original stereo images. The method is sufficiently robust and accurate to allow the automated analysis of cell structural characteristics from electron microscopy pairs. The method may also have application to a general class of stereo images.by Yuan Cheng.S.M

    A Phase Based Dense Stereo Algorithm Implemented in CUDA

    Get PDF
    Stereo imaging is routinely used in Simultaneous Localization and Mapping (SLAM) systems for the navigation and control of autonomous spacecraft proximity operations, advanced robotics, and robotic mapping and surveying applications. A key step (and generally the most computationally expensive step) in the generation of high fidelity geometric environment models from image data is the solution of the dense stereo correspondence problem. A novel method for solving the stereo correspondence problem to sub-pixel accuracy in the Fourier frequency domain by exploiting the Convolution Theorem is developed. The method is tailored to challenging aerospace applications by incorporation of correction factors for common error sources. Error-checking metrics verify correspondence matches to ensure high quality depth reconstructions are generated. The effect of geometric foreshortening caused by the baseline displacement of the cameras is modeled and corrected, drastically improving correspondence matching on highly off-normal surfaces. A metric for quantifying the strength of correspondence matches is developed and implemented to recognize and reject weak correspondences, and a separate cross-check verification provides a final defense against erroneous matches. The core components of this phase based dense stereo algorithm are implemented and optimized in the Compute Uni ed Device Architecture (CUDA) parallel computation environment onboard an NVIDIA Graphics Processing Unit (GPU). Accurate dense stereo correspondence matching is performed on stereo image pairs at a rate of nearly 10Hz

    A family of stereoscopic image compression algorithms using wavelet transforms

    Get PDF
    With the standardization of JPEG-2000, wavelet-based image and video compression technologies are gradually replacing the popular DCT-based methods. In parallel to this, recent developments in autostereoscopic display technology is now threatening to revolutionize the way in which consumers are used to enjoying the traditional 2-D display based electronic media such as television, computer and movies. However, due to the two-fold bandwidth/storage space requirement of stereoscopic imaging, an essential requirement of a stereo imaging system is efficient data compression. In this thesis, seven wavelet-based stereo image compression algorithms are proposed, to take advantage of the higher data compaction capability and better flexibility of wavelets. [Continues.

    A family of stereoscopic image compression algorithms using wavelet transforms

    Get PDF
    With the standardization of JPEG-2000, wavelet-based image and video compression technologies are gradually replacing the popular DCT-based methods. In parallel to this, recent developments in autostereoscopic display technology is now threatening to revolutionize the way in which consumers are used to enjoying the traditional 2D display based electronic media such as television, computer and movies. However, due to the two-fold bandwidth/storage space requirement of stereoscopic imaging, an essential requirement of a stereo imaging system is efficient data compression. In this thesis, seven wavelet-based stereo image compression algorithms are proposed, to take advantage of the higher data compaction capability and better flexibility of wavelets. In the proposed CODEC I, block-based disparity estimation/compensation (DE/DC) is performed in pixel domain. However, this results in an inefficiency when DWT is applied on the whole predictive error image that results from the DE process. This is because of the existence of artificial block boundaries between error blocks in the predictive error image. To overcome this problem, in the remaining proposed CODECs, DE/DC is performed in the wavelet domain. Due to the multiresolution nature of the wavelet domain, two methods of disparity estimation and compensation have been proposed. The first method is performing DEJDC in each subband of the lowest/coarsest resolution level and then propagating the disparity vectors obtained to the corresponding subbands of higher/finer resolution. Note that DE is not performed in every subband due to the high overhead bits that could be required for the coding of disparity vectors of all subbands. This method is being used in CODEC II. In the second method, DEJDC is performed m the wavelet-block domain. This enables disparity estimation to be performed m all subbands simultaneously without increasing the overhead bits required for the coding disparity vectors. This method is used by CODEC III. However, performing disparity estimation/compensation in all subbands would result in a significant improvement of CODEC III. To further improve the performance of CODEC ill, pioneering wavelet-block search technique is implemented in CODEC IV. The pioneering wavelet-block search technique enables the right/predicted image to be reconstructed at the decoder end without the need of transmitting the disparity vectors. In proposed CODEC V, pioneering block search is performed in all subbands of DWT decomposition which results in an improvement of its performance. Further, the CODEC IV and V are able to perform at very low bit rates(< 0.15 bpp). In CODEC VI and CODEC VII, Overlapped Block Disparity Compensation (OBDC) is used with & without the need of coding disparity vector. Our experiment results showed that no significant coding gains could be obtained for these CODECs over CODEC IV & V. All proposed CODECs m this thesis are wavelet-based stereo image coding algorithms that maximise the flexibility and benefits offered by wavelet transform technology when applied to stereo imaging. In addition the use of a baseline-JPEG coding architecture would enable the easy adaptation of the proposed algorithms within systems originally built for DCT-based coding. This is an important feature that would be useful during an era where DCT-based technology is only slowly being phased out to give way for DWT based compression technology. In addition, this thesis proposed a stereo image coding algorithm that uses JPEG-2000 technology as the basic compression engine. The proposed CODEC, named RASTER is a rate scalable stereo image CODEC that has a unique ability to preserve the image quality at binocular depth boundaries, which is an important requirement in the design of stereo image CODEC. The experimental results have shown that the proposed CODEC is able to achieve PSNR gains of up to 3.7 dB as compared to directly transmitting the right frame using JPEG-2000

    Automatic face recognition using stereo images

    Get PDF
    Face recognition is an important pattern recognition problem, in the study of both natural and artificial learning problems. Compaxed to other biometrics, it is non-intrusive, non- invasive and requires no paxticipation from the subjects. As a result, it has many applications varying from human-computer-interaction to access control and law-enforcement to crowd surveillance. In typical optical image based face recognition systems, the systematic vaxiability arising from representing the three-dimensional (3D) shape of a face by a two-dimensional (21)) illumination intensity matrix is treated as random vaxiability. Multiple examples of the face displaying vaxying pose and expressions axe captured in different imaging conditions. The imaging environment, pose and expressions are strictly controlled and the images undergo rigorous normalisation and pre-processing. This may be implemented in a paxtially or a fully automated system. Although these systems report high classification accuracies (>90%), they lack versatility and tend to fail when deployed outside laboratory conditions. Recently, more sophisticated 3D face recognition systems haxnessing the depth information have emerged. These systems usually employ specialist equipment such as laser scanners and structured light projectors. Although more accurate than 2D optical image based recognition, these systems are equally difficult to implement in a non-co-operative environment. Existing face recognition systems, both 2D and 3D, detract from the main advantages of face recognition and fail to fully exploit its non-intrusive capacity. This is either because they rely too much on subject co-operation, which is not always available, or because they cannot cope with noisy data. The main objective of this work was to investigate the role of depth information in face recognition in a noisy environment. A stereo-based system, inspired by the human binocular vision, was devised using a pair of manually calibrated digital off-the-shelf cameras in a stereo setup to compute depth information. Depth values extracted from 2D intensity images using stereoscopy are extremely noisy, and as a result this approach for face recognition is rare. This was cofirmed by the results of our experimental work. Noise in the set of correspondences, camera calibration and triangulation led to inaccurate depth reconstruction, which in turn led to poor classifier accuracy for both 3D surface matching and 211) 2 depth maps. Recognition experiments axe performed on the Sheffield Dataset, consisting 692 images of 22 individuals with varying pose, illumination and expressions
    corecore