8,855 research outputs found

    Layered Interpretation of Street View Images

    Full text link
    We propose a layered street view model to encode both depth and semantic information on street view images for autonomous driving. Recently, stixels, stix-mantics, and tiered scene labeling methods have been proposed to model street view images. We propose a 4-layer street view model, a compact representation over the recently proposed stix-mantics model. Our layers encode semantic classes like ground, pedestrians, vehicles, buildings, and sky in addition to the depths. The only input to our algorithm is a pair of stereo images. We use a deep neural network to extract the appearance features for semantic classes. We use a simple and an efficient inference algorithm to jointly estimate both semantic classes and layered depth values. Our method outperforms other competing approaches in Daimler urban scene segmentation dataset. Our algorithm is massively parallelizable, allowing a GPU implementation with a processing speed about 9 fps.Comment: The paper will be presented in the 2015 Robotics: Science and Systems Conference (RSS

    Wavelet based stereo images reconstruction using depth images

    Get PDF
    It is believed by many that three-dimensional (3D) television will be the next logical development toward a more natural and vivid home entertaiment experience. While classical 3D approach requires the transmission of two video streams, one for each view, 3D TV systems based on depth image rendering (DIBR) require a single stream of monoscopic images and a second stream of associated images usually termed depth images or depth maps, that contain per-pixel depth information. Depth map is a two-dimensional function that contains information about distance from camera to a certain point of the object as a function of the image coordinates. By using this depth information and the original image it is possible to reconstruct a virtual image of a nearby viewpoint by projecting the pixels of available image to their locations in 3D space and finding their position in the desired view plane. One of the most significant advantages of the DIBR is that depth maps can be coded more efficiently than two streams corresponding to left and right view of the scene, thereby reducing the bandwidth required for transmission, which makes it possible to reuse existing transmission channels for the transmission of 3D TV. This technique can also be applied for other 3D technologies such as multimedia systems. In this paper we propose an advanced wavelet domain scheme for the reconstruction of stereoscopic images, which solves some of the shortcommings of the existing methods discussed above. We perform the wavelet transform of both the luminance and depth images in order to obtain significant geometric features, which enable more sensible reconstruction of the virtual view. Motion estimation employed in our approach uses Markov random field smoothness prior for regularization of the estimated motion field. The evaluation of the proposed reconstruction method is done on two video sequences which are typically used for comparison of stereo reconstruction algorithms. The results demonstrate advantages of the proposed approach with respect to the state-of-the-art methods, in terms of both objective and subjective performance measures

    Object-based 2D-to-3D video conversion for effective stereoscopic content generation in 3D-TV applications

    Get PDF
    Three-dimensional television (3D-TV) has gained increasing popularity in the broadcasting domain, as it enables enhanced viewing experiences in comparison to conventional two-dimensional (2D) TV. However, its application has been constrained due to the lack of essential contents, i.e., stereoscopic videos. To alleviate such content shortage, an economical and practical solution is to reuse the huge media resources that are available in monoscopic 2D and convert them to stereoscopic 3D. Although stereoscopic video can be generated from monoscopic sequences using depth measurements extracted from cues like focus blur, motion and size, the quality of the resulting video may be poor as such measurements are usually arbitrarily defined and appear inconsistent with the real scenes. To help solve this problem, a novel method for object-based stereoscopic video generation is proposed which features i) optical-flow based occlusion reasoning in determining depth ordinal, ii) object segmentation using improved region-growing from masks of determined depth layers, and iii) a hybrid depth estimation scheme using content-based matching (inside a small library of true stereo image pairs) and depth-ordinal based regularization. Comprehensive experiments have validated the effectiveness of our proposed 2D-to-3D conversion method in generating stereoscopic videos of consistent depth measurements for 3D-TV applications

    Cosmic Ray in the Northern Hemisphere: Results from the Telescope Array Experiment

    Full text link
    The Telescope Array (TA) is the largest ultrahigh energy (UHE) cosmic ray observatory in the northern hemisphere TA is a hybrid experiment with a unique combination of fluorescence detectors and a stand-alone surface array of scintillation counters. We will present the spectrum measured by the surface array alone, along with those measured by the fluorescence detectors in monocular, hybrid, and stereo mode. The composition results from stereo TA data will be discussed. Our report will also include results from the search for correlations between the pointing directions of cosmic rays, seen by the TA surface array, with active galactic nuclei.Comment: 8 pages 11 figure, Proceedings of the APS Division of Particle and Fields (DPF) Meeting, Aug 2011, Brown University, Providence, RI, US

    Fluids mobilization in Arabia Terra, Mars: depth of pressurized reservoir from mounds self-similar clustering

    Full text link
    Arabia Terra is a region of Mars where signs of past-water occurrence are recorded in several landforms. Broad and local scale geomorphological, compositional and hydrological analyses point towards pervasive fluid circulation through time. In this work we focus on mound fields located in the interior of three casters larger than 40 km (Firsoff, Kotido and unnamed crater 20 km to the east) and showing strong morphological and textural resemblance to terrestrial mud volcanoes and spring-related features. We infer that these landforms likely testify the presence of a pressurized fluid reservoir at depth and past fluid upwelling. We have performed morphometric analyses to characterize the mound morphologies and consequently retrieve an accurate automated mapping of the mounds within the craters for spatial distribution and fractal clustering analysis. The outcome of the fractal clustering yields information about the possible extent of the percolating fracture network at depth below the craters. We have been able to constrain the depth of the pressurized fluid reservoir between ~2.5 and 3.2 km of depth and hence, we propose that mounds and mounds alignments are most likely associated to the presence of fissure ridges and fluid outflow. Their process of formation is genetically linked to the formation of large intra-crater bulges previously interpreted as large scale spring deposits. The overburden removal caused by the impact crater formation is the inferred triggering mechanism for fluid pressurization and upwelling, that through time led to the formation of the intra-crater bulges and, after compaction and sealing, to the widespread mound fields in their surroundings
    • …
    corecore