11,946 research outputs found

    RGB-D datasets using microsoft kinect or similar sensors: a survey

    Get PDF
    RGB-D data has turned out to be a very useful representation of an indoor scene for solving fundamental computer vision problems. It takes the advantages of the color image that provides appearance information of an object and also the depth image that is immune to the variations in color, illumination, rotation angle and scale. With the invention of the low-cost Microsoft Kinect sensor, which was initially used for gaming and later became a popular device for computer vision, high quality RGB-D data can be acquired easily. In recent years, more and more RGB-D image/video datasets dedicated to various applications have become available, which are of great importance to benchmark the state-of-the-art. In this paper, we systematically survey popular RGB-D datasets for different applications including object recognition, scene classification, hand gesture recognition, 3D-simultaneous localization and mapping, and pose estimation. We provide the insights into the characteristics of each important dataset, and compare the popularity and the difficulty of those datasets. Overall, the main goal of this survey is to give a comprehensive description about the available RGB-D datasets and thus to guide researchers in the selection of suitable datasets for evaluating their algorithms

    Fast detecting and tracking of moving objects in video scenes

    Get PDF
    18 pages. Quelques films de résultats sont disponible sur: http://www.ceremade.dauphine.fr/~pelletieIn this article we present a new method for detecting textured moving objects. Based on a known background estimation and a fixed camera, the algorithm is able to detect moving objects and locates them at video rate, moreover this method is used for object tracking purposes. Our method is multi-step: First, we use level lines to detect pixels of the background which are occluded by moving object. Then, we use an a contrario model as general framework to make an automatic clustering. Thus the moving objects are detected as regions and not only as pixels, eventually we correct this region to better fit the moving object. Experimental results show that the algorithm is very robust to noise and to the quality of the background estimation (e.g. ghosts). The algorithm has been successfully tested in video sequences coming from different databases, including indoor and outdoor sequences
    • …
    corecore