284 research outputs found

    Real-time self-adaptive deep stereo

    Full text link
    Deep convolutional neural networks trained end-to-end are the state-of-the-art methods to regress dense disparity maps from stereo pairs. These models, however, suffer from a notable decrease in accuracy when exposed to scenarios significantly different from the training set, e.g., real vs synthetic images, etc.). We argue that it is extremely unlikely to gather enough samples to achieve effective training/tuning in any target domain, thus making this setup impractical for many applications. Instead, we propose to perform unsupervised and continuous online adaptation of a deep stereo network, which allows for preserving its accuracy in any environment. However, this strategy is extremely computationally demanding and thus prevents real-time inference. We address this issue introducing a new lightweight, yet effective, deep stereo architecture, Modularly ADaptive Network (MADNet) and developing a Modular ADaptation (MAD) algorithm, which independently trains sub-portions of the network. By deploying MADNet together with MAD we introduce the first real-time self-adaptive deep stereo system enabling competitive performance on heterogeneous datasets.Comment: Accepted at CVPR2019 as oral presentation. Code Available https://github.com/CVLAB-Unibo/Real-time-self-adaptive-deep-stere

    Guided Stereo Matching

    Full text link
    Stereo is a prominent technique to infer dense depth maps from images, and deep learning further pushed forward the state-of-the-art, making end-to-end architectures unrivaled when enough data is available for training. However, deep networks suffer from significant drops in accuracy when dealing with new environments. Therefore, in this paper, we introduce Guided Stereo Matching, a novel paradigm leveraging a small amount of sparse, yet reliable depth measurements retrieved from an external source enabling to ameliorate this weakness. The additional sparse cues required by our method can be obtained with any strategy (e.g., a LiDAR) and used to enhance features linked to corresponding disparity hypotheses. Our formulation is general and fully differentiable, thus enabling to exploit the additional sparse inputs in pre-trained deep stereo networks as well as for training a new instance from scratch. Extensive experiments on three standard datasets and two state-of-the-art deep architectures show that even with a small set of sparse input cues, i) the proposed paradigm enables significant improvements to pre-trained networks. Moreover, ii) training from scratch notably increases accuracy and robustness to domain shifts. Finally, iii) it is suited and effective even with traditional stereo algorithms such as SGM.Comment: CVPR 201

    Learning monocular depth estimation with unsupervised trinocular assumptions

    Full text link
    Obtaining accurate depth measurements out of a single image represents a fascinating solution to 3D sensing. CNNs led to considerable improvements in this field, and recent trends replaced the need for ground-truth labels with geometry-guided image reconstruction signals enabling unsupervised training. Currently, for this purpose, state-of-the-art techniques rely on images acquired with a binocular stereo rig to predict inverse depth (i.e., disparity) according to the aforementioned supervision principle. However, these methods suffer from well-known problems near occlusions, left image border, etc inherited from the stereo setup. Therefore, in this paper, we tackle these issues by moving to a trinocular domain for training. Assuming the central image as the reference, we train a CNN to infer disparity representations pairing such image with frames on its left and right side. This strategy allows obtaining depth maps not affected by typical stereo artifacts. Moreover, being trinocular datasets seldom available, we introduce a novel interleaved training procedure enabling to enforce the trinocular assumption outlined from current binocular datasets. Exhaustive experimental results on the KITTI dataset confirm that our proposal outperforms state-of-the-art methods for unsupervised monocular depth estimation trained on binocular stereo pairs as well as any known methods relying on other cues.Comment: 14 pages, 7 figures, 4 tables. Accepted to 3DV 201

    TemporalStereo: Efficient Spatial-Temporal Stereo Matching Network

    Full text link
    We present TemporalStereo, a coarse-to-fine based online stereo matching network which is highly efficient, and able to effectively exploit the past geometry and context information to boost the matching accuracy. Our network leverages sparse cost volume and proves to be effective when a single stereo pair is given, however, its peculiar ability to use spatio-temporal information across frames allows TemporalStereo to alleviate problems such as occlusions and reflective regions while enjoying high efficiency also in the case of stereo sequences. Notably our model trained, once with stereo videos, can run in both single-pair and temporal ways seamlessly. Experiments show that our network relying on camera motion is even robust to dynamic objects when running on videos. We validate TemporalStereo through extensive experiments on synthetic (SceneFlow, TartanAir) and real (KITTI 2012, KITTI 2015) datasets. Detailed results show that our model achieves state-of-the-art performance on any of these datasets. Code is available at \url{https://github.com/youmi-zym/TemporalStereo.git}

    Learning optical flow from still images

    Full text link
    This paper deals with the scarcity of data for training optical flow networks, highlighting the limitations of existing sources such as labeled synthetic datasets or unlabeled real videos. Specifically, we introduce a framework to generate accurate ground-truth optical flow annotations quickly and in large amounts from any readily available single real picture. Given an image, we use an off-the-shelf monocular depth estimation network to build a plausible point cloud for the observed scene. Then, we virtually move the camera in the reconstructed environment with known motion vectors and rotation angles, allowing us to synthesize both a novel view and the corresponding optical flow field connecting each pixel in the input image to the one in the new frame. When trained with our data, state-of-the-art optical flow networks achieve superior generalization to unseen real data compared to the same models trained either on annotated synthetic datasets or unlabeled videos, and better specialization if combined with synthetic images.Comment: CVPR 2021. Project page with supplementary and code: https://mattpoggi.github.io/projects/cvpr2021aleotti

    Excitation of travelling multibreathers in anharmonic chains

    Full text link
    We study the dynamics of the "externally" forced and damped Fermi-Pasta-Ulam (FPU) 1D lattice. The forcing has the spatial symmetry of the Fourier mode with wavenumber p and oscillates sinusoidally in time with the frequency omega. When omega is in the phonon band, the p-mode becomes modulationally unstable above a critical forcing, which we determine analytically in terms of the parameters of the system. For omega above the phonon band, the instability of the p-mode leads to the formation of a travelling multibreather, that, in the low-amplitude limit could be described in terms of soliton solutions of a suitable driven-damped nonlinear Schroedinger (NLS) equation. Similar mechanisms of instability could show up in easy-axis magnetic structures, that are governed by such NLS equations.Comment: To appear in Physica D (2002

    Utopía, mística, revolución. A los inicios de la tragedia alemana del siglo XX

    Get PDF
    Como se deduce por el subtítulo, esta reflexión no se ocupa de la relación entre la utopía, mística y revolución desde un punto de vista general. Entre los tres conceptos – cada uno de los cuales ha sido tratado por una literatura infinita- los nexos siguen estando muy vinculados y en absoluto casuales. Dichos vínculos han sido estudiados muy ampliamente, remontándose a la historia de las ideas filosóficas, teológicas y políticas. Sin embargo, me voy a enfocar en nuestra época y en un momento muy peculiar y importante de la historia de Europa. Voy a examinar algunos aspectos de la relación entre la utopía, la mística y la revolución en la Alemania de los primeras dos décadas del siglo XX. Para esto es imprescindibile un prólogo, breve, pero tampoco tanto.Como se deduce por el subtítulo, esta reflexión no se ocupa de la relación entre la utopía, mística y revolución desde un punto de vista general. Entre los tres conceptos – cada uno de los cuales ha sido tratado por una literatura infinita- los nexos siguen estando muy vinculados y en absoluto casuales. Dichos vínculos han sido estudiados muy ampliamente, remontándose a la historia de las ideas filosóficas, teológicas y políticas. Sin embargo, me voy a enfocar en nuestra época y en un momento muy peculiar y importante de la historia de Europa. Voy a examinar algunos aspectos de la relación entre la utopía, la mística y la revolución en la Alemania de los primeras dos décadas del siglo XX. Para esto es imprescindibile un prólogo, breve, pero tampoco tanto
    corecore