Search CORE

3,746 research outputs found

Optical Flow in Mostly Rigid Scenes

Author: Black Michael J.
Sevilla-Lara Laura
Wulff Jonas
Publication venue
Publication date: 01/01/2017
Field of study

The optical flow of natural scenes is a combination of the motion of the observer and the independent motion of objects. Existing algorithms typically focus on either recovering motion and structure under the assumption of a purely static world or optical flow for general unconstrained scenes. We combine these approaches in an optical flow algorithm that estimates an explicit segmentation of moving objects from appearance and physical constraints. In static regions we take advantage of strong constraints to jointly estimate the camera motion and the 3D structure of the scene over multiple frames. This allows us to also regularize the structure instead of the motion. Our formulation uses a Plane+Parallax framework, which works even under small baselines, and reduces the motion estimation to a one-dimensional search problem, resulting in more accurate estimation. In moving regions the flow is treated as unconstrained, and computed with an existing optical flow method. The resulting Mostly-Rigid Flow (MR-Flow) method achieves state-of-the-art results on both the MPI-Sintel and KITTI-2015 benchmarks.Comment: 15 pages, 10 figures; accepted for publication at CVPR 201

arXiv.org e-Print Archive

Crossref

MPG.PuRe

Motion Estimation from Disparity Images

Author: Darrell T.
Demirdjian D.
Publication venue
Publication date: 01/01/2001
Field of study

A new method for 3D rigid motion estimation from stereo is proposed in this paper. The appealing feature of this method is that it directly uses the disparity images obtained from stereo matching. We assume that the stereo rig has parallel cameras and show, in that case, the geometric and topological properties of the disparity images. Then we introduce a rigid transformation (called d-motion) that maps two disparity images of a rigidly moving object. We show how it is related to the Euclidean rigid motion and a motion estimation algorithm is derived. We show with experiments that our approach is simple and more accurate than standard approaches

CiteSeerX

DSpace@MIT

Fully Automatic Expression-Invariant Face Correspondence

Author: A Bronstein
A Elad
Augusto Salazar
B Allen
Chang Shu
D Jiang
D Liu
D Vlasic
E Learned-Miller
E Vezzetti
Flavio Prieto
G Passalis
H Li
H Li
H Rabiu
I Dryden
I Kakadiaris
J Guo
J Han
K Chang
L Xiaoguang
M Segundo
O Kaick van
P Nair
P Xi
S Berretti
S Wuhrer
Stefanie Wuhrer
T Cox
T Weise
Y Huang
Y Tong
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2013
Field of study

We consider the problem of computing accurate point-to-point correspondences among a set of human face scans with varying expressions. Our fully automatic approach does not require any manually placed markers on the scan. Instead, the approach learns the locations of a set of landmarks present in a database and uses this knowledge to automatically predict the locations of these landmarks on a newly available scan. The predicted landmarks are then used to compute point-to-point correspondences between a template model and the newly available scan. To accurately fit the expression of the template to the expression of the scan, we use as template a blendshape model. Our algorithm was tested on a database of human faces of different ethnic groups with strongly varying expressions. Experimental results show that the obtained point-to-point correspondence is both highly accurate and consistent for most of the tested 3D face models

arXiv.org e-Print Archive

CiteSeerX

NRC Publications Archive

Crossref

Deep Shape-from-Template: Single-image quasi-isometric deformable registration and reconstruction

Author: Bartoli Adrien
Casillas Pérez David
Collins Toby
Fuentes Jiménez David
Pizarro Pérez Daniel
Publication venue: 'Elsevier BV'
Publication date: 01/11/2022
Field of study

Shape-from-Template (SfT) solves 3D vision from a single image and a deformable 3D object model, called a template. Concretely, SfT computes registration (the correspondence between the template and the image) and reconstruction (the depth in camera frame). It constrains the object deformation to quasi-isometry. Real-time and automatic SfT represents an open problem for complex objects and imaging conditions. We present four contributions to address core unmet challenges to realise SfT with a Deep Neural Network (DNN). First, we propose a novel DNN called DeepSfT, which encodes the template in its weights and hence copes with highly complex templates. Second, we propose a semi-supervised training procedure to exploit real data. This is a practical solution to overcome the render gap that occurs when training only with simulated data. Third, we propose a geometry adaptation module to deal with different cameras at training and inference. Fourth, we combine statistical learning with physics-based reasoning. DeepSfT runs automatically and in real-time and we show with numerous experiments and an ablation study that it consistently achieves a lower 3D error than previous work. It outperforms in generalisation and achieves great performance in terms of reconstruction and registration error with wide-baseline, occlusions, illumination changes, weak texture and blur.Agencia Estatal de InvestigaciónMinisterio de Educació

e_Buah - Biblioteca Digital de la Universidad de Alcalá

MFT: Long-Term Tracking of Every Pixel

Author: Matas Jiří
Neoral Michal
Šerých Jonáš
Publication venue
Publication date: 10/11/2023
Field of study

We propose MFT -- Multi-Flow dense Tracker -- a novel method for dense, pixel-level, long-term tracking. The approach exploits optical flows estimated not only between consecutive frames, but also for pairs of frames at logarithmically spaced intervals. It selects the most reliable sequence of flows on the basis of estimates of its geometric accuracy and the probability of occlusion, both provided by a pre-trained CNN. We show that MFT achieves competitive performance on the TAP-Vid benchmark, outperforming baselines by a significant margin, and tracking densely orders of magnitude faster than the state-of-the-art point-tracking methods. The method is insensitive to medium-length occlusions and it is robustified by estimating flow with respect to the reference frame, which reduces drift.Comment: accepted to WACV 2024. Code at https://github.com/serycjon/MF

arXiv.org e-Print Archive