Search CORE

78,688 research outputs found

Robust Phase-Correlation based Registration of Airborne Videos using Motion Estimation

Author: Borgeaud Maurice
de Morsier Frank
Gass Volker
Küchler Christoph
Thiran Jean-Philippe
Vogel Adrian
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 05/01/2011
Field of study

This paper presents a robust algorithm for the registration of airborne video sequences with reference images from a different source (airborne or satellite), based on phase-correlation. Phase-correlations using Fourier-Melin Invariant (FMI) descriptors allow to retrieve the rigid transformation parameters in a fast and non-iterative way. The robustness to multi-sources images is improved by an enhanced image representation based on the gradient norm and the extrapolation of registration parameters between frames by motion estimation. A phase-correlation score, indicator of the registration quality, is introduced to regulate between motion estimation only and frame-toreference image registration. Our Robust Phase-Correlation registration algorithm using Motion Estimation (RPCME) is compared with state-of-the-art Mutual Information (MI) algorithm on two different airborne videos. RPCME algorithm registered most of the frames accurately, retrieving much better orientation than MI. Our algorithm shows robustness and good accuracy to multisource images with the advantage of being a direct (non-iterative) method

Infoscience - École polytechnique fédérale de Lausanne

Crossref

Simplex minimisation for multiple-reference motion estimation

Author: Al-Mualla MES
Bull DR
Canagarajah CN
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/1999
Field of study

Explore Bristol Research

Semi-hierarchical based motion estimation algorithm for the dirac video encoder

Author: Cosmas J
Loo KK
Tun M
Publication venue: 'World Scientific and Engineering Academy and Society (WSEAS)'
Publication date: 01/01/2008
Field of study

Having fast and efficient motion estimation is crucial in today’s advance video compression technique since it determines the compression efficiency and the complexity of a video encoder. In this paper, a method which we call semi-hierarchical motion estimation is proposed for the Dirac video encoder. By considering the fully hierarchical motion estimation only for a certain type of inter frame encoding, complexity of the motion estimation can be greatly reduced while maintaining the desirable accuracy. The experimental results show that the proposed algorithm gives two to three times reduction in terms of the number of SAD calculation compared with existing motion estimation algorithm of Dirac for the same motion estimation accuracy, compression efficiency and PSNR performance. Moreover, depending upon the complexity of the test sequence, the proposed algorithm has the ability to increase or decrease the search range in order to maintain the accuracy of the motion estimation to a certain level

Middlesex University Research Repository

Brunel University Research Archive

Real Time Turbulent Video Perfecting by Image Stabilization and Super-Resolution

Author: A. Mitiche
A.T. Mohammed
B. Cohen
B. Ellerbroek
B. Horn
B.M. Welsh
B.R. Frieden
Barak Fishbain
D. Sadot
D.G. Sheppard
H.H. Nagel
Ianir A. Ideses
J. Weickert
L. Alvarez
L.J. Barron
L.P. Yaroslavsky
L.P. Yaroslavsky
Leonid P. Yaroslavsky
S.C. Cheung
Y. Glick
Y. Wang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 25/04/2007
Field of study

Image and video quality in Long Range Observation Systems (LOROS) suffer from atmospheric turbulence that causes small neighbourhoods in image frames to chaotically move in different directions and substantially hampers visual analysis of such image and video sequences. The paper presents a real-time algorithm for perfecting turbulence degraded videos by means of stabilization and resolution enhancement. The latter is achieved by exploiting the turbulent motion. The algorithm involves generation of a reference frame and estimation, for each incoming video frame, of a local image displacement map with respect to the reference frame; segmentation of the displacement map into two classes: stationary and moving objects and resolution enhancement of stationary objects, while preserving real motion. Experiments with synthetic and real-life sequences have shown that the enhanced videos, generated in real time, exhibit substantially better resolution and complete stabilization for stationary objects while retaining real motion.Comment: Submitted to The Seventh IASTED International Conference on Visualization, Imaging, and Image Processing (VIIP 2007) August, 2007 Palma de Mallorca, Spai

arXiv.org e-Print Archive

Crossref

3D high definition video coding on a GPU-based heterogeneous system

Author: Claver Jose M
De Cock Jan
Fernandez-Escribano Gerardo
Martinez Jose Luis
Pieters Bart
Rodriguez-Sanchez Rafael
Sanchez Jose L
Van de Walle Rik
Publication venue: 'Elsevier BV'
Publication date: 01/01/2013
Field of study

H.264/MVC is a standard for supporting the sensation of 3D, based on coding from 2 (stereo) to N views. H.264/MVC adopts many coding options inherited from single view H.264/AVC, and thus its complexity is even higher, mainly because the number of processing views is higher. In this manuscript, we aim at an efficient parallelization of the most computationally intensive video encoding module for stereo sequences. In particular, inter prediction and its collaborative execution on a heterogeneous platform. The proposal is based on an efficient dynamic load balancing algorithm and on breaking encoding dependencies. Experimental results demonstrate the proposed algorithm's ability to reduce the encoding time for different stereo high definition sequences. Speed-up values of up to 90× were obtained when compared with the reference encoder on the same platform. Moreover, the proposed algorithm also provides a more energy-efficient approach and hence requires less energy than the sequential reference algorith

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Ghent University Academic Bibliography

Repositori Institucional de la Universitat Jaume I

Flow-Guided Feature Aggregation for Video Object Detection

Author: Dai Jifeng
Wang Yujie
Wei Yichen
Yuan Lu
Zhu Xizhou
Publication venue
Publication date: 18/08/2017
Field of study

Extending state-of-the-art object detectors from image to video is challenging. The accuracy of detection suffers from degenerated object appearances in videos, e.g., motion blur, video defocus, rare poses, etc. Existing work attempts to exploit temporal information on box level, but such methods are not trained end-to-end. We present flow-guided feature aggregation, an accurate and end-to-end learning framework for video object detection. It leverages temporal coherence on feature level instead. It improves the per-frame features by aggregation of nearby features along the motion paths, and thus improves the video recognition accuracy. Our method significantly improves upon strong single-frame baselines in ImageNet VID, especially for more challenging fast moving objects. Our framework is principled, and on par with the best engineered systems winning the ImageNet VID challenges 2016, without additional bells-and-whistles. The proposed method, together with Deep Feature Flow, powered the winning entry of ImageNet VID challenges 2017. The code is available at https://github.com/msracver/Flow-Guided-Feature-Aggregation

arXiv.org e-Print Archive

Crossref

Fast Multi-frame Stereo Scene Flow with Motion Segmentation

Author: Sato Yoichi
Sinha Sudipta N.
Taniai Tatsunori
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 05/07/2017
Field of study

We propose a new multi-frame method for efficiently computing scene flow (dense depth and optical flow) and camera ego-motion for a dynamic scene observed from a moving stereo camera rig. Our technique also segments out moving objects from the rigid scene. In our method, we first estimate the disparity map and the 6-DOF camera motion using stereo matching and visual odometry. We then identify regions inconsistent with the estimated camera motion and compute per-pixel optical flow only at these regions. This flow proposal is fused with the camera motion-based flow proposal using fusion moves to obtain the final optical flow and motion segmentation. This unified framework benefits all four tasks - stereo, optical flow, visual odometry and motion segmentation leading to overall higher accuracy and efficiency. Our method is currently ranked third on the KITTI 2015 scene flow benchmark. Furthermore, our CPU implementation runs in 2-3 seconds per frame which is 1-3 orders of magnitude faster than the top six methods. We also report a thorough evaluation on challenging Sintel sequences with fast camera and object motion, where our method consistently outperforms OSF [Menze and Geiger, 2015], which is currently ranked second on the KITTI benchmark.Comment: 15 pages. To appear at IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2017). Our results were submitted to KITTI 2015 Stereo Scene Flow Benchmark in November 201

arXiv.org e-Print Archive

Crossref

Reduced-complexity multiview prediction scheme with content-adaptive disparity vector estimation

Author: Avci Aykut
De Cock Jan
De Smet Herbert
De Smet Jelle
Lambert Peter
Meuret Youri
Publication venue: 'SPIE-Intl Soc Optical Eng'
Publication date: 01/01/2012
Field of study

Ghent University Academic Bibliography