22 research outputs found

    An ECC Based Iterative Algorithm For Photometric Invariant Projective Registration

    Get PDF
    International audienceThe ability of an algorithm to accurately estimate the parameters of the geometric trans- formation which aligns two image profiles even in the presence of photometric distortions can be considered as a basic requirement in many computer vision applications. Projec- tive transformations constitute a general class which includes as special cases the affine, as well as the metric subclasses of transformations. In this paper the applicability of a recently proposed iterative algorithm, which uses the Enhanced Correlation Coefficient as a performance criterion, in the projective image registration problem is investigated. The main theoretical results concerning the proposed iterative algorithm are presented. Furthermore, the performance of the iterative algorithm in the presence of nonlinear photometric distortions is compared against the leading Lucas-Kanade algorithm and its simultaneous inverse compositional variant with the help of a series of experiments involving strong or weak geometric deformations, ideal and noisy conditions and even over-modelling of the warping process. Although under ideal conditions the proposed al- gorithm and simultaneous inverse compositional algorithm exhibit a similar performance and both outperform the Lucas-Kanade algorithm, under noisy conditions the proposed algorithm outperforms the other algorithms in convergence speed and accuracy, and exhibits robustness against photometric distortions

    Motion-Based Sign Language Video Summarization using Curvature and Torsion

    Full text link
    An interesting problem in many video-based applications is the generation of short synopses by selecting the most informative frames, a procedure which is known as video summarization. For sign language videos the benefits of using the tt-parameterized counterpart of the curvature of the 2-D signer's wrist trajectory to identify keyframes, have been recently reported in the literature. In this paper we extend these ideas by modeling the 3-D hand motion that is extracted from each frame of the video. To this end we propose a new informative function based on the tt-parameterized curvature and torsion of the 3-D trajectory. The method to characterize video frames as keyframes depends on whether the motion occurs in 2-D or 3-D space. Specifically, in the case of 3-D motion we look for the maxima of the harmonic mean of the curvature and torsion of the target's trajectory; in the planar motion case we seek for the maxima of the trajectory's curvature. The proposed 3-D feature is experimentally evaluated in applications of sign language videos on (1) objective measures using ground-truth keyframe annotations, (2) human-based evaluation of understanding, and (3) gloss classification and the results obtained are promising.Comment: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessibl

    A Generative Model for the Joint Registration of Multiple Point Sets

    Get PDF
    International audienceThis paper describes a probabilistic generative model and its associated algorithm to jointly register multiple point sets. The vast majority of state-of-the-art registration techniques select one of the sets as the ''model" and perform pairwise alignments between the other sets and this set. The main drawback of this mode of operation is that there is no guarantee that the model-set is free of noise and outliers, which contaminates the estimation of the registration parameters. Unlike previous work, the proposed method treats all the point sets on an equal footing: they are realizations of a Gaussian mixture (GMM) and the registration is cast into a clustering problem. We formally derive an EM algorithm that estimates both the GMM parameters and the rotations and translations that map each individual set onto the ''central" model. The mixture means play the role of the registered set of points while the variances provide rich information about the quality of the registration. We thoroughly validate the proposed method with challenging datasets, we compare it with several state-of-the-art methods, and we show its potential for fusing real depth data

    From Pillars to AI Technology-Based Forest Fire Protection Systems

    Get PDF
    The importance of forest environment in the perspective of the biodiversity as well as from the economic resources which forests enclose, is more than evident. Any threat posed to this critical component of the environment should be identified and attacked through the use of the most efficient available technological means. Early warning and immediate response to a fire event are critical in avoiding great environmental damages. Fire risk assessment, reliable detection and localization of fire as well as motion planning, constitute the most vital ingredients of a fire protection system. In this chapter, we review the evolution of the forest fire protection systems and emphasize on open issues and the improvements that can be achieved using artificial intelligence technology. We start our tour from the pillars which were for a long time period, the only possible method to oversee the forest fires. Then, we will proceed to the exploration of early AI systems and will end-up with nowadays systems that might receive multimodal data from satellites, optical and thermal sensors, smart phones and UAVs and use techniques that cover the spectrum from early signal processing algorithms to latest deep learning-based ones to achieving the ultimate goal

    Photometric Invariant Projevtive Registration by using ECC Maximization

    No full text
    International audienceThe ability of an algorithm to accurately estimate the parameters of the geometric transformation which aligns two image profiles even in the presence of photometric distortions can be considered as a basic requirement in many computer vision applications. Projective transformations constitute a general class which includes as special cases the affine, as well as the metric subclasses of transformations. In this paper the applicability of a recently proposed iterative algorithm, which uses the Enhanced Correlation Coefficient as a performance criterion, in the projective image registration problem is investigated. The main theoretical results concerning the iterative algorithm and an efficient approximation that leads to an optimal closed form solution (per iteration) are presented. Furthermore, the performance of the iterative algorithm in the presence of nonlinear photometric distortions is compared against the leading Lucas-Kanade algorithm by performing numerous simulations. In all cases the proposed algorithm outperforms the Lucas-Kanade algorithm in convergence speed and robustness against photometric distortions under ideal and noisy conditions

    Evangelidis, “An enhanced correlation-based method for stereo correspondence with subpixel accuracy

    Get PDF
    The invariance of the similarity measure in photometric distortions as well as its capability in producing subpixel accuracy are two desired and often required features in most stereo vision applications. In this paper we propose a new correlation-based measure which incorporates both mentioned requirements. Specifically, by using an appropriate interpolation scheme in the candidate windows of the matching image, and using the classical zero mean normalized cross correlation function, we introduce a suitable measure. Although the proposed measure is a nonlinear function of the sub-pixel displacement parameter, its maximization results in a closed form solution, resulting in reduced complexity for its use in matching techniques. Application of the proposed measure in a number of benchmark stereo pair images reveals its superiority over existing correlation-based techniques used for sub-pixel accuracy.

    Spectrum analysis of digital UPWM signals generated from random modulating signals

    No full text
    Abstract This work studies the spectrum of discrete-time Uniform-sampling pulse width modulation (UPWM) signals originating from stochastic input signals. It demonstrates that for ergodic input sequences of independent and identically distributed random variables, the Discrete Fourier Transform (DFT) of the UPWM signals can be directly estimated from the input signal’s statistics. Consequently, it is shown that if the input signal can be modeled as such a random sequence, only statistical information of the sequence is required for the accurate estimation of the DFT of the UPWM signal. This is achieved here by proving that the DFT estimators obtained by observation of the input sequence within a time window are consistent estimators of the DFT coefficients of the underlying random process. Moreover, for signals whose generalized probability density functions can be expressed as functions of a small number of parameters, the DFT coefficients can be estimated or even calculated via closed-form expressions with linear complexity. Examples are given for input signals derived from symmetric and asymmetric distributions. The results are validated by comparison with evaluations of the UPWM signal’s DFT via the Fast Fourier Transform (FFT). The proposed method provides a mathematical framework for the analysis and design of UPWM systems whose inputs have known statistical properties
    corecore