356 research outputs found
Object recognition using multi-view imaging
Single view imaging data has been used in most previous research in computer vision and
image understanding and lots of techniques have been developed. Recently with the fast
development and dropping cost of multiple cameras, it has become possible to have many
more views to achieve image processing tasks. This thesis will consider how to use the
obtained multiple images in the application of target object recognition.
In this context, we present two algorithms for object recognition based on scale-
invariant feature points. The first is single view object recognition method (SOR), which
operates on single images and uses a chirality constraint to reduce the recognition errors
that arise when only a small number of feature points are matched. The procedure is
extended in the second multi-view object recognition algorithm (MOR) which operates on
a multi-view image sequence and, by tracking feature points using a dynamic programming
method in the plenoptic domain subject to the epipolar constraint, is able to fuse feature
point matches from all the available images, resulting in more robust recognition.
We evaluated these algorithms using a number of data sets of real images capturing
both indoor and outdoor scenes. We demonstrate that MOR is better than SOR particularly for noisy and low resolution images, and it is also able to recognize objects that are
partially occluded by combining it with some segmentation techniques
Area and length preserving geometric invariant scale-spaces
Caption title.Includes bibliographical references (p. 23-27).Supported by the National Science Foundation. DMS-8811084 ECS-9122106 Supported by the Air Force Office of Scientific Research. AFOSR-90-0024 Supported by the Army Research Office. DAAL03-91-G-0019 DAAL03-92-G-0115 Supported by the Rothschild Foundation-Yad Hanadiv.Guillermo Sapiro, Allen Tannenbaum
Differential invariant signatures and flows in computer vision : a symmetry group approach
Includes bibliographical references (p. 40-44).Supported by the National Science Foundation. DMS-9204192 DMS-8811084 ECS-9122106 Supported by the Air Force Office of Scientific Research. AFOSR-90-0024 Supported by the Rothschild Foundation-Yad Hanadiv and by Image Evolutions, Ltd.Peter J. Olver, Guillermo Sapiro, Allen Tannenbaum
Object recognition in infrared imagery using appearance-based methods
Abstract unavailable please refer to PD
Solving Correspondences for Non-Rigid Deformations
Projecte final de carrera realitzat en col.laboració amb l'IR
Combinatorial Solutions for Shape Optimization in Computer Vision
This thesis aims at solving so-called shape optimization problems, i.e. problems where the shape of some real-world entity is sought, by applying combinatorial algorithms. I present several advances in this field, all of them based on energy minimization. The addressed problems will become more intricate in the course of the thesis, starting from problems that are solved globally, then turning to problems where so far no global solutions are known. The first two chapters treat segmentation problems where the considered grouping criterion is directly derived from the image data. That is, the respective data terms do not involve any parameters to estimate. These problems will be solved globally. The first of these chapters treats the problem of unsupervised image segmentation where apart from the image there is no other user input. Here I will focus on a contour-based method and show how to integrate curvature regularity into a ratio-based optimization framework. The arising optimization problem is reduced to optimizing over the cycles in a product graph. This problem can be solved globally in polynomial, effectively linear time. As a consequence, the method does not depend on initialization and translational invariance is achieved. This is joint work with Daniel Cremers and Simon Masnou. I will then proceed to the integration of shape knowledge into the framework, while keeping translational invariance. This problem is again reduced to cycle-finding in a product graph. Being based on the alignment of shape points, the method actually uses a more sophisticated shape measure than most local approaches and still provides global optima. It readily extends to tracking problems and allows to solve some of them in real-time. I will present an extension to highly deformable shape models which can be included in the global optimization framework. This method simultaneously allows to decompose a shape into a set of deformable parts, based only on the input images. This is joint work with Daniel Cremers. In the second part segmentation is combined with so-called correspondence problems, i.e. the underlying grouping criterion is now based on correspondences that have to be inferred simultaneously. That is, in addition to inferring the shapes of objects, one now also tries to put into correspondence the points in several images. The arising problems become more intricate and are no longer optimized globally. This part is divided into two chapters. The first chapter treats the topic of real-time motion segmentation where objects are identified based on the observations that the respective points in the video will move coherently. Rather than pre-estimating motion, a single energy functional is minimized via alternating optimization. The main novelty lies in the real-time capability, which is achieved by exploiting a fast combinatorial segmentation algorithm. The results are furthermore improved by employing a probabilistic data term. This is joint work with Daniel Cremers. The final chapter presents a method for high resolution motion layer decomposition and was developed in combination with Daniel Cremers and Thomas Pock. Layer decomposition methods support the notion of a scene model, which allows to model occlusion and enforce temporal consistency. The contributions are twofold: from a practical point of view the proposed method allows to recover fine-detailed layer images by minimizing a single energy. This is achieved by integrating a super-resolution method into the layer decomposition framework. From a theoretical viewpoint the proposed method introduces layer-based regularity terms as well as a graph cut-based scheme to solve for the layer domains. The latter is combined with powerful continuous convex optimization techniques into an alternating minimization scheme. Lastly I want to mention that a significant part of this thesis is devoted to the recent trend of exploiting parallel architectures, in particular graphics cards: many combinatorial algorithms are easily parallelized. In Chapter 3 we will see a case where the standard algorithm is hard to parallelize, but easy for the respective problem instances
View Synthesis from Image and Video for Object Recognition Applications
Object recognition is one of the most important and successful applications in computer vision community. The varying appearances of the test object due to different poses or illumination conditions can make the object recognition problem very challenging. Using view synthesis techniques to generate pose-invariant or illumination-invariant images or videos of the test object is an appealing approach to alleviate the degrading recognition
performance due to non-canonical views or lighting conditions.
In this thesis, we first present a complete framework for better synthesis and understanding of the human pose from a limited number
of available silhouette images. Pose-normalized silhouette images are generated using an active virtual camera and an image based visual hull technique, with the silhouette turning function distance being used as the pose similarity measurement. In order to overcome the inability of the shape from silhouettes method to reonstruct concave regions for human postures, a view synthesis algorithm is proposed for articulating humans using visual hull and contour-based body part segmentation. These two components improve each other for
better performance through the correspondence across viewpoints built via the inner distance shape context measurement.
Face recognition under varying pose is a challenging problem, especially when illumination variations are also present. We propose two algorithms to address this scenario. For a single light source, we demonstrate a pose-normalized face synthesis approach on a
pixel-by-pixel basis from a single view by exploiting the bilateral symmetry of the human face. For more complicated illumination
condition, the spherical harmonic representation is extended to encode pose information. An efficient method is proposed for robust
face synthesis and recognition with a very compact training set.
Finally, we present an end-to-end moving object verification system for airborne video, wherein a homography based view synthesis algorithm is used to simultaneously handle the object's changes in aspect angle, depression angle, and resolution. Efficient integration of spatial and temporal model matching assures the robustness of the verification step. As a byproduct, a robust two camera tracking method using homography is also proposed and demonstrated using challenging surveillance video sequences
Similarity signature curves for forming periodic orbits in the Lorenz system
In this paper, we systematically investigate the short periodic orbits of the
Lorenz system by the aid of the similarity signature curve, and a novel method
to find the short-period orbits of the Lorenz system is proposed. The
similarity invariants are derived by the equivariant moving frame theory and
then the similarity signature curve occurs along with them. The similarity
signature curve of the Lorenz system presents a more regular behavior than the
original one. By combining the sliding window method, the quasi-periodic orbits
can be detected numerically, all periodic orbits with period in
the Lorenz system are found, and their period lengths and symbol sequences are
calculated
- …