1,396 research outputs found

    A survey of visual preprocessing and shape representation techniques

    Get PDF
    Many recent theories and methods proposed for visual preprocessing and shape representation are summarized. The survey brings together research from the fields of biology, psychology, computer science, electrical engineering, and most recently, neural networks. It was motivated by the need to preprocess images for a sparse distributed memory (SDM), but the techniques presented may also prove useful for applying other associative memories to visual pattern recognition. The material of this survey is divided into three sections: an overview of biological visual processing; methods of preprocessing (extracting parts of shape, texture, motion, and depth); and shape representation and recognition (form invariance, primitives and structural descriptions, and theories of attention)

    3D-POLY: A Robot Vision System for Recognizing Objects in Occluded Environments

    Get PDF
    The two factors that determine the time complexity associated with model-driven interpretation of range maps are: I) the particular strategy used for the generation of object hypotheses; and 2) the manner in which both the model and the sensed data are organized, data organization being a primary determinant of the efficiency of verification of a given hypothesis. In this report, we present 3D-POLY, a working system for recognizing objects in the presence of occlusion and against cluttered backgrounds. The time complexity of this system is only O(n2) for single object recognition, where n is the number of features on the object. The most novel aspect of this system is the manner in which the feature data are organized for the models. We use a data structure called the feature sphere for the purpose. We will present efficient algorithms for assigning a feature to its proper place on a feature sphere, and for extracting the neighbors of a given feature from the feature sphere representation. For hypothesis generation, we use local feature sets, a notion similar to those used before us by Bolles, Shirai and others. The combination of the feature sphere idea for streamlining verification and the local feature sets for hypothesis generation results in a system whose time complexity has a polynomial bound. In addition to recognizing objects in occluded environments, 3D-POLY also possesses model learning capability. Model learning consists of looking at a model object from different views and integrating the resulting information. The 3D-POLY system also contains utilities for range image segmentation and classification of scene surfaces

    Extraction robuste de primitives géométriques 3D dans un nuage de points et alignement basé sur les primitives

    Get PDF
    Dans ce projet, nous Ă©tudions les problĂšmes de rĂ©tro-ingĂ©nierie et de contrĂŽle de la qualitĂ© qui jouent un rĂŽle important dans la fabrication industrielle. La rĂ©tro-ingĂ©nierie tente de reconstruire un modĂšle 3D Ă  partir de nuages de points, qui s’apparente au problĂšme de la reconstruction de la surface 3D. Le contrĂŽle de la qualitĂ© est un processus dans lequel la qualitĂ© de tous les facteurs impliquĂ©s dans la production est abordĂ©e. En fait, les systĂšmes ci-dessus nĂ©cessitent beaucoup d’intervention de la part d’un utilisateur expĂ©rimentĂ©, rĂ©sultat souhaitĂ© est encore loin soit une automatisation complĂšte du processus. Par consĂ©quent, de nombreux dĂ©fis doivent encore ĂȘtre abordĂ©s pour atteindre ce rĂ©sultat hautement souhaitable en production automatisĂ©e. La premiĂšre question abordĂ©e dans la thĂšse consiste Ă  extraire les primitives gĂ©omĂ©triques 3D Ă  partir de nuages de points. Un cadre complet pour extraire plusieurs types de primitives Ă  partir de donnĂ©es 3D est proposĂ©. En particulier, une nouvelle mĂ©thode de validation est proposĂ©e pour Ă©valuer la qualitĂ© des primitives extraites. À la fin, toutes les primitives prĂ©sentes dans le nuage de points sont extraites avec les points de donnĂ©es associĂ©s et leurs paramĂštres descriptifs. Ces rĂ©sultats pourraient ĂȘtre utilisĂ©s dans diverses applications telles que la reconstruction de scĂšnes on d’édifices, la gĂ©omĂ©trie constructive et etc. La seconde question traiĂ©e dans ce travail porte sur l’alignement de deux ensembles de donnĂ©es 3D Ă  l’aide de primitives gĂ©omĂ©triques, qui sont considĂ©rĂ©es comme un nouveau descripteur robuste. L’idĂ©e d’utiliser les primitives pour l’alignement arrive Ă  surmonter plusieurs dĂ©fis rencontrĂ©s par les mĂ©thodes d’alignement existantes. Ce problĂšme d’alignement est une Ă©tape essentielle dans la modĂ©lisation 3D, la mise en registre, la rĂ©cupĂ©ration de modĂšles. Enfin, nous proposons Ă©galement une mĂ©thode automatique pour extraire les discontinutĂ©s Ă  partir de donnĂ©es 3D d’objets manufacturĂ©s. En intĂ©grant ces discontinutĂ©s au problĂšme d’alignement, il est possible d’établir automatiquement les correspondances entre primitives en utilisant l’appariement de graphes relationnels avec attributs. Nous avons expĂ©rimentĂ© tous les algorithmes proposĂ©s sur diffĂ©rents jeux de donnĂ©es synthĂ©tiques et rĂ©elles. Ces algorithmes ont non seulement rĂ©ussi Ă  accomplir leur tĂąches avec succĂšs mais se sont aussi avĂ©rĂ©s supĂ©rieus aux mĂ©thodes proposĂ©es dans la literature. Les rĂ©sultats prĂ©sentĂ©s dans le thĂšse pourraient s’avĂ©rĂ©r utilises Ă  plusieurs applications.In this research project, we address reverse engineering and quality control problems that play significant roles in industrial manufacturing. Reverse engineering attempts to rebuild a 3D model from the scanned data captured from a object, which is the problem similar to 3D surface reconstruction. Quality control is a process in which the quality of all factors involved in production is monitored and revised. In fact, the above systems currently require significant intervention from experienced users, and are thus still far from being fully automated. Therefore, many challenges still need to be addressed to achieve the desired performance for automated production. The first proposition of this thesis is to extract 3D geometric primitives from point clouds for reverse engineering and surface reconstruction. A complete framework to extract multiple types of primitives from 3D data is proposed. In particular, a novel validation method is also proposed to assess the quality of the extracted primitives. At the end, all primitives present in the point cloud are extracted with their associated data points and descriptive parameters. These results could be used in various applications such as scene and building reconstruction, constructive solid geometry, etc. The second proposition of the thesis is to align two 3D datasets using the extracted geometric primitives, which is introduced as a novel and robust descriptor. The idea of using primitives for alignment is addressed several challenges faced by existing registration methods. This alignment problem is an essential step in 3D modeling, registration and model retrieval. Finally, an automatic method to extract sharp features from 3D data of man-made objects is also proposed. By integrating the extracted sharp features into the alignment framework, it is possible implement automatic assignment of primitive correspondences using attribute relational graph matching. Each primitive is considered as a node of the graph and an attribute relational graph is created to provide a structural and relational description between primitives. We have experimented all the proposed algorithms on different synthetic and real scanned datasets. Our algorithms not only are successful in completing their tasks with good results but also outperform other methods. We believe that the contribution of them could be useful in many applications

    Mapping three-dimensional geological features from remotely-sensed images and digital elevation models.

    Get PDF
    Accurate mapping of geological structures is important in numerous applications, ranging from mineral exploration through to hydrogeological modelling. Remotely sensed data can provide synoptic views of study areas enabling mapping of geological units within the area. Structural information may be derived from such data using standard manual photo-geologic interpretation techniques, although these are often inaccurate and incomplete. The aim of this thesis is, therefore, to compile a suite of automated and interactive computer-based analysis routines, designed to help a the user map geological structure. These are examined and integrated in the context of an expert system. The data used in this study include Digital Elevation Model (DEM) and Airborne Thematic Mapper images, both with a spatial resolution of 5m, for a 5 x 5 km area surrounding Llyn Cow lyd, Snowdonia, North Wales. The geology of this area comprises folded and faulted Ordo vician sediments intruded throughout by dolerite sills, providing a stringent test for the automated and semi-automated procedures. The DEM is used to highlight geomorphological features which may represent surface expressions of the sub-surface geology. The DEM is created from digitized contours, for which kriging is found to provide the best interpolation routine, based on a number of quantitative measures. Lambertian shading and the creation of slope and change of slope datasets are shown to provide the most successful enhancement of DEMs, in terms of highlighting a range of key geomorphological features. The digital image data are used to identify rock outcrops as well as lithologically controlled features in the land cover. To this end, a series of standard spectral enhancements of the images is examined. In this respect, the least correlated 3 band composite and a principal component composite are shown to give the best visual discrimination of geological and vegetation cover types. Automatic edge detection (followed by line thinning and extraction) and manual interpretation techniques are used to identify a set of 'geological primitives' (linear or arc features representing lithological boundaries) within these data. Inclusion of the DEM data provides the three-dimensional co-ordinates of these primitives enabling a least-squares fit to be employed to calculate dip and strike values, based, initially, on the assumption of a simple, linearly dipping structural model. A very large number of scene 'primitives' is identified using these procedures, only some of which have geological significance. Knowledge-based rules are therefore used to identify the relevant. For example, rules are developed to identify lake edges, forest boundaries, forest tracks, rock-vegetation boundaries, and areas of geomorphological interest. Confidence in the geological significance of some of the geological primitives is increased where they are found independently in both the DEM and remotely sensed data. The dip and strike values derived in this way are compared to information taken from the published geological map for this area, as well as measurements taken in the field. Many results are shown to correspond closely to those taken from the map and in the field, with an error of < 1°. These data and rules are incorporated into an expert system which, initially, produces a simple model of the geological structure. The system also provides a graphical user interface for manual control and interpretation, where necessary. Although the system currently only allows a relatively simple structural model (linearly dipping with faulting), in the future it will be possible to extend the system to model more complex features, such as anticlines, synclines, thrusts, nappes, and igneous intrusions

    Cylinders extraction in non-oriented point clouds as a clustering problem

    Get PDF
    Finding geometric primitives in 3D point clouds is a fundamental task in many engineering applications such as robotics, autonomous-vehicles and automated industrial inspection. Among all solid shapes, cylinders are frequently found in a variety of scenes, comprising natural or man-made objects. Despite their ubiquitous presence, automated extraction and fitting can become challenging if performed ”in-the-wild”, when the number of primitives is unknown or the point cloud is noisy and not oriented. In this paper we pose the problem of extracting multiple cylinders in a scene by means of a Game-Theoretic inlier selection process exploiting the geometrical relations between pairs of axis candidates. First, we formulate the similarity between two possible cylinders considering the rigid motion aligning the two axes to the same line. This motion is represented with a unitary dual-quaternion so that the distance between two cylinders is induced by the length of the shortest geodesic path in SE(3). Then, a Game-Theoretical process exploits such similarity function to extract sets of primitives maximizing their inner mutual consensus. The outcome of the evolutionary process consists in a probability distribution over the sets of candidates (ie axes), which in turn is used to directly estimate the final cylinder parameters. An extensive experimental section shows that the proposed algorithm offers a high resilience to noise, since the process inherently discards inconsistent data. Compared to other methods, it does not need point normals and does not require a fine tuning of multiple parameters

    Advances in Stereo Vision

    Get PDF
    Stereopsis is a vision process whose geometrical foundation has been known for a long time, ever since the experiments by Wheatstone, in the 19th century. Nevertheless, its inner workings in biological organisms, as well as its emulation by computer systems, have proven elusive, and stereo vision remains a very active and challenging area of research nowadays. In this volume we have attempted to present a limited but relevant sample of the work being carried out in stereo vision, covering significant aspects both from the applied and from the theoretical standpoints

    View generated database

    Get PDF
    This document represents the final report for the View Generated Database (VGD) project, NAS7-1066. It documents the work done on the project up to the point at which all project work was terminated due to lack of project funds. The VGD was to provide the capability to accurately represent any real-world object or scene as a computer model. Such models include both an accurate spatial/geometric representation of surfaces of the object or scene, as well as any surface detail present on the object. Applications of such models are numerous, including acquisition and maintenance of work models for tele-autonomous systems, generation of accurate 3-D geometric/photometric models for various 3-D vision systems, and graphical models for realistic rendering of 3-D scenes via computer graphics

    3D sub-nanoscale imaging of unit cell doubling due to octahedral tilting and cation modulation in strained perovskite thin films

    Get PDF
    Determining the 3-dimensional crystallography of a material with sub-nanometre resolution is essential to understanding strain effects in epitaxial thin films. A new scanning transmission electron microscopy imaging technique is demonstrated that visualises the presence and strength of atomic movements leading to a period doubling of the unit cell along the beam direction, using the intensity in an extra Laue zone ring in the back focal plane recorded using a pixelated detector method. This method is used together with conventional atomic resolution imaging in the plane perpendicular to the beam direction to gain information about the 3D crystal structure in an epitaxial thin film of LaFeO3 sandwiched between a substrate of (111) SrTiO3 and a top layer of La0.7Sr0.3MnO3. It is found that a hitherto unreported structure of LaFeO3 is formed under the unusual combination of compressive strain and (111) growth, which is triclinic with a periodicity doubling from primitive perovskite along one of the three directions lying in the growth plane. This results from a combination of La-site modulation along the beam direction, and modulation of oxygen positions resulting from octahedral tilting. This transition to the period-doubled cell is suppressed near both the substrate and near the La0.7Sr0.3MnO3 top layer due to the clamping of the octahedral tilting by the absence of tilting in the substrate and due to an incompatible tilt pattern being present in the La0.7Sr0.3MnO3 layer. This work shows a rapid and easy way of scanning for such transitions in thin films or other systems where disorder-order transitions or domain structures may be present and does not require the use of atomic resolution imaging, and could be done on any scanning TEM instrument equipped with a suitable camera.Comment: Minor fixes, especially in reference

    Tele-Autonomous control involving contact

    Get PDF
    Object localization and its application in tele-autonomous systems are studied. Two object localization algorithms are presented together with the methods of extracting several important types of object features. The first algorithm is based on line-segment to line-segment matching. Line range sensors are used to extract line-segment features from an object. The extracted features are matched to corresponding model features to compute the location of the object. The inputs of the second algorithm are not limited only to the line features. Featured points (point to point matching) and featured unit direction vectors (vector to vector matching) can also be used as the inputs of the algorithm, and there is no upper limit on the number of the features inputed. The algorithm will allow the use of redundant features to find a better solution. The algorithm uses dual number quaternions to represent the position and orientation of an object and uses the least squares optimization method to find an optimal solution for the object's location. The advantage of using this representation is that the method solves for the location estimation by minimizing a single cost function associated with the sum of the orientation and position errors and thus has a better performance on the estimation, both in accuracy and speed, than that of other similar algorithms. The difficulties when the operator is controlling a remote robot to perform manipulation tasks are also discussed. The main problems facing the operator are time delays on the signal transmission and the uncertainties of the remote environment. How object localization techniques can be used together with other techniques such as predictor display and time desynchronization to help to overcome these difficulties are then discussed
    • 

    corecore