27 research outputs found

    Object representation and recognition

    Get PDF
    One of the primary functions of the human visual system is object recognition, an ability that allows us to relate the visual stimuli falling on our retinas to our knowledge of the world. For example, object recognition allows you to use knowledge of what an apple looks like to find it in the supermarket, to use knowledge of what a shark looks like to swim in th

    Structure-aware image denoising, super-resolution, and enhancement methods

    Get PDF
    Denoising, super-resolution and structure enhancement are classical image processing applications. The motive behind their existence is to aid our visual analysis of raw digital images. Despite tremendous progress in these fields, certain difficult problems are still open to research. For example, denoising and super-resolution techniques which possess all the following properties, are very scarce: They must preserve critical structures like corners, should be robust to the type of noise distribution, avoid undesirable artefacts, and also be fast. The area of structure enhancement also has an unresolved issue: Very little efforts have been put into designing models that can tackle anisotropic deformations in the image acquisition process. In this thesis, we design novel methods in the form of partial differential equations, patch-based approaches and variational models to overcome the aforementioned obstacles. In most cases, our methods outperform the existing approaches in both quality and speed, despite being applicable to a broader range of practical situations.Entrauschen, Superresolution und Strukturverbesserung sind klassische Anwendungen der Bildverarbeitung. Ihre Existenz bedingt sich in dem Bestreben, die visuelle Begutachtung digitaler Bildrohdaten zu unterstützen. Trotz erheblicher Fortschritte in diesen Feldern bedürfen bestimmte schwierige Probleme noch weiterer Forschung. So sind beispielsweise Entrauschungsund Superresolutionsverfahren, welche alle der folgenden Eingenschaften besitzen, sehr selten: die Erhaltung wichtiger Strukturen wie Ecken, Robustheit bezüglich der Rauschverteilung, Vermeidung unerwünschter Artefakte und niedrige Laufzeit. Auch im Gebiet der Strukturverbesserung liegt ein ungelöstes Problem vor: Bisher wurde nur sehr wenig Forschungsaufwand in die Entwicklung von Modellen investieret, welche anisotrope Deformationen in bildgebenden Verfahren bewältigen können. In dieser Arbeit entwerfen wir neue Methoden in Form von partiellen Differentialgleichungen, patch-basierten Ansätzen und Variationsmodellen um die oben erwähnten Hindernisse zu überwinden. In den meisten Fällen übertreffen unsere Methoden nicht nur qualitativ die bisher verwendeten Ansätze, sondern lösen die gestellten Aufgaben auch schneller. Zudem decken wir mit unseren Modellen einen breiteren Bereich praktischer Fragestellungen ab

    Development Of Image Analysis Methods Applied To Collagen Imaged With Different Techniques

    Get PDF
    This thesis is concerned with the development of analysis strategies for microscopic images of collagen, with the specific aim to minimise as far as possible the need for user input, thereby reducing potential bias. Images of collagen were chosen for analysis, as collagen is an easily available and biologically important molecule, whose organisation in tissues is of great interest due to its pivotal role in biomechanics, and its implication in many disease states. Quantifying its structure and organisation can facilitate better understanding of the physiological and mechanical role of collagen, as well as provide the basis for a more detailed knowledge of the structural foundation of the body. We had access to a limited number of collagen images from porcine sclera obtained by Atomic Force Microscopy, Polarised Light Microscopy, Second Harmonic Generation Microscopy and Transmission Electron Microscopy. The challenges in developing analysis strategies arose from collagen itself, from the intrinsic differences in the data gathered by different microscopies, and from the presence of artifacts. We developed successful methods to analyse data relevant to collagen characterisation from Atomic Force Microscopy, Polarised Light Microscopy and Second Harmonic Generation Microscopy images, and propose a quantification method where our analysis attempts were not successful. This thesis aims to contribute to a growing body of work in image analysis, which can be applied across disciplines wherever image data are used. It is hoped that further development and refinement of the work presented here may be the focus of future work

    The deep structure of Gaussian scale space images

    Get PDF
    In order to be able to deal with the discrete nature of images in a continuous way, one can use results of the mathematical field of 'distribution theory'. Under almost trivial assumptions, like 'we know nothing', one ends up with convolving the image with a Gaussian filter. In this manner scale is introduced by means of the filter's width. The ensemble of the image and its convolved versions at al scales is called a 'Gaussian scale space image'. The filter's main property is that the scale derivative equals the Laplacean of the spatial variables: it is the Greens function of the so-called Heat, or Diffusion, Equation. The investigation of the image all scales simultaneously is called 'deep structure'. In this thesis I focus on the behaviour of the elementary topological items 'spatial critical points' and 'iso-intensity manifolds'. The spatial critical points are traced over scale. Generically they are annihilated and sometimes created pair wise, involving extrema and saddles. The locations of these so-called 'catastrophe events' are calculated with sub-pixel precision. Regarded in the scale space image, these spatial critical points form one-dimensional manifolds, the so-called critical curves. A second type of critical points is formed by the scale space saddles. They are the only possible critical points in the scale space image. At these points the iso-intensity manifolds exhibit special behaviour: they consist of two touching parts, of which one intersects an extremum that is part of the critical curve containing the scale space saddle. This part of the manifold uniquely assigns an area in scale space to this extremum. The remaining part uniquely assigns it to 'other structure'. Since this can be repeated, automatically an algorithm is obtained that reveals the 'hidden' structure present in the scale space image. This topological structure can be hierarchically presented as a binary tree, enabling one to (de-)select parts of it, sweeping out parts, simplify, etc. This structure can easily be projected to the initial image resulting in an uncommitted 'pre-segmentation': a segmentation of the image based on the topological properties without any user-defined parameters or whatsoever. Investigation of non-generic catastrophes shows that symmetries can easily be dealt with. Furthermore, the appearance of creations is shown to be nothing but (instable) protuberances at critical curves. There is also biological inspiration for using a Gaussian scale space, since the visual system seems to use Gaussian-like filters: we are able of seeing and interpreting multi-scale

    Local, Semi-Local and Global Models for Texture, Object and Scene Recognition

    Get PDF
    This dissertation addresses the problems of recognizing textures, objects, and scenes in photographs. We present approaches to these recognition tasks that combine salient local image features with spatial relations and effective discriminative learning techniques. First, we introduce a bag of features image model for recognizing textured surfaces under a wide range of transformations, including viewpoint changes and non-rigid deformations. We present results of a large-scale comparative evaluation indicating that bags of features can be effective not only for texture, but also for object categization, even in the presence of substantial clutter and intra-class variation. We also show how to augment the purely local image representation with statistical co-occurrence relations between pairs of nearby features, and develop a learning and classification framework for the task of classifying individual features in a multi-texture image. Next, we present a more structured alternative to bags of features for object recognition, namely, an image representation based on semi-local parts, or groups of features characterized by stable appearance and geometric layout. Semi-local parts are automatically learned from small sets of unsegmented, cluttered images. Finally, we present a global method for recognizing scene categories that works by partitioning the image into increasingly fine sub-regions and computing histograms of local features found inside each sub-region. The resulting spatial pyramid representation demonstrates significantly improved performance on challenging scene categorization tasks

    On patient-specific wall stress analysis in abdominal aortic aneurysms

    Get PDF

    Statistical Medial Model dor Cardiac Segmentation and Morphometry

    Get PDF
    In biomedical image analysis, shape information can be utilized for many purposes. For example, irregular shape features can help identify diseases; shape features can help match different instances of anatomical structures for statistical comparison; and prior knowledge of the mean and possible variation of an anatomical structure\u27s shape can help segment a new example of this structure in noisy, low-contrast images. A good shape representation helps to improve the performance of the above techniques. The overall goal of the proposed research is to develop and evaluate methods for representing shapes of anatomical structures. The medial model is a shape representation method that models a 3D object by explicitly defining its skeleton (medial axis) and deriving the object\u27s boundary via inverse-skeletonization . This model represents shape compactly, and naturally expresses descriptive global shape features like thickening , bending , and elongation . However, its application in biomedical image analysis has been limited, and it has not yet been applied to the heart, which has a complex shape. In this thesis, I focus on developing efficient methods to construct the medial model, and apply it to solve biomedical image analysis problems. I propose a new 3D medial model which can be efficiently applied to complex shapes. The proposed medial model closely approximates the medial geometry along medial edge curves and medial branching curves by soft-penalty optimization and local correction. I further develop a scheme to perform model-based segmentation using a statistical medial model which incorporates prior shape and appearance information. The proposed medial models are applied to a series of image analysis tasks. The 2D medial model is applied to the corpus callosum which results in an improved alignment of the patterns of commissural connectivity compared to a volumetric registration method. The 3D medial model is used to describe the myocardium of the left and right ventricles, which provides detailed thickness maps characterizing different disease states. The model-based myocardium segmentation scheme is tested in a heterogeneous adult MRI dataset. Our segmentation experiments demonstrate that the statistical medial model can accurately segment the ventricular myocardium and provide useful parameters to characterize heart function

    Measuring and Modeling Fluid Dynamic Processes using Digital Image Sequence Analysis

    Get PDF
    In this thesis novel motion models have been developed and incorporated into an extended parameter estimation framework that allows to accurately estimate the parameters and regularize them if needed. The performance of this framework has been increased to real time and implemented on inexpensive graphics hardware. Confidence and situation measures have been designed to discard inaccurate estimates. A phase field approach was developed to estimate piecewise smooth motion while detecting object boundaries at the same time. These algorithmic improvements have been successfully applied to three areas of fluid dynamics: air-sea interaction, microfluidics and plant physiology. At the ocean surface, the fluxes of heat and momentum have been measured with thermographic techniques, both spatially and temporally highly resolved. These measurement techniques present milestones for research in air-sea interaction, where point measurements and particle based laboratory measurements represent the state-of-the art. Calculations were done with two models, both making complement assumptions. Still, results derived from both models agree remarkably well. Measurements were conducted in laboratory settings as well as in the field. Microfluidic flow was measured with a new approach to molecular tagging velocimetry that explicitly models Taylor dispersion. This has lead to an increase in accuracy and applicability. Inaccuracies and problems of previous approaches due to Taylor dispersion were successfully evaded. Ground truth test measurements have been conducted, proving the accuracy of this novel technique. For the first time, flow velocities were measured in the xylem of plant leaves with active thermography. This represents a technique for measuring these flows on extended leaf areas on free standing plants, minimizing the impact caused by the measurement. Ground truth measurements on perfused leafs were performed. Measurements were also conducted on free standing plants in a climatic chamber, to measure xylem flows and relate flow velocities to environmental parameter. With a cuvette, environmental factors were varied locally. These measurements underlined the sensitivity of the new approach. A linear relationship in between flow rates and xylem diameter was found

    Marker-free human motion capture in dynamic cluttered environments from a single view-point

    Get PDF
    Human Motion Capture is a widely used technique to obtain motion data for animation of virtual characters. Commercial optical motion capture systems are marker-based. This thesis is about marker-free motion capture. The pose and motion estimation of an observed person is carried out in an optimization framework for articulated objects. The motion function is formulated with kinematic chains consisting of rotations around arbitrary axes in 3D space. This formulation leads to a Nonlinear Least Squares problem, which is solved with gradient-based methods. With the formulation in this thesis the necessary derivatives can be derived analytically. This speeds up processing and increases accuracy. Different gradient based methods are compared to solve the Nonlinear Least Squares problem, which allows the integration of second order motion derivatives as well. The pose estimation requires correspondences between known model of the person and observed data. To obtain this model, a new method is developed, which fits a template model to a specific person from 6 posture images taken by a single camera. Various types of correspondences are integrated in the optimization simultaneously without making approximations to the motion or optimization function, namely 3D-3D correspondences from stereo algorithms and 3D-2D correspondences from image silhouettes and 2D point tracking. Of major importance for the developed methods is the processing time and robustness to cluttered and dynamic background. Experiments show, that complex motion with 24 degrees of freedom is track-able from a single stereo view until body parts get totally occluded. Further methods are developed to estimate pose from a single camera view with cluttered dynamic background. Similar to other work on 2D-3D pose estimation, correspondences between model and image silhouette of the person are established by analyzing the gray value gradient near the predicted model silhouette. To increase the accuracy of silhouette correspondences, color histograms for each body part are combined with image gradient search. The combination of 3D depth data and 2D image data is tested with depth data from a PMD camera (Photonic Mixer Device), which measures the depth to scene points by the time of flight of ligh
    corecore