41 research outputs found

    The multiresolution Fourier transform : a general purpose tool for image analysis

    Get PDF
    The extraction of meaningful features from an image forms an important area of image analysis. It enables the task of understanding visual information to be implemented in a coherent and well defined manner. However, although many of the traditional approaches to feature extraction have proved to be successful in specific areas, recent work has suggested that they do not provide sufficient generality when dealing with complex analysis problems such as those presented by natural images. This thesis considers the problem of deriving an image description which could form the basis of a more general approach to feature extraction. It is argued that an essential property of such a description is that it should have locality in both the spatial domain and in some classification space over a range of scales. Using the 2-d Fourier domain as a classification space, a number of image transforms that might provide the required description are investigated. These include combined representations such as a 2-d version of the short-time Fourier transform (STFT), and multiscale or pyramid representations such as the wavelet transform. However, it is shown that these are limited in their ability to provide sufficient locality in both domains and as such do not fulfill the requirement for generality. To overcome this limitation, an alternative approach is proposed in the form of the multiresolution Fourier transform (MFT). This has a hierarchical structure in which the outermost levels are the image and its discrete Fourier transform (DFT), whilst the intermediate levels are combined representations in space and spatial frequency. These levels are defined to be optimal in terms of locality and their resolution is such that within the transform as a whole there is a uniform variation in resolution between the spatial domain and the spatial frequency domain. This ensures that locality is provided in both domains over a range of scales. The MFT is also invertible and amenable to efficient computation via familiar signal processing techniques. Examples and experiments illustrating its properties are presented. The problem of extracting local image features such as lines and edges is then considered. A multiresolution image model based on these features is defined and it is shown that the MET provides an effective tool for estimating its parameters.. The model is also suitable for representing curves and a curve extraction algorithm is described. The results presented for synthetic and natural images compare favourably with existing methods. Furthermore, when coupled with the previous work in this area, they demonstrate that the MFT has the potential to provide a basis for the solution of general image analysis problems

    Multiresolution estimation of 2-d disparity using a frequency domain approach

    Get PDF
    An efficient algorithm for the estimation of the 2-d disparity between a pair of stereo images is presented. Phase based methods are extended to the case of 2-d disparities and shown to correspond to computing local correlation fields. These are derived at multiple scales via the frequency domain and a coarse-to-fine 'focusing' strategy determines the final disparity estimate. Fast implementation is achieved by using a generalised form of wavelet transform, the multiresolution Fourier transform (MFT), which enables efficient calculation of the local correlations. Results from initial experiments on random noise stereo pairs containing both 1-d and 2-d disparities, illustrate the potential of the approach

    Hierarchical descriptors for nonstationary 1 and 2 dimensional signal processing

    Get PDF
    The representation of signals with important local properties is considered. These signals have a degree of nonstationarity which is dependent upon the amount of localisation. Signal descriptors which seek to represent such signals must correspond in scale to the local properties in order to provide an efficient representation. A review is given of various methods that have been adopted. Although some of these have been used in specific applications, a general and computationally efficient representation is not available. A new descriptor is presented which seeks to fulfil this requirement. It combines the time and frequency representations of a signal in an optimal way by using basis functions which are maximally concentrated in both domains. This corresponds to representing the local properties of the signal. The descriptor adopts a hierarchical structure which incorporates multi-resolution in both domains so that the required amount of localisation can be determined for a given signal. This enables the descriptor to be generally applied since it is consistent with the concept of a nonstationary signal. Results of initial experiments on the descriptor are presented and the report concludes with a discussion of future investigations

    Stabilitet i fosterhjem. Internasjonal forskning om barnets behov for trygghet og forutsigbarhet

    Get PDF
    Formålet med rapporten har vært å beskrive hva som skyldes ustabilitet under fosterhjemsplasseringer, for dernest å foreslå ulike tiltak som kan bøte på denne ustabiliteten. Rapporten har tre ulike deler som tematiserer fosterhjem og stabilitet. Den første delen er introduksjonen, og forklarer hvilken rolle stabilitet har for barn under plassering. Stabilitet og arbeid med å etablere en familieliknende situasjon for barna blir argumentert for at er og har vært et grunnleggende siktemål for fosterhjemstjenesten i norsk barnevern i mange år. Den andre delen avdekker de ulike mekanismene som gjør seg gjeldende, og som medfører ustabilitet. Den tredje delen presenterer en serie med tiltak som man kan tenke seg at kan avhjelpe de situasjoner som leder fram til ustabilitet under plasseringen, og som ble utledet under del to

    Motion Segmentation Based on Integrated Region Layering and Motion Assignment

    No full text
    We describe a novel algorithm for segmenting image sequence frames into regions of pixels moving with coherent motion. It is based on fusing local grey level segmentations with motion estimates obtained using block and partial correlations. The key innovation is the method employed to assign motion labels to the grey level regions. This uses an explicit model of motion occlusion and uncovering based on boundary ownership which predicts the location of motioncompensated difference energy for a given labelling and depth ordering of adjacent regions. A significant advantage of the approach is that region layering is automatically generated with the best assignment. We incorporate the scheme into a global segmentation algorithm in which the local motion assignments are combined using a consistency criterion, leading to layered sets of connected sub-regions representing the segmented motion regions within the frame. Experiments demonstrate the approach is effective

    Estimating Dense Motion Fields Using a Local Two-Component Model

    No full text

    Integrated segmentation and depth ordering of motion layers in image sequences

    No full text
    We describe a method to segment and depth order motion layers simultaneously in an image sequence. Previous approaches have tended to ignore the depth ordering issue or treat it as a post-processing operation. We argue here that motion estimation and segmentation are crucially dependent on depth order and hence that the latter should form an integral part of any layering scheme. Using an explicit model of boundary ownership allowing simultaneous assignment of motions to regions and extraction of depth order, the method fuses colour region segmentations with motion estimates obtained via block correlation. The motion estimates are then updated using a depthdependent partial correlation. Experiments show the approach is effective.

    Tracking multiple animals in wildlife footage

    No full text
    We describe a method for tracking animals in wildlife footage. It uses a CONDENSATION particle filtering framework driven by learnt characteristics of specific animals. The key contribution is a periodic model of animal motion based on the relative positions over time of trackable features at significant body points. We also introduce techniques for maintaining a multimodal state density within the particle filter over time to enable consistent tracking of multiple animals. Initial experiments show that the approach has considerable potential. 1

    Analysis of structured texture using the multiresolution Fourier transform

    Get PDF
    A multiresolution approach to the analysis of structural texture is presented. The multiresolution Fourier transform (MFT) is utilized as a framework to derive a robust algorithm which estimates textural features over a range of spatial scales based on local frequency domain properties. A pair of centroids of local spectra are extracted to represent the dominant frequencies of underlying spatial regions which are equivariant to rotation and scaling. Based on these centroids, the relationship between two different local spectra is characterized by an affine transformation. Assessment of the estimated affine transform is made by normalized correlation, which also provides local phase shift information. Analysis and synthesis of both artificial and natural images demonstrate the capability of the algorithm
    corecore