70 research outputs found

    Modulation Domain Image Processing

    Get PDF
    The classical Fourier transform is the cornerstone of traditional linearsignal and image processing. The discrete Fourier transform (DFT) and thefast Fourier transform (FFT) in particular led toprofound changes during the later decades of the last century in howwe analyze and process 1D and multi-dimensional signals.The Fourier transform represents a signal as an infinite superpositionof stationary sinusoids each of which has constant amplitude and constantfrequency. However, many important practical signals such as radar returnsand seismic waves are inherently nonstationary. Hence, more complextechniques such as the windowed Fourier transform and the wavelet transformwere invented to better capture nonstationary properties of these signals.In this dissertation, I studied an alternative nonstationary representationfor images, the 2D AM-FM model. In contrast to thestationary nature of the classical Fourier representation, the AM-FM modelrepresents an image as a finite sum of smoothly varying amplitudesand smoothly varying frequencies. The model has been applied successfullyin image processing applications such as image segmentation, texture analysis,and target tracking. However, these applications are limitedto \emph{analysis}, meaning that the computed AM and FM functionsare used as features for signal processing tasks such as classificationand recognition. For synthesis applications, few attempts have been madeto synthesize the original image from the AM and FM components. Nevertheless,these attempts were unstable and the synthesized results contained artifacts.The main reason is that the perfect reconstruction AM-FM image model waseither unavailable or unstable. Here, I constructed the first functionalperfect reconstruction AM-FM image transform that paves the way for AM-FMimage synthesis applications. The transform enables intuitive nonlinearimage filter designs in the modulation domain. I showed that these filtersprovide important advantages relative to traditional linear translation invariant filters.This dissertation addresses image processing operations in the nonlinearnonstationary modulation domain. In the modulation domain, an image is modeledas a sum of nonstationary amplitude modulation (AM) functions andnonstationary frequency modulation (FM) functions. I developeda theoretical framework for high fidelity signal and image modeling in themodulation domain, constructed an invertible multi-dimensional AM-FMtransform (xAMFM), and investigated practical signal processing applicationsof the transform. After developing the xAMFM, I investigated new imageprocessing operations that apply directly to the transformed AM and FMfunctions in the modulation domain. In addition, I introduced twoclasses of modulation domain image filters. These filters produceperceptually motivated signal processing results that are difficult orimpossible to obtain with traditional linear processing or spatial domainnonlinear approaches. Finally, I proposed three extensions of the AM-FMtransform and applied them in image analysis applications.The main original contributions of this dissertation include the following.- I proposed a perfect reconstruction FM algorithm. I used aleast-squares approach to recover the phase signal from itsgradient. In order to allow perfect reconstruction of the phase function, Ienforced an initial condition on the reconstructed phase. The perfectreconstruction FM algorithm plays a critical role in theoverall AM-FM transform.- I constructed a perfect reconstruction multi-dimensional filterbankby modifying the classical steerable pyramid. This modified filterbankensures a true multi-scale multi-orientation signal decomposition. Such adecomposition is required for a perceptually meaningful AM-FM imagerepresentation.- I rotated the partial Hilbert transform to alleviate ripplingartifacts in the computed AM and FM functions. This adjustment results inartifact free filtering results in the modulation domain.- I proposed the modulation domain image filtering framework. Iconstructed two classes of modulation domain filters. I showed that themodulation domain filters outperform traditional linear shiftinvariant (LSI) filters qualitatively and quantitatively in applicationssuch as selective orientation filtering, selective frequency filtering,and fundamental geometric image transformations.- I provided extensions of the AM-FM transform for image decompositionproblems. I illustrated that the AM-FM approach can successfullydecompose an image into coherent components such as textureand structural components.- I investigated the relationship between the two prominentAM-FM computational models, namely the partial Hilbert transformapproach (pHT) and the monogenic signal. The established relationshiphelps unify these two AM-FM algorithms.This dissertation lays a theoretical foundation for future nonlinearmodulation domain image processing applications. For the first time, onecan apply modulation domain filters to images to obtain predictableresults. The design of modulation domain filters is intuitive and simple,yet these filters produce superior results compared to those of pixeldomain LSI filters. Moreover, this dissertation opens up other research problems.For instance, classical image applications such as image segmentation andedge detection can be re-formulated in the modulation domain setting.Modulation domain based perceptual image and video quality assessment andimage compression are important future application areas for the fundamentalrepresentation results developed in this dissertation

    The monogenic synchrosqueezed wavelet transform: a tool for the decomposition/demodulation of AM–FM images

    Get PDF
    The synchrosqueezing method aims at decomposing 1D functions into superpositions of a small number of “Intrinsic Modes”, supposed to be well separated both in time and frequency. Based on the unidimensional wavelet transform and its reconstruction properties, the synchrosqueezing transform provides a powerful representation of multicomponent signals in the time–frequency plane, together with a reconstruction of each mode. In this paper, a bidimensional version of the synchrosqueezing transform is defined, by considering a well-adapted extension of the concept of analytic signal to images: the monogenic signal. We introduce the concept of “Intrinsic Monogenic Mode”, that is the bidimensional counterpart of the notion of Intrinsic Mode. We also investigate the properties of its associated Monogenic Wavelet Decomposition. This leads to a natural bivariate extension of the Synchrosqueezed Wavelet Transform, for decomposing and processing multicomponent images. Numerical tests validate the effectiveness of the method on synthetic and real images

    Vector extension of monogenic wavelets for geometric representation of color images

    No full text
    14 pagesInternational audienceMonogenic wavelets offer a geometric representation of grayscale images through an AM/FM model allowing invariance of coefficients to translations and rotations. The underlying concept of local phase includes a fine contour analysis into a coherent unified framework. Starting from a link with structure tensors, we propose a non-trivial extension of the monogenic framework to vector-valued signals to carry out a non marginal color monogenic wavelet transform. We also give a practical study of this new wavelet transform in the contexts of sparse representations and invariant analysis, which helps to understand the physical interpretation of coefficients and validates the interest of our theoretical construction

    On the Analysis and Decomposition of Intrinsically One-Dimensional Signals and their Superpositions

    Get PDF
    Computer and machine vision tasks can roughly be divided into a hierarchy of processing steps applied to input signals captured by a measuring device. In the case of image signals, the first stage in this hierarchy is also referred to as low-level vision or low-level image processing. The field of low-level image processing includes the mathematical description of signals in terms of certain local signal models. The choice of the signal model is often task dependent. A common task is the extraction of features from the signal. Since signals are subject to transformations, for example camera movements in the case of image signals, the features are supposed to fulfill the properties of invariance or equivariance with respect to these transformations. The chosen signal model should reflect these properties in terms of its parameters. This thesis contributes to the field of low-level vision. Local signal structures are represented by (sinusoidal) intrinsically one-dimensional signals and their superpositions. Each intrinsically one-dimensional signal consists of certain parameters such as orientation, amplitude, frequency and phase. If the affine group acts on these signals, the transformations induce a corresponding action in the parameter space of the signal model. Hence, it is reasonable, to estimate the model parameters in order to describe the invariant and equivariant features. The first and main contribution studies superpositions of intrinsically one-dimensional signals in the plane. The parameters of the signal are supposed to be extracted from the responses of linear shift invariant operators: the generalized Hilbert transform (Riesz transform) and its higher-order versions and the partial derivative operators. While well known signal representations, such as the monogenic signal, allow to obtain the local features amplitude, phase and orientation for a single intrinsically one-dimensional signal, there exists no general method to decompose superpositions of such signals into their corresponding features. A novel method for the decomposition of an arbitrary number of sinusoidal intrinsically one-dimensional signals in the plane is proposed. The responses of the higher-order generalized Hilbert transforms in the plane are interpreted as symmetric tensors, which allow to restate the decomposition problem as a symmetric tensor decomposition. Algorithms, examples and applications for the novel decomposition are provided. The second contribution studies curved intrinsically one-dimensional signals in the plane. This signal model introduces a new parameter, the curvature, and allows the representation of curved signal structures. Using the inverse stereographic projection to the sphere, these curved signals are locally identified with intrinsically one-dimensional signals in the three-dimensional Euclidean space and analyzed in terms of the generalized Hilbert transform and partial derivatives therein. The third contribution studies the generalized Hilbert transform in a non-Euclidean space, the two-sphere. The mathematical framework of Clifford analysis proposes a further generalization of the generalized Hilbert transform to the two-sphere in terms of the corresponding Cauchy kernel. Nonetheless, this transform lacks an intuitive interpretation in the frequency domain. A decomposition of the Cauchy kernel in terms of its spherical harmonics is provided. Its coefficients not only provide insights to the generalized Hilbert transform on the sphere, but also allow for fast implementations in terms of analogues of the convolution theorem on the sphere

    AM-FM methods for image and video processing

    Get PDF
    This dissertation is focused on the development of robust and efficient Amplitude-Modulation Frequency-Modulation (AM-FM) demodulation methods for image and video processing (there is currently a patent pending that covers the AM-FM methods and applications described in this dissertation). The motivation for this research lies in the wide number of image and video processing applications that can significantly benefit from this research. A number of potential applications are developed in the dissertation. First, a new, robust and efficient formulation for the instantaneous frequency (IF) estimation: a variable spacing, local quadratic phase method (VS-LQP) is presented. VS-LQP produces much more accurate results than current AM-FM methods. At significant noise levels (SNR \u3c 30dB), for single component images, the VS-LQP method produces better IF estimation results than methods using a multi-scale filterbank. At low noise levels (SNR \u3e 50dB), VS-LQP performs better when used in combination with a multi-scale filterbank. In all cases, VS-LQP outperforms the Quasi-Eigen Approximation algorithm by significant amounts (up to 20dB). New least squares reconstructions using AM-FM components from the input signal (image or video) are also presented. Three different reconstruction approaches are developed: (i) using AM-FM harmonics, (ii) using AM-FM components extracted from different scales and (iii) using AM-FM harmonics with the output of a low-pass filter. The image reconstruction methods provide perceptually lossless results with image quality index values bigger than 0.7 on average. The video reconstructions produced image quality index values, frame by frame, up to more than 0.7 using AM-FM components extracted from different scales. An application of the AM-FM method to retinal image analysis is also shown. This approach uses the instantaneous frequency magnitude and the instantaneous amplitude (IA) information to provide image features. The new AM-FM approach produced ROC area of 0.984 in classifying Risk 0 versus Risk 1, 0.95 in classifying Risk 0 versus Risk 2, 0.973 in classifying Risk 0 versus Risk 3 and 0.95 in classifying Risk 0 versus all images with any sign of Diabetic Retinopathy. An extension of the 2D AM-FM demodulation methods to three dimensions is also presented. New AM-FM methods for motion estimation are developed. The new motion estimation method provides three motion estimation equations per channel filter (AM, IF motion equations and a continuity equation). Applications of the method in motion tracking, trajectory estimation and for continuous-scale video searching are demonstrated. For each application, we discuss the advantages of the AM-FM methods over current approaches

    A Novel Representation for Two-dimensional Image Structures

    Get PDF
    This paper presents a novel approach towards two-dimensional (2D) image structures modeling. To obtain more degrees of freedom, a 2D image signal is embedded into a certain geometric algebra. Coupling methods of differential geometry, tensor algebra, monogenic signal and quadrature filter, we can design a general model for 2D structures as the monogenic extension of a curvature tensor. Based on it, a local representation for the intrinsically two-dimensional (i2D) structure is derived as the monogenic curvature signal. From it, independent features of local amplitude, phase and orientation are simultaneously extracted. Besides, a monogenic curvature scale-space can be built by applying a Poisson kernel to the monogenic curvature signal. Compared with the other related work, the remarkable advantage of our approach lies in the rotationally invariant phase evaluation of 2D structures in a multi-scale framework, which delivers access to phase-based processing in many computer vision tasks

    Multirate Frequency Transformations: Wideband AM-FM Demodulation with Applications to Signal Processing and Communications

    Get PDF
    The AM-FM (amplitude & frequency modulation) signal model finds numerous applications in image processing, communications, and speech processing. The traditional approaches towards demodulation of signals in this category are the analytic signal approach, frequency tracking, or the energy operator approach. These approaches however, assume that the amplitude and frequency components are slowly time-varying, e.g., narrowband and incur significant demodulation error in the wideband scenarios. In this thesis, we extend a two-stage approach towards wideband AM-FM demodulation that combines multirate frequency transformations (MFT) enacted through a combination of multirate systems with traditional demodulation techniques, e.g., the Teager-Kasiser energy operator demodulation (ESA) approach to large wideband to narrowband conversion factors. The MFT module comprises of multirate interpolation and heterodyning and converts the wideband AM-FM signal into a narrowband signal, while the demodulation module such as ESA demodulates the narrowband signal into constituent amplitude and frequency components that are then transformed back to yield estimates for the wideband signal. This MFT-ESA approach is then applied to the various problems of: (a) wideband image demodulation and fingerprint demodulation, where multidimensional energy separation is employed, (b) wideband first-formant demodulation in vowels, and (c) wideband CPM demodulation with partial response signaling, to demonstrate its validity in both monocomponent and multicomponent scenarios as an effective multicomponent AM-FM signal demodulation and analysis technique for image processing, speech processing, and communications based applications

    Algebraic Representation and Geometric Interpretation of Hilbert Transformed Signals

    Get PDF
    This thesis covers a fundamental problem of local phase based signal processing: the isotropic generalization of the classical one dimensional analytic signal (D. Gabor) to higher dimensional signal domains. The classical analytic signal extends a real valued one dimensional signal to a complex valued signal by means of the classical 1D Hilbert transform. This signal extension enables the complete analysis of local phase and local amplitude information for each frequency component in the sense of Fourier analysis. In case of two dimensional signal domains, e.g. for images, additional geometric information is required to characterize higher dimensional signals locally. The local geometric information is called orientation, which consists of the main orientation and apex angle for two superimposed one dimensional signals. The problem of two dimensional signal analysis is the fact that in general those signals could consist of an unlimited number of superimposed one dimensional signals with individual orientations. Local phase, amplitude and additional orientation information can be extracted by the monogenic signal (M. Felsberg and G. Sommer) which is always restricted to the subclass of intrinsically one dimensional signals, i.e. the class of signals which only make use of one degree of freedom within the embedding signal domain. In case of 2D images the monogenic signal enables the rotationally invariant analysis of lines and edges. In contrast to the 1D analytic signal the monogenic signal extends all real valued signals of dimension n to a (n+1) - dimensional vector valued monogenic signal by means of the generalized first order Hilbert transform, which is also known as the Riesz transform. The analytic signal and the monogenic signal show that a direct relation between analytical signals and their algebraic representation exists. This fact has motivated the work and the results of this thesis, namely the extension of the 2D monogenic signal to more general 2D analytic signals, their algebraic representation, and their most geometric embedding. In case of more general 2D signals the geometric algebra will be shown to be a natural representation, and the conformal space as the geometric embedding for the signal interpretation. In this thesis we present 2D analytic signals as generalizations of the 2D monogenic signal which now extend the original 2D signal to a multi-vector valued signal in homogeneous conformal space by means of higher order Hilbert transforms, and by means of a so called hybrid matrix geometric algebra representation. The 2D analytic signal and the more general multi-vector signal will be interpreted in conformal space which delivers a descriptive geometric interpretation and algebraic embedding of signals. In case of 2D image signals the 2D analytic signal and the multi-vector signal enable the rotationally invariant analysis of lines, edges, corners and junctions in one unified framework. Furthermore, additional local curvature can be determined by first order generalized Hilbert transforms without the need of derivatives. This so called conformal monogenic signal can be defined for any signal domain
    • …
    corecore