176 research outputs found
Advances In Internal Model Principle Control Theory
In this thesis, two advanced implementations of the internal model principle (IMP) are presented. The first is the identification of exponentially damped sinusoidal (EDS) signals with unknown parameters which are widely used to model audio signals. This application is developed in discrete time as a signal processing problem. An IMP based adaptive algorithm is developed for estimating two EDS parameters, the damping factor and frequency. The stability and convergence of this adaptive algorithm is analyzed based on a discrete time two time scale averaging theory. Simulation results demonstrate the identification performance of the proposed algorithm and verify its stability.
The second advanced implementation of the IMP control theory is the rejection of disturbances consisting of both predictable and unpredictable components. An IMP controller is used for rejecting predictable disturbances. But the phase lag introduced by the IMP controller limits the rejection capability of the wideband disturbance controller, which is used for attenuating unpredictable disturbance, such as white noise. A combination of open and closed-loop control strategy is presented. In the closed-loop mode, both controllers are active. Once the tracking error is insignificant, the input to the IMP controller is disconnected while its output control action is maintained. In the open loop mode, the wideband disturbance controller is made more aggressive for attenuating white noise. Depending on the level of the tracking error, the input to the IMP controller is connected intermittently. Thus the system switches between open and closed-loop modes.
A state feedback controller is designed as the wideband disturbance controller in this application. Two types of predictable disturbances are considered, constant and periodic. For a constant disturbance, an integral controller, the simplest IMP controller, is used. For a periodic disturbance with unknown frequencies, adaptive IMP controllers are used to estimate the frequencies before cancelling the disturbances. An extended multiple Lyapunov functions (MLF) theorem is developed for the stability analysis of this intermittent control strategy. Simulation results justify the optimal rejection performance of this switched control by comparing with two other traditional controllers
High-resolution sinusoidal analysis for resolving harmonic collisions in music audio signal processing
Many music signals can largely be considered an additive combination of
multiple sources, such as musical instruments or voice. If the musical sources
are pitched instruments, the spectra they produce are predominantly harmonic,
and are thus well suited to an additive sinusoidal model. However,
due to resolution limits inherent in time-frequency analyses, when the harmonics
of multiple sources occupy equivalent time-frequency regions, their
individual properties are additively combined in the time-frequency representation
of the mixed signal. Any such time-frequency point in a mixture
where multiple harmonics overlap produces a single observation from which
the contributions owed to each of the individual harmonics cannot be trivially
deduced. These overlaps are referred to as overlapping partials or harmonic
collisions. If one wishes to infer some information about individual sources in
music mixtures, the information carried in regions where collided harmonics
exist becomes unreliable due to interference from other sources. This interference
has ramifications in a variety of music signal processing applications
such as multiple fundamental frequency estimation, source separation, and
instrumentation identification.
This thesis addresses harmonic collisions in music signal processing applications.
As a solution to the harmonic collision problem, a class of signal
subspace-based high-resolution sinusoidal parameter estimators is explored.
Specifically, the direct matrix pencil method, or equivalently, the Estimation
of Signal Parameters via Rotational Invariance Techniques (ESPRIT)
method, is used with the goal of producing estimates of the salient parameters
of individual harmonics that occupy equivalent time-frequency regions. This
estimation method is adapted here to be applicable to time-varying signals
such as musical audio. While high-resolution methods have been previously
explored in the context of music signal processing, previous work has not
addressed whether or not such methods truly produce high-resolution sinusoidal parameter estimates in real-world music audio signals. Therefore, this
thesis answers the question of whether high-resolution sinusoidal parameter
estimators are really high-resolution for real music signals.
This work directly explores the capabilities of this form of sinusoidal parameter
estimation to resolve collided harmonics. The capabilities of this
analysis method are also explored in the context of music signal processing
applications. Potential benefits of high-resolution sinusoidal analysis are
examined in experiments involving multiple fundamental frequency estimation
and audio source separation. This work shows that there are indeed
benefits to high-resolution sinusoidal analysis in music signal processing applications,
especially when compared to methods that produce sinusoidal
parameter estimates based on more traditional time-frequency representations.
The benefits of this form of sinusoidal analysis are made most evident
in multiple fundamental frequency estimation applications, where substantial
performance gains are seen. High-resolution analysis in the context of
computational auditory scene analysis-based source separation shows similar
performance to existing comparable methods
Modeling and frequency tracking of marine mammal whistle calls
Submitted in partial fulfillment of the requirements for the degree of Master of Science at the Massachusetts Institute of Technology and the Woods Hole Oceanographic Institution February 2009Marine mammal whistle calls present an attractive medium for covert underwater
communications. High quality models of the whistle calls are needed in order to synthesize
natural-sounding whistles with embedded information. Since the whistle calls
are composed of frequency modulated harmonic tones, they are best modeled as a
weighted superposition of harmonically related sinusoids. Previous research with bottlenose
dolphin whistle calls has produced synthetic whistles that sound too “clean”
for use in a covert communications system. Due to the sensitivity of the human auditory
system, watermarking schemes that slightly modify the fundamental frequency
contour have good potential for producing natural-sounding whistles embedded with
retrievable watermarks. Structured total least squares is used with linear prediction
analysis to track the time-varying fundamental frequency and harmonic amplitude
contours throughout a whistle call. Simulation and experimental results demonstrate
the capability to accurately model bottlenose dolphin whistle calls and retrieve embedded
information from watermarked synthetic whistle calls. Different fundamental
frequency watermarking schemes are proposed based on their ability to produce natural
sounding synthetic whistles and yield suitable watermark detection and retrieval
Measuring gravitational waves from binary black hole coalescences: II. the waves' information and its extraction, with and without templates
We discuss the extraction of information from detected binary black hole
(BBH) coalescence gravitational waves, focusing on the merger phase that occurs
after the gradual inspiral and before the ringdown. Our results are: (1) If
numerical relativity simulations have not produced template merger waveforms
before BBH detections by LIGO/VIRGO, one can band-pass filter the merger waves.
For BBHs smaller than about 40 solar masses detected via their inspiral waves,
the band pass filtering signal to noise ratio indicates that the merger waves
should typically be just barely visible in the noise for initial and advanced
LIGO interferometers. (2) We derive an optimized (maximum likelihood) method
for extracting a best-fit merger waveform from the noisy detector output; one
"perpendicularly projects" this output onto a function space (specified using
wavelets) that incorporates our prior knowledge of the waveforms. An extension
of the method allows one to extract the BBH's two independent waveforms from
outputs of several interferometers. (3) If numerical relativists produce codes
for generating merger templates but running the codes is too expensive to allow
an extensive survey of the merger parameter space, then a coarse survey of this
parameter space, to determine the ranges of the several key parameters and to
explore several qualitative issues which we describe, would be useful for data
analysis purposes. (4) A complete set of templates could be used to test the
nonlinear dynamics of general relativity and to measure some of the binary
parameters. We estimate the number of bits of information obtainable from the
merger waves (about 10 to 60 for LIGO/VIRGO, up to 200 for LISA), estimate the
information loss due to template numerical errors or sparseness in the template
grid, and infer approximate requirements on template accuracy and spacing.Comment: 33 pages, Rextex 3.1 macros, no figures, submitted to Phys Rev
Towards Real-Time Non-Stationary Sinusoidal Modelling of Kick and Bass Sounds for Audio Analysis and Modification
Sinusoidal Modelling is a powerful and flexible parametric method for analysing and processing audio signals. These signals have an underlying structure that modern spectral models aim to exploit by separating the signal into sinusoidal, transient, and noise components. Each of these can then be modelled in a manner most appropriate to that component's inherent structure. The accuracy of the estimated parameters is directly related to the quality of the model's representation of the signal, and the assumptions made about its underlying structure. For sinusoidal models, these assumptions generally affect the non-stationary estimates related to amplitude and frequency modulations, and the type of amplitude change curve. This is especially true when using a single analysis frame in a non-overlapping framework, where biased estimates can result in discontinuities at frame boundaries. It is therefore desirable for such a model to distinguish between the shape of different amplitude changes and adapt the estimation of this accordingly.
Intra-frame amplitude change can be interpreted as a change in the windowing function applied to a stationary sinusoid, which can be estimated from the derivative of the phase with respect to frequency at magnitude peaks in the DFT spectrum. A method for measuring monotonic linear amplitude change from single-frame estimates using the first-order derivative of the phase with respect to frequency (approximated by the first-order difference) is presented, along with a method of distinguishing between linear and exponential amplitude change. An adaption of the popular matching pursuit algorithm for refining model parameters in a segmented framework has been investigated using a dictionary comprised of sinusoids with parameters varying slightly from model estimates, based on Modelled Pursuit (MoP).
Modelling of the residual signal using a segmented undecimated Wavelet Transform (segUWT) is presented. A generalisation for both the forward and inverse transforms, for delay compensations and overlap extensions for different lengths of Wavelets and the number of decomposition levels in an Overlap Save (OLS) implementation for dealing with convolution block-based artefacts is presented. This shift invariant implementation of the DWT is a popular tool for de-noising and shows promising results for the separation of transients from noise
Statistical models for natural sounds
It is important to understand the rich structure of natural sounds in order to solve important
tasks, like automatic speech recognition, and to understand auditory processing
in the brain. This thesis takes a step in this direction by characterising the statistics of
simple natural sounds. We focus on the statistics because perception often appears to
depend on them, rather than on the raw waveform. For example the perception of auditory
textures, like running water, wind, fire and rain, depends on summary-statistics,
like the rate of falling rain droplets, rather than on the exact details of the physical
source.
In order to analyse the statistics of sounds accurately it is necessary to improve a
number of traditional signal processing methods, including those for amplitude demodulation,
time-frequency analysis, and sub-band demodulation. These estimation tasks
are ill-posed and therefore it is natural to treat them as Bayesian inference problems.
The new probabilistic versions of these methods have several advantages. For example,
they perform more accurately on natural signals and are more robust to noise,
they can also fill-in missing sections of data, and provide error-bars. Furthermore,
free-parameters can be learned from the signal. Using these new algorithms we demonstrate
that the energy, sparsity, modulation depth and modulation time-scale in each
sub-band of a signal are critical statistics, together with the dependencies between the
sub-band modulators. In order to validate this claim, a model containing co-modulated
coloured noise carriers is shown to be capable of generating a range of realistic sounding
auditory textures.
Finally, we explored the connection between the statistics of natural sounds and perception.
We demonstrate that inference in the model for auditory textures qualitatively
replicates the primitive grouping rules that listeners use to understand simple acoustic
scenes. This suggests that the auditory system is optimised for the statistics of natural
sounds
Target Recognition Using Late-Time Returns from Ultra-Wideband, Short-Pulse Radar
The goal of this research is to develop algorithms that recognize targets by exploiting properties in the late-time resonance induced by ultra-wide band radar signals. A new variant of the Matrix Pencil Method algorithm is developed that identifies complex resonant frequencies present in the scattered signal. Kalman filters are developed to represent the dynamics of the signals scattered from several target types. The Multiple Model Adaptive Estimation algorithm uses the Kalman filters to recognize targets. The target recognition algorithm is shown to be successful in the presence of noise. The performance of the new algorithms is compared to that of previously published algorithms
Recommended from our members
Bayesian methods in music modelling
This thesis presents several hierarchical generative Bayesian models of musical signals designed to improve the accuracy of existing multiple pitch detection systems and other musical signal processing applications whilst remaining feasible for real-time computation. At the lowest level the signal is modelled as a set of overlapping sinusoidal basis functions. The parameters of these basis functions are built into a prior framework based on principles known from musical theory and the physics of musical instruments. The model of a musical note optionally includes phenomena such as frequency and amplitude modulations, damping, volume, timbre and inharmonicity. The occurrence of note onsets in a performance of a piece of music is controlled by an underlying tempo process and the alignment of the timings to the underlying score of the music.
A variety of applications are presented for these models under differing inference constraints. Where full Bayesian inference is possible, reversible-jump Markov Chain Monte Carlo is employed to estimate the number of notes and partial frequency components in each frame of music. We also use approximate techniques such as model selection criteria and variational Bayes methods for inference in situations where computation time is limited or the amount of data to be processed is large. For the higher level score parameters, greedy search and conditional modes algorithms are found to be sufficiently accurate.
We emphasize the links between the models and inference algorithms developed in this thesis with that in existing and parallel work, and demonstrate the effects of making modifications to these models both theoretically and by means of experimental results
- …