979 research outputs found
Audio Modeling based on Delayed Sinusoids
International audienceIn this work, we present an evolution of the DDS (Damped & Delayed Sinusoidal) model introduced within the framework of the general signal modeling. This model is named the Partial Damped & Delayed Sinusoidal (PDDS) model and takes into account a single time delay parameter for a set (sum) of damped sinusoids. This modi- ÂŻcation is more consistent with the transient audio modeling problem. We show the validity of this approach by compari- son with the well-known EDS (Exponentially Damped Sinu- soids) approach. Finally, the performances of three model high-resolution parameter estimation algorithms are com- pared on synthetic fast time-varying signals and on two typ- ical audio transients
Sound Source Separation
This is the author's accepted pre-print of the article, first published as G. Evangelista, S. Marchand, M. D. Plumbley and E. Vincent. Sound source separation. In U. Zölzer (ed.), DAFX: Digital Audio Effects, 2nd edition, Chapter 14, pp. 551-588. John Wiley & Sons, March 2011. ISBN 9781119991298. DOI: 10.1002/9781119991298.ch14file: Proof:e\EvangelistaMarchandPlumbleyV11-sound.pdf:PDF owner: markp timestamp: 2011.04.26file: Proof:e\EvangelistaMarchandPlumbleyV11-sound.pdf:PDF owner: markp timestamp: 2011.04.2
The Sound Manifesto
Computing practice today depends on visual output to drive almost all user
interaction. Other senses, such as audition, may be totally neglected, or used
tangentially, or used in highly restricted specialized ways. We have excellent
audio rendering through D-A conversion, but we lack rich general facilities for
modeling and manipulating sound comparable in quality and flexibility to
graphics. We need co-ordinated research in several disciplines to improve the
use of sound as an interactive information channel.
Incremental and separate improvements in synthesis, analysis, speech
processing, audiology, acoustics, music, etc. will not alone produce the
radical progress that we seek in sonic practice. We also need to create a new
central topic of study in digital audio research. The new topic will assimilate
the contributions of different disciplines on a common foundation. The key
central concept that we lack is sound as a general-purpose information channel.
We must investigate the structure of this information channel, which is driven
by the co-operative development of auditory perception and physical sound
production. Particular audible encodings, such as speech and music, illuminate
sonic information by example, but they are no more sufficient for a
characterization than typography is sufficient for a characterization of visual
information.Comment: To appear in the conference on Critical Technologies for the Future
of Computing, part of SPIE's International Symposium on Optical Science and
Technology, 30 July to 4 August 2000, San Diego, C
Damped and delayed sinuosidal model for transient modeling
International audienceIn this work, we present the Damped and De- layed Sinusoidal (DDS) model, a generalization of the sinu- soidal model. This model takes into account an angular fre- quency, a damping factor, a phase, an amplitude and a time- delay parameter for each component. Two algorithms are introduced for the DDS parameter estimation using a sub- band processing approach. Finally, we derive the Cramer- Rao Bound (CRB) expression for the DDS model and a simulation-based performance analysis in the context of a noisy fast time-varying synthetic signal and in the audio transient signal modeling context
Asymptotic Performance for Delayed Exponential Process
International audienceThe damped and delayed sinusoidal (DDS) model can be defined as the sum of sinusoids whose waveforms can be damped and delayed. This model is suitable for compactly modeling short time events. This property is closely related to its ability to reduce the time-support of each sinusoidal component. In this correspondence, we derive exact and approximate asymptotic Cramér–Rao bounds (CRBs) for the DDS model. This analysis shows that this model has better, or at least similar, theoretical performance than the well-known exponentially damped sinusoidal (EDS) model. In particular, the performance in the DDS case is significantly improved compared to that of the EDS for closely spaced sinusoids thanks to the nonzero time delays. Consequently, we can exploit the advantageous properties of the DDS model and, in the same time, we can keep high theoretical model parameter estimation accuracy
Sparse Modeling of Grouped Line Spectra
This licentiate thesis focuses on clustered parametric models for estimation of line spectra, when the spectral content of a signal source is assumed to exhibit some form of grouping. Different from previous parametric approaches, which generally require explicit knowledge of the model orders, this thesis exploits sparse modeling, where the orders are implicitly chosen. For line spectra, the non-linear parametric model is approximated by a linear system, containing an overcomplete basis of candidate frequencies, called a dictionary, and a large set of linear response variables that selects and weights the components in the dictionary. Frequency estimates are obtained by solving a convex optimization program, where the sum of squared residuals is minimized. To discourage overfitting and to infer certain structure in the solution, different convex penalty functions are introduced into the optimization. The cost trade-off between fit and penalty is set by some user parameters, as to approximate the true number of spectral lines in the signal, which implies that the response variable will be sparse, i.e., have few non-zero elements. Thus, instead of explicit model orders, the orders are implicitly set by this trade-off. For grouped variables, the dictionary is customized, and appropriate convex penalties selected, so that the solution becomes group sparse, i.e., has few groups with non-zero variables. In an array of sensors, the specific time-delays and attenuations will depend on the source and sensor positions. By modeling this, one may estimate the location of a source. In this thesis, a novel joint location and grouped frequency estimator is proposed, which exploits sparse modeling for both spectral and spatial estimates, showing robustness against sources with overlapping frequency content. For audio signals, this thesis uses two different features for clustering. Pitch is a perceptual property of sound that may be described by the harmonic model, i.e., by a group of spectral lines at integer multiples of a fundamental frequency, which we estimate by exploiting a novel adaptive total variation penalty. The other feature, chroma, is a concept in musical theory, collecting pitches at powers of 2 from each other into groups. Using a chroma dictionary, together with appropriate group sparse penalties, we propose an automatic transcription of the chroma content of a signal
Multipath Parameter Estimation from OFDM Signals in Mobile Channels
We study multipath parameter estimation from orthogonal frequency division
multiplex signals transmitted over doubly dispersive mobile radio channels. We
are interested in cases where the transmission is long enough to suffer time
selectivity, but short enough such that the time variation can be accurately
modeled as depending only on per-tap linear phase variations due to Doppler
effects. We therefore concentrate on the estimation of the complex gain, delay
and Doppler offset of each tap of the multipath channel impulse response. We
show that the frequency domain channel coefficients for an entire packet can be
expressed as the superimposition of two-dimensional complex sinusoids. The
maximum likelihood estimate requires solution of a multidimensional non-linear
least squares problem, which is computationally infeasible in practice. We
therefore propose a low complexity suboptimal solution based on iterative
successive and parallel cancellation. First, initial delay/Doppler estimates
are obtained via successive cancellation. These estimates are then refined
using an iterative parallel cancellation procedure. We demonstrate via Monte
Carlo simulations that the root mean squared error statistics of our estimator
are very close to the Cramer-Rao lower bound of a single two-dimensional
sinusoid in Gaussian noise.Comment: Submitted to IEEE Transactions on Wireless Communications (26 pages,
9 figures and 3 tables
- …