Search CORE

979 research outputs found

Audio Modeling based on Delayed Sinusoids

Author: Abed-Meraim Karim
Boyer Remy
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2004
Field of study

International audienceIn this work, we present an evolution of the DDS (Damped & Delayed Sinusoidal) model introduced within the framework of the general signal modeling. This model is named the Partial Damped & Delayed Sinusoidal (PDDS) model and takes into account a single time delay parameter for a set (sum) of damped sinusoids. This modi- ¯cation is more consistent with the transient audio modeling problem. We show the validity of this approach by compari- son with the well-known EDS (Exponentially Damped Sinu- soids) approach. Finally, the performances of three model high-resolution parameter estimation algorithms are com- pared on synthetic fast time-varying signals and on two typ- ical audio transients

HAL-CentraleSupelec

HAL-Rennes 1

Sound Source Separation

Author: Evangelista G
Marchand S
Plumbley MD
Vincent E
Publication venue: 'Wiley'
Publication date: 01/01/2011
Field of study

This is the author's accepted pre-print of the article, first published as G. Evangelista, S. Marchand, M. D. Plumbley and E. Vincent. Sound source separation. In U. Zölzer (ed.), DAFX: Digital Audio Effects, 2nd edition, Chapter 14, pp. 551-588. John Wiley & Sons, March 2011. ISBN 9781119991298. DOI: 10.1002/9781119991298.ch14file: Proof:e\EvangelistaMarchandPlumbleyV11-sound.pdf:PDF owner: markp timestamp: 2011.04.26file: Proof:e\EvangelistaMarchandPlumbleyV11-sound.pdf:PDF owner: markp timestamp: 2011.04.2

HAL-CentraleSupelec

INRIA a CCSD electronic archive server

Queen Mary Research Online

Surrey Research Insight

HAL-Rennes 1

The Sound Manifesto

Author: Bisnovatyi Ilia
O'Donnell Michael J.
Publication venue: 'SPIE-Intl Soc Optical Eng'
Publication date: 08/07/2000
Field of study

Computing practice today depends on visual output to drive almost all user interaction. Other senses, such as audition, may be totally neglected, or used tangentially, or used in highly restricted specialized ways. We have excellent audio rendering through D-A conversion, but we lack rich general facilities for modeling and manipulating sound comparable in quality and flexibility to graphics. We need co-ordinated research in several disciplines to improve the use of sound as an interactive information channel. Incremental and separate improvements in synthesis, analysis, speech processing, audiology, acoustics, music, etc. will not alone produce the radical progress that we seek in sonic practice. We also need to create a new central topic of study in digital audio research. The new topic will assimilate the contributions of different disciplines on a common foundation. The key central concept that we lack is sound as a general-purpose information channel. We must investigate the structure of this information channel, which is driven by the co-operative development of auditory perception and physical sound production. Particular audible encodings, such as speech and music, illuminate sonic information by example, but they are no more sufficient for a characterization than typography is sufficient for a characterization of visual information.Comment: To appear in the conference on Critical Technologies for the Future of Computing, part of SPIE's International Symposium on Optical Science and Technology, 30 July to 4 August 2000, San Diego, C

arXiv.org e-Print Archive

Crossref

Damped and delayed sinuosidal model for transient modeling

Author: Abed-Meraim Karim
Boyer Remy
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2005
Field of study

International audienceIn this work, we present the Damped and De- layed Sinusoidal (DDS) model, a generalization of the sinu- soidal model. This model takes into account an angular fre- quency, a damping factor, a phase, an amplitude and a time- delay parameter for each component. Two algorithms are introduced for the DDS parameter estimation using a sub- band processing approach. Finally, we derive the Cramer- Rao Bound (CRB) expression for the DDS model and a simulation-based performance analysis in the context of a noisy fast time-varying synthetic signal and in the audio transient signal modeling context

HAL-CentraleSupelec

HAL-Rennes 1

Asymptotic Performance for Delayed Exponential Process

Author: Abed-Meraim Karim
Boyer Remy
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2007
Field of study

International audienceThe damped and delayed sinusoidal (DDS) model can be defined as the sum of sinusoids whose waveforms can be damped and delayed. This model is suitable for compactly modeling short time events. This property is closely related to its ability to reduce the time-support of each sinusoidal component. In this correspondence, we derive exact and approximate asymptotic Cramér–Rao bounds (CRBs) for the DDS model. This analysis shows that this model has better, or at least similar, theoretical performance than the well-known exponentially damped sinusoidal (EDS) model. In particular, the performance in the DDS case is significantly improved compared to that of the EDS for closely spaced sinusoids thanks to the nonzero time delays. Consequently, we can exploit the advantageous properties of the DDS model and, in the same time, we can keep high theoretical model parameter estimation accuracy

HAL-CentraleSupelec

HAL-Rennes 1

Rate-Distortion Efficient Amplitude Modulated Sinusoidal Audio Coding

Author: Christensen Mads Græsbøll
van de Par S.
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2004
Field of study

VBN

Sparse Modeling of Grouped Line Spectra

Author: Kronvall Ted
Publication venue: Centre for Mathematical Sciences, Lund University
Publication date: 01/01/2015
Field of study

This licentiate thesis focuses on clustered parametric models for estimation of line spectra, when the spectral content of a signal source is assumed to exhibit some form of grouping. Different from previous parametric approaches, which generally require explicit knowledge of the model orders, this thesis exploits sparse modeling, where the orders are implicitly chosen. For line spectra, the non-linear parametric model is approximated by a linear system, containing an overcomplete basis of candidate frequencies, called a dictionary, and a large set of linear response variables that selects and weights the components in the dictionary. Frequency estimates are obtained by solving a convex optimization program, where the sum of squared residuals is minimized. To discourage overfitting and to infer certain structure in the solution, different convex penalty functions are introduced into the optimization. The cost trade-off between fit and penalty is set by some user parameters, as to approximate the true number of spectral lines in the signal, which implies that the response variable will be sparse, i.e., have few non-zero elements. Thus, instead of explicit model orders, the orders are implicitly set by this trade-off. For grouped variables, the dictionary is customized, and appropriate convex penalties selected, so that the solution becomes group sparse, i.e., has few groups with non-zero variables. In an array of sensors, the specific time-delays and attenuations will depend on the source and sensor positions. By modeling this, one may estimate the location of a source. In this thesis, a novel joint location and grouped frequency estimator is proposed, which exploits sparse modeling for both spectral and spatial estimates, showing robustness against sources with overlapping frequency content. For audio signals, this thesis uses two different features for clustering. Pitch is a perceptual property of sound that may be described by the harmonic model, i.e., by a group of spectral lines at integer multiples of a fundamental frequency, which we estimate by exploiting a novel adaptive total variation penalty. The other feature, chroma, is a concept in musical theory, collecting pitches at powers of 2 from each other into groups. Using a chroma dictionary, together with appropriate group sparse penalties, we propose an automatic transcription of the chroma content of a signal

Lund University Publications

Efficient Parametric Coding of Transients

Author: Christensen Mads Græsbøll
van de Par Steven
Publication venue
Publication date: 01/01/2006
Field of study

VBN

Multipath Parameter Estimation from OFDM Signals in Mobile Channels

Author: Alexander Paul
Grant Alex
Haley David
Letzepis Nick
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 14/11/2010
Field of study

We study multipath parameter estimation from orthogonal frequency division multiplex signals transmitted over doubly dispersive mobile radio channels. We are interested in cases where the transmission is long enough to suffer time selectivity, but short enough such that the time variation can be accurately modeled as depending only on per-tap linear phase variations due to Doppler effects. We therefore concentrate on the estimation of the complex gain, delay and Doppler offset of each tap of the multipath channel impulse response. We show that the frequency domain channel coefficients for an entire packet can be expressed as the superimposition of two-dimensional complex sinusoids. The maximum likelihood estimate requires solution of a multidimensional non-linear least squares problem, which is computationally infeasible in practice. We therefore propose a low complexity suboptimal solution based on iterative successive and parallel cancellation. First, initial delay/Doppler estimates are obtained via successive cancellation. These estimates are then refined using an iterative parallel cancellation procedure. We demonstrate via Monte Carlo simulations that the root mean squared error statistics of our estimator are very close to the Cramer-Rao lower bound of a single two-dimensional sinusoid in Gaussian noise.Comment: Submitted to IEEE Transactions on Wireless Communications (26 pages, 9 figures and 3 tables

arXiv.org e-Print Archive

Crossref