5,246 research outputs found
Differentiable Modelling of Percussive Audio with Transient and Spectral Synthesis
Differentiable digital signal processing (DDSP) techniques, including methods
for audio synthesis, have gained attention in recent years and lend themselves
to interpretability in the parameter space. However, current differentiable
synthesis methods have not explicitly sought to model the transient portion of
signals, which is important for percussive sounds. In this work, we present a
unified synthesis framework aiming to address transient generation and
percussive synthesis within a DDSP framework. To this end, we propose a model
for percussive synthesis that builds on sinusoidal modeling synthesis and
incorporates a modulated temporal convolutional network for transient
generation. We use a modified sinusoidal peak picking algorithm to generate
time-varying non-harmonic sinusoids and pair it with differentiable noise and
transient encoders that are jointly trained to reconstruct drumset sounds. We
compute a set of reconstruction metrics using a large dataset of acoustic and
electronic percussion samples that show that our method leads to improved onset
signal reconstruction for membranophone percussion instruments.Comment: To be published in The Proceedings of Forum Acusticum, Sep 2023,
Turin, Ital
Differentiable Modelling of Percussive Audio with Transient and Spectral Synthesis
Differentiable digital signal processing (DDSP) techniques, including methods for audio synthesis, have gained attention in recent years and lend themselves to interpretability in the parameter space. However, current differentiable synthesis methods have not explicitly sought to model the transient portion of signals, which is important for percussive sounds. In this work, we present a unified synthesis framework aiming to address transient generation and percussive synthesis within a DDSP framework. To this end, we propose a model for percussive synthesis that builds on sinusoidal modeling synthesis and incorporates a modulated temporal convolutional network for transient generation. We use a modified sinusoidal peak picking algorithm to generate time-varying non-harmonic sinusoids and pair it with differentiable noise and transient encoders that are jointly trained to reconstruct drumset sounds. We compute a set of reconstruction metrics using a large dataset of acoustic and electronic percussion samples that show that our method leads to improved onset signal reconstruction for membranophone percussion instruments
Multichannel high resolution NMF for modelling convolutive mixtures of non-stationary signals in the time-frequency domain
Several probabilistic models involving latent components have been proposed for modeling time-frequency (TF) representations of audio signals such as spectrograms, notably in the nonnegative matrix factorization (NMF) literature. Among them, the recent high-resolution NMF (HR-NMF) model is able to take both phases and local correlations in each frequency band into account, and its potential has been illustrated in applications such as source separation and audio inpainting. In this paper, HR-NMF is extended to multichannel signals and to convolutive mixtures. The new model can represent a variety of stationary and non-stationary signals, including autoregressive moving average (ARMA) processes and mixtures of damped sinusoids. A fast variational expectation-maximization (EM) algorithm is proposed to estimate the enhanced model. This algorithm is applied to piano signals, and proves capable of accurately modeling reverberation, restoring missing observations, and separating pure tones with close frequencies
Modeling and rendering for development of a virtual bone surgery system
A virtual bone surgery system is developed to provide the potential of a realistic, safe, and controllable environment for surgical education. It can be used for training in orthopedic surgery, as well as for planning and rehearsal of bone surgery procedures...Using the developed system, the user can perform virtual bone surgery by simultaneously seeing bone material removal through a graphic display device, feeling the force via a haptic deice, and hearing the sound of tool-bone interaction --Abstract, page iii
Audio Modeling based on Delayed Sinusoids
International audienceIn this work, we present an evolution of the DDS (Damped & Delayed Sinusoidal) model introduced within the framework of the general signal modeling. This model is named the Partial Damped & Delayed Sinusoidal (PDDS) model and takes into account a single time delay parameter for a set (sum) of damped sinusoids. This modi- ¯cation is more consistent with the transient audio modeling problem. We show the validity of this approach by compari- son with the well-known EDS (Exponentially Damped Sinu- soids) approach. Finally, the performances of three model high-resolution parameter estimation algorithms are com- pared on synthetic fast time-varying signals and on two typ- ical audio transients
- …