Search CORE

3,379 research outputs found

Audio Analysis/synthesis System

Author
Publication venue
Publication date
Field of study

A method and apparatus for the automatic analysis, synthesis and modification of audio signals, based on an overlap-add sinusoidal model, is disclosed. Automatic analysis of amplitude, frequency and phase parameters of the model is achieved using an analysis-by-synthesis procedure which incorporates successive approximation, yielding synthetic waveforms which are very good approximations to the original waveforms and are perceptually identical to the original sounds. A generalized overlap-add sinusoidal model is introduced which can modify audio signals without objectionable artifacts. In addition, a new approach to pitch-scale modification allows for the use of arbitrary spectral envelope estimates and addresses the problems of high-frequency loss and noise amplification encountered with prior art methods. The overlap-add synthesis method provides the ability to synthesize sounds with computational efficiency rivaling that of synthesis using the discrete short-time Fourier transform (DSTFT) while eliminating the modification artifacts associated with that method.Georgia Tech Research Corporatio

Scholarly Materials And Research @ Georgia Tech

Reconstruction-based speech enhancement from robust acoustic features

Author: Ahmadi
Ben Milner
Boll
Cappe
Carmona
Chen
Cohen
Darch
de Cheveigné
Ephraim
Ephraim
Gales
Gauvain
Gerkmann
Gonzalez
Hu
Hu
Hu
Jensen
Kawahara
Leggetter
Loizou
Makhoul
Martin
Martin
McAulay
Milner
Milner
Mohammadiha
Oppenheim
Paliwal
Philip Harding
Rangachari
Reynolds
Stylianou
Syrdal
Varga
Xiao
Yan
Zen
Publication venue: 'Elsevier BV'
Publication date: 17/10/2015
Field of study

This paper proposes a method of speech enhancement where a clean speech signal is reconstructed from a sinusoidal model of speech production and a set of acoustic speech features. The acoustic features are estimated from noisy speech and comprise, for each frame, a voicing classification (voiced, unvoiced or non-speech), fundamental frequency (for voiced frames) and spectral envelope. Rather than using different algorithms to estimate each parameter, a single statistical model is developed. This comprises a set of acoustic models and has similarity to the acoustic modelling used in speech recognition. This allows noise and speaker adaptation to be applied to acoustic feature estimation to improve robustness. Objective and subjective tests compare reconstruction-based enhancement with other methods of enhancement and show the proposed method to be highly effective at removing noise

Crossref

University of East Anglia digital repository

Sinusoidal masks for single channel speech separation

Author: Christensen Mads Græsbøll
Jensen Søren Holdt
Mowlaee Pejman
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 14/03/2010
Field of study

VBN

Reconstructing intelligible audio speech from visual speech features

Author: Le Cornu Thomas
Milner Ben
Publication venue
Publication date: 01/01/2015
Field of study

This work describes an investigation into the feasibility of producing intelligible audio speech from only visual speech fea- tures. The proposed method aims to estimate a spectral enve- lope from visual features which is then combined with an arti- ficial excitation signal and used within a model of speech pro- duction to reconstruct an audio signal. Different combinations of audio and visual features are considered, along with both a statistical method of estimation and a deep neural network. The intelligibility of the reconstructed audio speech is measured by human listeners, and then compared to the intelligibility of the video signal only and when combined with the reconstructed audio

University of East Anglia digital repository

Audio Inpainting

Author: Adler A
Elad M
Emiya V
Gribonval R
Jafari MG
Plumbley MD
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/03/2012
Field of study

(c) 2012 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other users, including reprinting/ republishing this material for advertising or promotional purposes, creating new collective works for resale or redistribution to servers or lists, or reuse of any copyrighted components of this work in other works. Published version: IEEE Transactions on Audio, Speech and Language Processing 20(3): 922-932, Mar 2012. DOI: 10.1090/TASL.2011.2168211

HAL-CentraleSupelec

CiteSeerX

INRIA a CCSD electronic archive server

Queen Mary Research Online

Surrey Research Insight

Hal-Diderot

HAL-Rennes 1

Enhancement of Single-Channel Periodic Signals in the Time-Domain

Author: Benesty Jacob
Christensen Mads Græsbøll
Jensen Jesper Rindom
Jensen Søren Holdt
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2012
Field of study

VBN