Search CORE

250 research outputs found

Perceptually Motivated Wavelet Packet Transform for Bioacoustic Signal Enhancement

Author: Cohen I.
Deller J. R.
Fu Q.
Jidong Tao
Michael T. Johnson
Osiejuk T. S.
Seyfarth R. M.
Shao Y.
Yao Ren
Publication venue: e-Publications@Marquette
Publication date: 01/07/2008
Field of study

A significant and often unavoidable problem in bioacoustic signal processing is the presence of background noise due to an adverse recording environment. This paper proposes a new bioacoustic signal enhancement technique which can be used on a wide range of species. The technique is based on a perceptually scaled wavelet packet decomposition using a species-specific Greenwood scale function. Spectral estimation techniques, similar to those used for human speech enhancement, are used for estimation of clean signal wavelet coefficients under an additive noise model. The new approach is compared to several other techniques, including basic bandpass filtering as well as classical speech enhancement methods such as spectral subtraction, Wiener filtering, and Ephraim–Malah filtering. Vocalizations recorded from several species are used for evaluation, including the ortolan bunting (Emberiza hortulana), rhesus monkey (Macaca mulatta), and humpback whale (Megaptera novaeanglia), with both additive white Gaussian noise and environment recording noise added across a range of signal-to-noise ratios (SNRs). Results, measured by both SNR and segmental SNR of the enhanced wave forms, indicate that the proposed method outperforms other approaches for a wide range of noise conditions

epublications@Marquette

Crossref

Kalman tracking of linear predictor and harmonic noise models for noisy speech enhancement

Author: Ben Milner
Boll
Chen
Deller
Ephraim
Ephraim
Ephraim
Ephraim
Esfandiar Zavarehei
Friedman
Griffin
Hansen
Ioannis Andrianakis
Jonathan Darch
Kalman
Lim
Lim
Paul White
Qin Yan
Rentzos
Saeed Vaseghi
Sameti
Secrest
Seltzer
Stylianou
Stylianou
Tucker
Turunen
Vaseghi
Weber
Yan
Publication venue: 'Elsevier BV'
Publication date: 01/01/2008
Field of study

This paper presents a speech enhancement method based on the tracking and denoising of the formants of a linear prediction (LP) model of the spectral envelope of speech and the parameters of a harmonic noise model (HNM) of its excitation. The main advantages of tracking and denoising the prominent energy contours of speech are the efficient use of the spectral and temporal structures of successive speech frames and a mitigation of processing artefact known as the ‘musical noise’ or ‘musical tones’.The formant-tracking linear prediction (FTLP) model estimation consists of three stages: (a) speech pre-cleaning based on a spectral amplitude estimation, (b) formant-tracking across successive speech frames using the Viterbi method, and (c) Kalman filtering of the formant trajectories across successive speech frames.The HNM parameters for the excitation signal comprise; voiced/unvoiced decision, the fundamental frequency, the harmonics’ amplitudes and the variance of the noise component of excitation. A frequency-domain pitch extraction method is proposed that searches for the peak signal to noise ratios (SNRs) at the harmonics. For each speech frame several pitch candidates are calculated. An estimate of the pitch trajectory across successive frames is obtained using a Viterbi decoder. The trajectories of the noisy excitation harmonics across successive speech frames are modeled and denoised using Kalman filters.The proposed method is used to deconstruct noisy speech, de-noise its model parameters and then reconstitute speech from its cleaned parts. Experimental evaluations show the performance gains of the formant tracking, pitch extraction and noise reduction stages

Crossref

Southampton (e-Prints Soton)

University of East Anglia digital repository

A NEW SPEECH ENHANCEMENT TECHNIQUE USING PERCEPTUAL CONSTRAINED SPECTRAL WEIGHTING FACTORS

Author: Muni Kumar T.
Rama Murthy M. B.
Rama Rao Ch. V.
Rao K Srinivasa
Publication venue: Institute for Project Management Pvt. Ltd
Publication date: 29/07/2020
Field of study

This paper deals with musical noise result from perceptual speech enhancement type algorithms and especially wiener filtering. Although perceptual speech enhancement methods perform better than the non perceptual methods, most of them still return annoying residual musical noise. This is due to the fact that if only noise above the noise masking threshold is filtered then noise below the noise masking threshold can become audible if its maskers are filtered. It can affect the performance of perceptual speech enhancement method that process audible noise only. In order to overcome this drawback here proposed a new speech enhancement technique. It aims to improve the quality of the enhanced speech signal provided by perceptual wiener filtering by controlling the latter via a second filter regarded as a psychoacoustically motivated weighting factor. The simulation results shows that the performance is improved compared to other perceptual speech enhancement method

Interscience Research Network

Amélioration psychoacoustique du filtrage de Wiener : quelques approches récentes et une nouvelle méthode

Author: Amehraye Asmaa
Pastor Dominique
Tamtaoui Ahmed
Publication venue: HAL CCSD
Publication date: 01/01/2007
Field of study

*Bruit musical, distorsion, filtre deWiener, psychoacoustique, signal de parol

Enhancing Audio Signal Quality and Learning Experience with Integrated Covariance Weiner Filtering in College Music Education

Author: Yang Mengna
Publication venue: Auricle Global Society of Education and Research
Publication date: 30/07/2023
Field of study

In recent years, computer music technology has become increasingly prevalent in college music education, offering new possibilities for creative expression and pedagogical approaches. This paper concentrated on the music education in the colleges with the application of integrated time and frequency filtering (ITFF) with Kalman integrated covariance Weiner filtering in college music education. The ITFF technique combines time and frequency domain analysis to enhance the quality and clarity of audio signals. By integrating the Kalman integrated covariance Weiner filtering, the ITFF method provides robust noise reduction and improved signal representation. This integrated approach enables music educators to effectively analyze and manipulate audio signals in real-time, fostering a more immersive and engaging learning environment for students. The findings of this study highlight the benefits and potential applications of ITFF with Kalman-integrated covariance Weiner filtering in college music education, including audio signal enhancement, sound synthesis, and interactive performance systems. The integration of computer music technology with advanced filtering techniques presents new opportunities for exploring sound, composition, and music production within an educational context

International Journal on Recent and Innovation Trends in Computing and Communication

Dereverberation and Denoising Techniques for ASR Applications

Author: Fernando Santana Pacheco
Rui Seara
Publication venue: 'IntechOpen'
Publication date: 01/11/2008
Field of study

IntechOpen

Crossref

A Study into Speech Enhancement Techniques in Adverse Environment

Author: Nahma Lara
Publication venue: Curtin University
Publication date: 01/01/2018
Field of study

This dissertation developed speech enhancement techniques that improve the speech quality in applications such as mobile communications, teleconferencing and smart loudspeakers. For these applications it is necessary to suppress noise and reverberation. Thus the contribution in this dissertation is twofold: single channel speech enhancement system which exploits the temporal and spectral diversity of the received microphone signal for noise suppression and multi-channel speech enhancement method with the ability to employ spatial diversity to reduce reverberation

espace@Curtin