5 research outputs found
Objective and Subjective Evaluation of Wideband Speech Quality
Traditional landline and cellular communications use a bandwidth of 300 - 3400 Hz for transmitting speech. This narrow bandwidth impacts quality, intelligibility and naturalness of transmitted speech. There is an impending change within the telecommunication industry towards using wider bandwidth speech, but the enlarged bandwidth also introduces a few challenges in speech processing. Echo and noise are two challenging issues in wideband telephony, due to increased perceptual sensitivity by users.
Subjective and/or objective measurements of speech quality are important in benchmarking speech processing algorithms and evaluating the effect of parameters like noise, echo, and delay in wideband telephony. Subjective measures include ratings of speech quality by listeners, whereas objective measures compute a metric based on the reference and degraded speech samples. While subjective quality ratings are the gold - standard\u27\u27, they are also time- and resource- consuming. An objective metric that correlates highly with subjective data is attractive, as it can act as a substitute for subjective quality scores in gauging the performance of different algorithms and devices.
This thesis reports results from a series of experiments on subjective and objective speech quality evaluation for wideband telephony applications. First, a custom wideband noise reduction database was created that contained speech samples corrupted by different background noises at different signal to noise ratios (SNRs) and processed by six different noise reduction algorithms. Comprehensive subjective evaluation of this database revealed an interaction between the algorithm performance, noise type and SNR. Several auditory-based objective metrics such as the Loudness Pattern Distortion (LPD) measure based on the Moore - Glasberg auditory model were evaluated in predicting the subjective scores. In addition, the performance of Bayesian Multivariate Regression Splines(BMLS) was also evaluated in terms of mapping the scores calculated by the objective metrics to the true quality scores. The combination of LPD and BMLS resulted in high correlation with the subjective scores and was used as a substitution for fine - tuning the noise reduction algorithms.
Second, the effect of echo and delay on the wideband speech was evaluated in both listening and conversational context, through both subjective and objective measures. A database containing speech samples corrupted by echo with different delay and frequency response characteristics was created, and was later used to collect subjective quality ratings. The LPD - BMLS objective metric was then validated using the subjective scores.
Third, to evaluate the effect of echo and delay in conversational context, a realtime simulator was developed. Pairs of subjects conversed over the simulated system and rated the quality of their conversations which were degraded by different amount of echo and delay. The quality scores were analysed and LPD+BMLS combination was found to be effective in predicting subjective impressions of quality for condition-averaged data
Sound quality effects of an adaptive nonlinear frequency compression processor with normal-hearing and hearing-impaired listeners
© 2019 American Academy of Audiology. All rights reserved. Background: Frequency lowering (FL) technology offers a means of improving audibility of high-frequency sounds. For some listeners, the benefit of such technology can be accompanied by a perceived degradation in sound quality, depending on the strength of the FL setting. Purpose: The studies presented in this article investigate the effect of a new type of FL signal processing for hearing aids, adaptive nonlinear frequency compression (ANFC), on subjective speech quality. Research Design: Listener ratings of sound quality were collected for speech stimuli processed with systematically varied fitting parameters. Study Sample: Study 1 included 40 normal-hearing (NH) adult and child listeners. Study 2 included 11 hearing-impaired (HI) adult and child listeners. HI listeners were fitted with laboratory-worn hearing aids for use during listening tasks. Intervention: Speech quality ratings were assessed across test conditions consisting of various strengths of static nonlinear frequency compression (NFC) and ANFC speech. Test conditions included those that were fine-tuned on an individual basis per hearing aid fitting and conditions that were modified to intentionally alter the sound quality of the signal. Data Collection and Analysis: Listeners rated speech quality using the MUlti Stimulus test with Hidden Reference and Anchor (MUSHRA) test paradigm. Ratings were analyzed for reliability and to compare results across conditions. Results: Results show that interrater reliability is high for both studies, indicating that NH and HI listeners from both adult and child age groups can reliably complete the MUSHRA task. Results comparing sound quality ratings across experimental conditions suggest that both the NH and HI listener groups rate the stimuli intended to have poor sound quality (e.g., anchors and the strongest available parameter settings) as having below-average sound quality ratings. A different trend in the results is reported when considering the other experimental conditions across the listener groups in the studies. Speech quality ratings measured with NH listeners improve as the strength of ANFC decreases, with a range of bad to good ratings reported, on average. Speech quality ratings measured with HI listeners are similar and above-average for many of the experimental stimuli, including those with fine-tuned NFC and ANFC parameters. Conclusions: Overall, HI listeners provide similar sound quality ratings when comparing static and adaptive forms of frequency compression, especially when considering the individualized parameter settings. These findings suggest that a range in settings may result in above-average sound quality for adults and children with hearing impairment. Furthermore, the fitter should fine-tune FL parameters for each individual listener, regardless of type of FL technology
Fitting frequency-lowering signal processing applying the American Academy of audiology pediatric amplification guideline: Updates and protocols
Background: Although guidelines for fitting hearing AIDS for children are well developed and have strong basis in evidence, specific protocols for fitting and verifying technologies can supplement such guidelines. One such technologyisfrequency-lowering signal processing. Children require accesstoabroad bandwidth of speech to detect and use all phonemes including female/s/. When access through conventional amplification is not possible, the use of frequency-lowering signal processing may be considered as a means to overcome limitations. Fitting and verification protocols are needed to better define candidacy determination and options for assessing and fine tuning frequency-lowering signal processing for individuals. Purpose: This work aims to (1) describe a set of calibrated phonemes that can be used to characterize the variation in different brands of frequency-lowering processors in hearing AIDS and the verification with these signals and (2) determine whether verification with these signal are predictive of perceptual changes associated with changes in the strength of frequency-lowering signal processing. Finally, we aimed to develop a fitting protocol for use in pediatric clinical practice. Study Sample: Study 1 used a sample of six hearing AIDS spanning four types of frequency lowering algorithms for an electroacoustic evaluation. Study 2 included 21 adults who had hearing loss (mean age 66 yr). Data Collection and Analysis: Simulated fricatives were designed to mimic the level and frequency shape of female fricatives extracted from two sources of speech. These signals were used to verify the frequency-lowering effects of four distinct types of frequency-lowering signal processors available in commercial hearing AIDS, and verification measures were compared to extracted fricatives made in a reference system. In a second study, the simulated fricatives were used within a probe microphone measurement system to verify a wide range of frequency compression settings in a commercial hearing aid, and 27 adult listeners were tested at each setting. The relation between the hearing aid verification measures and the listener\u27s ability to detect and discriminate between fricatives was examined. Results: Verification measures made with the simulated fricatives agreed to within 4 dB, on average, and tended to mimic the frequency response shape of fricatives presented in a running speech context. Some processors showed a greater aided response level for fricatives in running speech than fricatives presented in isolation. Results with listeners indicated that verified settings that provided a positive sensation level of /s/ and that maximized the frequency difference between /s/ and /f/ tended to have the best performance. Conclusions: Frequency-lowering signal processors have measureable effects on the high-frequency fricative content of speech, particularly female/s/. It is possible to measure these effects either with a simple strategy that presents an isolated simulated fricative and measures the aided frequency response or withamore complex system that extracts fricatives from running speech. For some processors, a more accurate result may be achieved with a running speech system. In listeners, the aided frequency location and sensation level of fricatives may be helpful in predicting whether a specific hearing aid fitting, with or without frequency-lowering, will support access to the fricatives of speech
Fitting noise management signal processing applying the American Academy of audiology pediatric amplification guideline: Verification protocols
Background: Although guidelines for fitting hearing AIDS for children are well developed and have strong basis in evidence, specific protocols for fitting and verifying some technologies are not always available. One such technology is noise management in children\u27s hearing AIDS. Children are frequently in highlevel and/or noisy environments, and many options for noise management exist in modern hearing AIDS. Verification protocols are needed to define specific test signals and levels for use in clinical practice. Purpose: This work aims to (1) describe the variation in different brands of noise reduction processors in hearing AIDS and the verification of these processors and (2) determine whether these differences are perceived by 13 children who have hearing loss. Finally, we aimed to develop a verification protocol for use in pediatric clinical practice. Study Sample: A set of hearing AIDS was tested using both clinically available test systems and a reference system, so that the impacts of noise reduction signal processing in hearing AIDS could be characterized for speech in a variety of background noises. A second set of hearing AIDS was tested across a range of audiograms and across two clinical verification systems to characterize the variance in clinical verification measurements. Finally, a set of hearing aid recordings that varied by type of noise reduction was rated for sound quality by children with hearing loss. Results: Significant variation across makes and models of hearing AIDS was observed in both the speed of noise reduction activation and the magnitude of noise reduction. Reference measures indicate that noise-only testing may overestimate noise reduction magnitude compared to speech-in-noise testing. Variation across clinical test signals was also observed, indicating that some test signals may be more successful than others for characterization of hearing aid noise reduction. Children provided different sound quality ratings across hearing AIDS, and for one hearing aid rated the sound quality as higher with the noise reduction system activated. Conclusions: Implications for clinical verification systems may be that greater standardization and the use of speech-in-noise test signals may improve the quality and consistency of noise reduction verification cross clinics. A suggested clinical protocol for verification of noise management in children\u27s hearing AIDS is suggested