21 research outputs found
Extended high-frequency bone conduction audiometry Calibration of bone conductor transducers in the conventional and extended high-frequency range
Objective: To monitor ototoxicity, air conduction (AC) extended high frequency (EHF) thresholds can be measured up to 16 kHz. However, conductive hearing loss might influence these results. This is unfortunate because the EHF thresholds are important to follow the impact of ototoxic medication during therapy. Therefore a suitable bone conduction (BC) transducer and norm values for EHF BC measurements are needed. Design: In this study three different BC transducers were used: the B71 (Radioear), the KH70 (Präcitronic), and the KLH96 (Westra). Hearing thresholds were measured from 0.125 to 16 kHz using AC transducers (Telephonics TDH39, Sennheiser HDA200), and BC thresholds from 0.25 to 8 kHz with the B71, and from 0.25 to 16 kHz with the KLH96 and KH70. Study sample: 60 ears of 30 normal hearing subjects were measured. Results: The KLH96 showed the highest output for the high frequencies, and distortion measurements were similar to the KH70. The results show that EHF measurements are possible using the KLH96 and KH70 bone conductors. Conclusion: EHF BC measurements are reliable when using the KLH96 and KH70 bone conductors. The extended force sensitivity of the used artificial mastoid should be determined for a proper EHF BC calibration
Comparison of three calibration procedures for free-field reference speech audiometry
Objective: In standards IEC 60645-1 and ANSI S3.6, the free-field equivalent earphone output level method is assumed as the reference for speech audiometry. Three calibration procedures for this method were compared in this study. Design: Speech audiometry was conducted with Dutch consonant-vowel-consonant words for the following conditions: 1. TDH39 earphones, 2. loudspeaker, and 3. free-field simulated with TDH39 earphones. The first calibration procedure was based on the empirically determined difference between the speech recognition threshold (SRT) with earphones and a loudspeaker. The second procedure was based on the theoretical free-field correction, derived from the known speech spectrum and the free-field to coupler difference. The third calibration procedure corresponded to the results of the free-field simulated speech material under earphones. Study sample: The sample included 20 normal hearing subjects. Results: The differences between the observed SRT in the free-field and earphone conditions and the free-field and simulated free-field conditions were 7.1 dB and 0.6 dB, respectively. Conclusion: The three calibration procedures for the free-field equivalent output method yielded approximately the same results, and therefore all appear to be useful for TDH39 earphones
The Extended Speech Transmission Index: Predicting speech intelligibility in fluctuating noise and reverberant rooms
The Speech Transmission Index (STI) is used to predict speech intelligibility in noise and reverberant environments. However, measurements and predictions in fluctuating noises lead to inaccuracies. In the current paper, the Extended Speech Transmission Index (ESTI) is presented in order to deal with these shortcomings. Speech intelligibility in normally hearing subjects was measured using stationary and fluctuating maskers. These results served to optimize model parameters. Data from the literature were then used to verify the ESTI-model. Model outcomes were accurate for stationary maskers, maskers with artificial fluctuations, and maskers with real life non-speech modulations. Maskers with speech-like characteristics introduced systematic errors in the model outcomes, probably due to a combination of modulation masking, context effects, and informational masking
A context-based model to predict the intelligibility of sentences in non-stationary noises
The context-based Extended Speech Transmission Index (cESTI) (van Schoonhoven et al., 2022, J. Acoust. Soc. Am. 151, 1404-1415) was successfully applied to predict the intelligibility of monosyllabic words with different degrees of context in interrupted noise. The current study aimed to use the same model for the prediction of sentence intelligibility in different types of non-stationary noise. The necessary context factors and transfer functions were based on values found in existing literature. The cESTI performed similar to or better than the original ESTI when noise had speech-like characteristics. We hypothesize that the remaining inaccuracies in model predictions can be attributed to the limits of the modelling approach with regard to mechanisms, such as modulation masking and informational masking
Extended speech intelligibility index for the prediction of the speech reception threshold in fluctuating noise
The extension to the speech intelligibility index (SII; ANSI S3.5-1997 (1997)) proposed by Rhebergen and Versfeld [Rhebergen, K.S., and Versfeld, N.J. (2005). J. Acoust. Soc. Am. 117(4), 2181-2192] is able to predict for normal-hearing listeners the speech intelligibility in both stationary and fluctuating noise maskers with reasonable accuracy. The extended SII model was validated with speech reception threshold (SRT) data from the literature. However, further validation is required and the present paper describes SRT experiments with nonstationary noise conditions that are critical to the extended model. From these data, it can be concluded that the extended SII model is able to predict the SRTs for the majority of conditions, but that predictions are better when the extended SII model includes a function to account for forward maskin
Prediction of the intelligibility for speech in real-life background noises for subjects with normal hearing
OBJECTIVES: The speech reception threshold (SRT) traditionally is measured in stationary noise that has the long-term average speech spectrum of the target speech. However, in real life the instantaneous spectrum of the background noise is likely to be different from the stationary long-term average speech spectrum noise. To gain more insight into the effect of real-life background noises on speech intelligibility, the SRT of listeners with normal hearing was measured in a set of noises that varied in both the spectral and the temporal domain. This article investigates the ability of the extended speech intelligibility index (ESII), proposed by Rhebergen et al. to account for SRTs in these real-life background noises. DESIGN: SRTs in noise were measured in 12 subjects with normal hearing. Interfering noises consisted of a variety of real-life noises, selected from a database, and chosen on the basis of their spectrotemporal differences. Measured SRTs were converted to ESII values and compared. Ideally, at threshold, ESII values should be the same, because the ESII represents the amount of speech information available to the listener. RESULTS: SRTs ranged from -6 dB SNR (in stationary noise) to -21 dB SNR (in machine gun noise). Conversion to ESII values resulted in an average value of 0.34, with a standard deviation of 0.06. SRT predictions with the ESII model were better than those obtained with the conventional SII (ANSI 53.5-1997) model. In case of interfering speech, the ESII model predictions were poorer, because additional, nonenergetic (informational) masking is thought to occur. CONCLUSIONS: For the present set of masking noises, being representative for a variety of real-life noises, the ESII model of Rhebergen et al. is able to predict the SRTs of subjects with normal hearing with reasonable accuracy. It may be concluded that the ESII model can provide valuable predictions for the speech intelligibility in some everyday situation
The dynamic range of speech, compression, and its effect on the speech reception threshold in stationary and interrupted noise
Changes in the speech reception threshold (SRT) after amplitude compression of speech or speech in noise may be due to changes in the dynamic range of the speech signal. However, current models set up to predict the speech intelligibility consider the dynamic range of speech to be fixed regardless of the type of compression. The present paper describes two experiments with normal-hearing subjects to examine the effect of the dynamic range on the SRT in stationary and interrupted noise after wide dynamic range compression. The dynamic range has been varied by compression or expansion of only the speech signal, leaving the masking noise unaltered, or by compression or expansion of the mixed speech-in-noise signal. The results show that compression affects the SRT, both in a positive or a negative direction, not only due to dynamic range but also due to distortion of the speech signa
Quantifying and modeling the acoustic effects of compression on speech in noise
In this presentation a method is proposed that is able to separate a speech signal out of a noise signal after processing of the signal through wide-dynamic-range compression (WDRC). This technique reconstructs the speech signal and noise signal sample by sample separately using the gain factor of the WDRC, and can be used to quantify the acoustic effects of WDRC in noise. It will be shown that this technique is more accurate than a frequently used inversion technique, because the method is not affected by phase shifts that introduce distortion products in the reconstructed speech signal. As a result, the acoustic effects of WDRC can be measured more accurately. In addition, this reconstruction method allows modeling the speech intelligibility after non-linear signal processing in the Speech Intelligibility Index. With the aid of Speech Reception Threshold data it will be shown that this approach can give a good account for most existing dat
Towards measuring the Speech Transmission Index in fluctuating noise: Accuracy and limitations
In the field of room acoustics, the modulation transfer function (MTF) can be used to predict speech intelligibility in stationary noise and reverberation and can be expressed in one single value: the Speech Transmission Index (STI). One drawback of the classical STI measurement method is that it is not validated for fluctuating background noise. As opposed to the classical measurement method, the MTF due to reverberation can also be calculated using an impulse response measurement. This indirect method presents an opportunity for STI measurements in fluctuating noise, and a first prerequisite is a reliable impulse response measurement. The conditions under which the impulse response can be measured with sufficient precision were investigated in the current study. Impulse response measurements were conducted using a sweep stimulus. Two experiments are discussed with variable absorption, different levels of stationary and fluctuating background noise, and different sweep levels. Additionally, simulations with different types of fluctuating noise were conducted in an attempt to extrapolate the experimental findings to other acoustical conditions. The experiments and simulations showed that a minimum impulse-to-noise ratio of +25 dB in fluctuating noise was needed. (C) 2017 Acoustical Society of Americ