Search CORE

103 research outputs found

Coding Strategies for Cochlear Implants Under Adverse Environments

Author: Tahmina Qudsia
Publication venue: UWM Digital Commons
Publication date: 01/05/2016
Field of study

Cochlear implants are electronic prosthetic devices that restores partial hearing in patients with severe to profound hearing loss. Although most coding strategies have significantly improved the perception of speech in quite listening conditions, there remains limitations on speech perception under adverse environments such as in background noise, reverberation and band-limited channels, and we propose strategies that improve the intelligibility of speech transmitted over the telephone networks, reverberated speech and speech in the presence of background noise. For telephone processed speech, we propose to examine the effects of adding low-frequency and high- frequency information to the band-limited telephone speech. Four listening conditions were designed to simulate the receiving frequency characteristics of telephone handsets. Results indicated improvement in cochlear implant and bimodal listening when telephone speech was augmented with high frequency information and therefore this study provides support for design of algorithms to extend the bandwidth towards higher frequencies. The results also indicated added benefit from hearing aids for bimodal listeners in all four types of listening conditions. Speech understanding in acoustically reverberant environments is always a difficult task for hearing impaired listeners. Reverberated sounds consists of direct sound, early reflections and late reflections. Late reflections are known to be detrimental to speech intelligibility. In this study, we propose a reverberation suppression strategy based on spectral subtraction to suppress the reverberant energies from late reflections. Results from listening tests for two reverberant conditions (RT60 = 0.3s and 1.0s) indicated significant improvement when stimuli was processed with SS strategy. The proposed strategy operates with little to no prior information on the signal and the room characteristics and therefore, can potentially be implemented in real-time CI speech processors. For speech in background noise, we propose a mechanism underlying the contribution of harmonics to the benefit of electroacoustic stimulations in cochlear implants. The proposed strategy is based on harmonic modeling and uses synthesis driven approach to synthesize the harmonics in voiced segments of speech. Based on objective measures, results indicated improvement in speech quality. This study warrants further work into development of algorithms to regenerate harmonics of voiced segments in the presence of noise

University of Wisconsin-Milwaukee

Non-Intrusive Speech Intelligibility Prediction

Author: Sørensen Charlotte
Publication venue: Aalborg Universitetsforlag
Publication date: 01/01/2019
Field of study

VBN

Koklear İmplant Konuşma İşlemcileri için Optimum Parametrelerin Objektif Ölçütler Kullanılarak Belirlenmesi

Author: Arslan Özkan
Publication venue: TUBITAK
Publication date: 01/01/2022
Field of study

In a cochlear implant (CI) speech processor, several parameters such as channel numbers, bandwidths, rectification type, and cutoff frequency play an important role in acquiring enhanced speech. The effective and general purpose CI approach has been a research topic for a long time. In this study, it is aimed to determine the optimum parameters for CI users by using different channel numbers (4, 8, 12, 16, and 22), rectification types (half and full) and cutoff frequencies (200, 250, 300, 350, and 400 Hz). The CI approaches have been tested on Turkish sentences which are taken from METU database. The optimum CI structure has been tested with objective quality that weighted spectral slope (WSS) and objective intelligibility measures such as short-term objective intelligibility (STOI) and perceptual evaluation of speech quality (PESQ). Experimental results show that 400 Hz cutoff frequency, full wave rectifier, and 16-channels CI approach give better quality and higher intelligibility scores than other CI approaches according to STOI, PESQ and WSS results. The proposed CI approach provides the ability to percept 91% of output vocoded Turkish speech for CI users. © 2022, TUBITAK. All rights reserved

Namik Kemal University Institutional Repository

Recommended from our members

Using recurrent neural networks to improve the perception of speech in non-stationary noise by people with cochlear implants.

Author: Carlyon Robert P
Goehring Tobias
Keshavarzi Mahmoud
Moore Brian CJ
Publication venue: J Acoust Soc Am
Publication date: 01/07/2019
Field of study

Speech-in-noise perception is a major problem for users of cochlear implants (CIs), especially with non-stationary background noise. Noise-reduction algorithms have produced benefits but relied on a priori information about the target speaker and/or background noise. A recurrent neural network (RNN) algorithm was developed for enhancing speech in non-stationary noise and its benefits were evaluated for speech perception, using both objective measures and experiments with CI simulations and CI users. The RNN was trained using speech from many talkers mixed with multi-talker or traffic noise recordings. Its performance was evaluated using speech from an unseen talker mixed with different noise recordings of the same class, either babble or traffic noise. Objective measures indicated benefits of using a recurrent over a feed-forward architecture, and predicted better speech intelligibility with than without the processing. The experimental results showed significantly improved intelligibility of speech in babble noise but not in traffic noise. CI subjects rated the processed stimuli as significantly better in terms of speech distortions, noise intrusiveness, and overall quality than unprocessed stimuli for both babble and traffic noise. These results extend previous findings for CI users to mostly unseen acoustic conditions with non-stationary noise

Apollo (Cambridge)

Pitch-based non-intrusive objective intelligibility prediction

Author: Boldt Jesper B.
Christensen Mads G.
Sorensen Charlotte
Xenaki Angeliki
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 16/06/2017
Field of study

Crossref

VBN

Deep learning-based denoising streamed from mobile phones improves speech-in-noise understanding for hearing aid users

Author: Berry Mark
Diehl Peter Udo
Hasemann Henning
Hofmann Veit M.
Kaya Müge
Kepp Kevin
Meyer-Rachner Paul
Pudszuhn Annett
Sattler Felix
Singer Yosef
Sprengel Elias
Vormann Matthias
Zilly Hannes
Zippel Marlene
Publication venue
Publication date: 22/08/2023
Field of study

The hearing loss of almost half a billion people is commonly treated with hearing aids. However, current hearing aids often do not work well in real-world noisy environments. We present a deep learning based denoising system that runs in real time on iPhone 7 and Samsung Galaxy S10 (25ms algorithmic latency). The denoised audio is streamed to the hearing aid, resulting in a total delay of around 75ms. In tests with hearing aid users having moderate to severe hearing loss, our denoising system improves audio across three tests: 1) listening for subjective audio ratings, 2) listening for objective speech intelligibility, and 3) live conversations in a noisy environment for subjective ratings. Subjective ratings increase by more than 40%, for both the listening test and the live conversation compared to a fitted hearing aid as a baseline. Speech reception thresholds, measuring speech understanding in noise, improve by 1.6 dB SRT. Ours is the first denoising system that is implemented on a mobile device, streamed directly to users' hearing aids using only a single channel as audio input while improving user satisfaction on all tested aspects, including speech intelligibility. This includes overall preference of the denoised and streamed signal over the hearing aid, thereby accepting the higher latency for the significant improvement in speech understanding

arXiv.org e-Print Archive

A non-intrusive method for estimating binaural speech intelligibility from noise-corrupted signals captured by a pair of microphones

Author: Cox TJ
Liu Q
Tang Y
Wang W
Publication venue: 'Elsevier BV'
Publication date: 01/02/2018
Field of study

A non-intrusive method is introduced to predict binaural speech intelligibility in noise directly from signals captured using a pair of microphones. The approach combines signal processing techniques in blind source separation and localisation, with an intrusive objective intelligibility measure (OIM). Therefore, unlike classic intrusive OIMs, this method does not require a clean reference speech signal and knowing the location of the sources to operate. The proposed approach is able to estimate intelligibility in stationary and fluctuating noises, when the noise masker is presented as a point or diffused source, and is spatially separated from the target speech source on a horizontal plane. The performance of the proposed method was evaluated in two rooms. When predicting subjective intelligibility measured as word recognition rate, this method showed reasonable predictive accuracy with correlation coefficients above 0.82, which is comparable to that of a reference intrusive OIM in most of the conditions. The proposed approach offers a solution for fast binaural intelligibility prediction, and therefore has practical potential to be deployed in situations where on-site speech intelligibility is a concern

University of Salford Institutional Repository

University of Surrey

Surrey Research Insight

Nonintrusive Speech Intelligibility Prediction Using Convolutional Neural Networks

Author: Haan Jan Mark De
Heidemann Andersen Asger
Jensen Jesper
Tan Zheng-Hua
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/10/2018
Field of study

VBN

Non-intrusive codebook-based intelligibility prediction

Author: Boldt Jesper Bünsow
Christensen Mads Græsbøll
Kavalekalam Mathew Shaji
Sørensen Charlotte
Xenaki Angeliki
Publication venue: 'Elsevier BV'
Publication date: 01/07/2018
Field of study

VBN

Electroacoustic Assessment of Hearing Aids and PSAPs

Author: Sheel Manan
Publication venue: Scholarship@Western
Publication date: 11/12/2018
Field of study

Hearing aids and personal sound amplification products (PSAPs) are commonly used assistive devices for treating hearing loss. Due to the diversity in the hardware and signal processing algorithms in these devices, comprehensive verification of their performance is essential. Existing standards for assistive hearing devices are primarily used for quality control purposes and do not quantify their performance in a perceptually-relevant manner. This thesis developed a comprehensive electroacoustic testing toolbox for hearing devices that encompasses both quality control and perceptually-relevant measures. In particular, a test sequence was developed to assess the effectiveness of noise reduction feature in assistive hearing devices. Several commercially-available hearing aids and PSAPs on the “best seller” list at Amazon.ca were evaluated using the toolbox. Key results include: (a) hearing aids differ in their noise reduction performance; (b) some of the popular PSAPs do not meet the ANSI standards and are capable of producing dangerous sound pressure levels; and (c) hearing aids performed better than PSAPs on perceptually-relevant metrics

Scholarship@Western