Search CORE

89 research outputs found

Predicting the Intelligibility of Noisy and Nonlinearly Processed Binaural Speech

Author: de Haan Jan Mark
Heidemann Andersen Asger
Jensen Jesper
Tan Zheng-Hua
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 16/07/2016
Field of study

VBN

Data-Driven Speech Intelligibility Prediction

Author: Pedersen Mathias
Publication venue: Aalborg Universitetsforlag
Publication date: 01/01/2023
Field of study

VBN

Modeling speech intelligibility based on the signal-to-noise envelope power ratio

Author: Jørgensen Søren
Publication venue: Technical University of Denmark, Department of Electrical Engineering
Publication date: 01/01/2014
Field of study

Online Research Database In Technology

A non-intrusive method for estimating binaural speech intelligibility from noise-corrupted signals captured by a pair of microphones

Author: Cox TJ
Liu Q
Tang Y
Wang W
Publication venue: 'Elsevier BV'
Publication date: 01/02/2018
Field of study

A non-intrusive method is introduced to predict binaural speech intelligibility in noise directly from signals captured using a pair of microphones. The approach combines signal processing techniques in blind source separation and localisation, with an intrusive objective intelligibility measure (OIM). Therefore, unlike classic intrusive OIMs, this method does not require a clean reference speech signal and knowing the location of the sources to operate. The proposed approach is able to estimate intelligibility in stationary and fluctuating noises, when the noise masker is presented as a point or diffused source, and is spatially separated from the target speech source on a horizontal plane. The performance of the proposed method was evaluated in two rooms. When predicting subjective intelligibility measured as word recognition rate, this method showed reasonable predictive accuracy with correlation coefficients above 0.82, which is comparable to that of a reference intrusive OIM in most of the conditions. The proposed approach offers a solution for fast binaural intelligibility prediction, and therefore has practical potential to be deployed in situations where on-site speech intelligibility is a concern

University of Salford Institutional Repository

University of Surrey

Surrey Research Insight

Speech Intelligibility Prediction for Hearing Aid Systems

Author: Heidemann Andersen Asger
Publication venue: Aalborg Universitetsforlag
Publication date: 01/01/2017
Field of study

VBN

End-to-end Speech Intelligibility Prediction Using Time-Domain Fully Convolutional Neural Networks

Author: Andersen Asger Heidemann
Jensen Jesper
Jensen Søren Holdt
Kolbæk Morten
Pedersen Mathias
Publication venue: 'International Speech Communication Association'
Publication date: 01/01/2020
Field of study

Crossref

VBN

Personalized signal-independent beamforming for binaural hearing aids

Author: A. Naylor Patrick
Brookes Mike
H. Moore Alastair
Jensen Jesper
Mark de Haan Jan
Syskind Pedersen Michael
Publication venue: 'Acoustical Society of America (ASA)'
Publication date: 01/05/2019
Field of study

Crossref

VBN

Using a single-channel reference with the MBSTOI binaural intelligibility metric

Author: Brookes M
Guiraud P
Moore AH
Naylor PA
Vos RR
Publication venue: 'Elsevier BV'
Publication date: 06/03/2023
Field of study

In order to assess the intelligibility of a target signal in a noisy environment, intrusive speech intelligibility metrics are typically used. They require a clean reference signal to be available which can be difficult to obtain especially for binaural metrics like the modified binaural short time objective intelligibility metric (MBSTOI). We here present a hybrid version of MBSTOI that incorporates a deep learning stage that allows the metric to be computed with only a single-channel clean reference signal. The models presented are trained on simulated data containing target speech, localised noise, diffuse noise, and reverberation. The hybrid output metrics are then compared directly to MBSTOI to assess performances. Results show the performance of our single channel reference vs MBSTOI. The outcome of this work offers a fast and flexible way to generate audio data for machine learning (ML) and highlights the potential for low level implementation of ML into existing tools

Spiral - Imperial College Digital Repository

Data-driven Speech Intelligibility Enhancement and Prediction for Hearing Aids

Author: Tu Zehai
Publication venue
Publication date: 01/07/2023
Field of study

Hearing impairment is a widespread problem around the world. It is estimated that one in six people are living with some degree of hearing loss. Moderate and severe hearing impairment has been recognised as one of the major causes of disability, which is associated with declines in the quality of life, mental illness and dementia. However, investigation shows that only 10-20\% of older people with significant hearing impairment wear hearing aids. One of the main factors causing the low uptake is that current devices struggle to help hearing aid users understand speech in noisy environments. For the purpose of compensating for the elevated hearing thresholds and dysfunction of source separation processing caused by the impaired auditory system, amplification and denoising have been the major focuses of current hearing aid studies to improve the intelligibility of speech in noise. Also, it is important to derive a metric that can fairly predict speech intelligibility for the better development of hearing aid techniques. This thesis aims to enhance the speech intelligibility of hearing impaired listeners. Motivated by the success of data-driven approaches in many speech processing applications, this work proposes the differentiable hearing aid speech processing (DHASP) framework to optimise both the amplification and denoising modules within a hearing aid processor. This is accomplished by setting an intelligibility-based optimisation objective and taking advantage of large-scale speech databases to train the hearing aid processor to maximise the intelligibility for the listeners. The first set of experiments is conducted on both clean and noisy speech databases, and the results from objective evaluation suggest that the amplification fittings optimised within the DHASP framework can outperform a widely used and well-recognised fitting. The second set of experiments is conducted on a large-scale database with simulated domestic noisy scenes. The results from both objective and subjective evaluations show that the DHASP-optimised hearing aid processor incorporating a deep neural network-based denoising module can achieve competitive performance in terms of intelligibility enhancement. A precise intelligibility predictor can provide reliable evaluation results to save the cost of expensive and time-consuming subjective evaluation. Inspired by the findings that automatic speech recognition (ASR) models show similar recognition results as humans in some experiments, this work exploits ASR models for intelligibility prediction. An intrusive approach using ASR hidden representations and a non-intrusive approach using ASR uncertainty are proposed and explained in the third and fourth experimental chapters. Experiments are conducted on two databases, one with monaural speech in speech-spectrum-shaped noise with normal hearing listeners, and the other one with processed binaural speech in domestic noise with hearing impaired listeners. Results suggest that both the intrusive and non-intrusive approaches can achieve top performances and outperform a number of widely used intelligibility prediction approaches. In conclusion, this thesis covers both the enhancement and prediction of speech intelligibility for hearing aids. The proposed hearing aid processor optimised within the proposed DHASP framework can significantly improve the intelligibility of speech in noise for hearing impaired listeners. Also, it is shown that the proposed ASR-based intelligibility prediction approaches can achieve state-of-the-art performances against a number of widely used intelligibility predictors

White Rose E-theses Online

Model-based speech enhancement for hearing aids

Author: Kavalekalam Mathew Shaji
Publication venue: Aalborg Universitetsforlag
Publication date: 01/01/2018
Field of study

VBN