Search CORE

59 research outputs found

Data-Driven Speech Intelligibility Prediction

Author: Pedersen Mathias
Publication venue: Aalborg Universitetsforlag
Publication date: 01/01/2023
Field of study

VBN

Non-Intrusive Speech Intelligibility Prediction

Author: Sørensen Charlotte
Publication venue: Aalborg Universitetsforlag
Publication date: 01/01/2019
Field of study

VBN

Validation of the Non-Intrusive Codebook-based Short Time Objective Intelligibility Metric for Processed Speech

Author: Boldt Jesper
Christensen Mads Græsbøll
Sørensen Charlotte
Publication venue: 'International Speech Communication Association'
Publication date: 01/01/2019
Field of study

Crossref

VBN

Harmonic Beamformers for Non-Intrusive Speech Intelligibility Prediction

Author: Boldt Jesper
Christensen Mads Græsbøll
Sørensen Charlotte
Publication venue: 'International Speech Communication Association'
Publication date: 01/01/2019
Field of study

Crossref

VBN

Pitch-based non-intrusive objective intelligibility prediction

Author: Boldt Jesper B.
Christensen Mads G.
Sorensen Charlotte
Xenaki Angeliki
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 16/06/2017
Field of study

Crossref

VBN

A non-intrusive method for estimating binaural speech intelligibility from noise-corrupted signals captured by a pair of microphones

Author: Cox TJ
Liu Q
Tang Y
Wang W
Publication venue: 'Elsevier BV'
Publication date: 01/02/2018
Field of study

A non-intrusive method is introduced to predict binaural speech intelligibility in noise directly from signals captured using a pair of microphones. The approach combines signal processing techniques in blind source separation and localisation, with an intrusive objective intelligibility measure (OIM). Therefore, unlike classic intrusive OIMs, this method does not require a clean reference speech signal and knowing the location of the sources to operate. The proposed approach is able to estimate intelligibility in stationary and fluctuating noises, when the noise masker is presented as a point or diffused source, and is spatially separated from the target speech source on a horizontal plane. The performance of the proposed method was evaluated in two rooms. When predicting subjective intelligibility measured as word recognition rate, this method showed reasonable predictive accuracy with correlation coefficients above 0.82, which is comparable to that of a reference intrusive OIM in most of the conditions. The proposed approach offers a solution for fast binaural intelligibility prediction, and therefore has practical potential to be deployed in situations where on-site speech intelligibility is a concern

University of Salford Institutional Repository

Crossref

University of Surrey

Surrey Research Insight

Unsupervised Uncertainty Measures of Automatic Speech Recognition for Non-intrusive Speech Intelligibility Prediction

Author: Barker Jon
Ma Ning
Tu Zehai
Publication venue
Publication date: 08/04/2022
Field of study

Non-intrusive intelligibility prediction is important for its application in realistic scenarios, where a clean reference signal is difficult to access. The construction of many non-intrusive predictors require either ground truth intelligibility labels or clean reference signals for supervised learning. In this work, we leverage an unsupervised uncertainty estimation method for predicting speech intelligibility, which does not require intelligibility labels or reference signals to train the predictor. Our experiments demonstrate that the uncertainty from state-of-the-art end-to-end automatic speech recognition (ASR) models is highly correlated with speech intelligibility. The proposed method is evaluated on two databases and the results show that the unsupervised uncertainty measures of ASR models are more correlated with speech intelligibility from listening results than the predictions made by widely used intrusive methods.Comment: Submitted to INTERSPEECH202

arXiv.org e-Print Archive

End-to-end Speech Intelligibility Prediction Using Time-Domain Fully Convolutional Neural Networks

Author: Andersen Asger Heidemann
Jensen Jesper
Jensen Søren Holdt
Kolbæk Morten
Pedersen Mathias
Publication venue: 'International Speech Communication Association'
Publication date: 01/01/2020
Field of study

Crossref

VBN

Non-intrusive codebook-based intelligibility prediction

Author: Boldt Jesper Bünsow
Christensen Mads Græsbøll
Kavalekalam Mathew Shaji
Sørensen Charlotte
Xenaki Angeliki
Publication venue: 'Elsevier BV'
Publication date: 01/07/2018
Field of study

VBN

Training Data-Driven Speech Intelligibility Predictors on Heterogeneous Listening Test Data

Author: Andersen Asger H.
Jensen Jesper
Jensen Soren Holdt
Pedersen Mathias Bach
Tan Zheng Hua
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2022
Field of study

VBN