12 research outputs found

    PAMBOX: A Python auditory modeling toolbox

    No full text
    Poster presented at EuroScipy 2014.

    Toolboxes for modeling auditory perception have a surprisingly long history, starting with the Auditory Toolbox, first written by Malcolm Slaney for Mathematica in 1993 and ported to Matlab in 1998. Here we present the Python Auditory Modeling Toolbox (PAMBOX), an open-source Python package for auditory modeling. The goal of the toolbox is to provide a collection of components that can be easily combined and extended to solve auditory modeling problems.

    PAMBOX contains code for modeling cochlear filtering, envelope extraction, and modulation processing. The toolbox also includes speech intelligibility models, which are commonly used to predict how well speech is understood in a given situation, such as in the presence of noise or reverberation. The intelligibility models use a simple and consistent "predict" API, inspired by scikit-learn's "fit and predict" API, which simplifies comparisons across models. PAMBOX also includes a framework for running intelligibility experiments that is compatible with IPython.parallel.

    Models that are not original to PAMBOX are validated against their original implementations, where available. PAMBOX is built on NumPy, SciPy, and Pandas, and is distributed under the Modified BSD License.
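    The abstract describes the interface only in words. As a rough illustration, a scikit-learn-inspired "predict" interface for intelligibility models might look like the sketch below; the class name, method signature, and output keys are assumptions for illustration, not PAMBOX's documented API.

        import numpy as np
        from scipy.signal import hilbert

        class ToyIntelligibilityModel:
            """Hypothetical model following a scikit-learn-style
            'predict' convention: one method with consistent inputs
            and outputs, so different models can be swapped freely."""

            def predict(self, clean, mixture, fs):
                # Crude proxy for intelligibility: envelope power of
                # the clean speech relative to that of the residual.
                env_speech = np.abs(hilbert(clean))
                env_noise = np.abs(hilbert(mixture - clean))
                snr_env_db = 10 * np.log10(np.mean(env_speech ** 2)
                                           / np.mean(env_noise ** 2))
                return {"snr_env_db": snr_env_db}

        # A consistent interface makes cross-model comparison a loop:
        fs = 22050
        t = np.arange(fs) / fs
        clean = np.sin(2 * np.pi * 440 * t)
        mixture = clean + 0.3 * np.random.randn(fs)
        for model in [ToyIntelligibilityModel()]:
            print(type(model).__name__, model.predict(clean, mixture, fs))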

    The role of across-frequency envelope processing for speech intelligibility

    No full text
    Poster presented at the 21st International Congress on Acoustics in Montreal, in June 2013.

    Speech intelligibility models consist of a preprocessing part that transforms the stimuli into an internal (auditory) representation, and a decision metric that quantifies the effects of the transmission channel, speech interferers, and auditory processing on speech intelligibility. Here, two recent speech intelligibility models, the spectro-temporal modulation index (STMI; Elhilali et al., 2003) and the speech-based envelope power spectrum model (sEPSM; Jørgensen and Dau, 2011), were evaluated in conditions of noisy speech subjected to reverberation and to nonlinear distortions through either a phase-jitter process or noise reduction via spectral subtraction. The contributions of the individual preprocessing stages and the role of the decision metrics were analyzed in the different experimental conditions. It is demonstrated that an explicit across-frequency envelope processing stage, as assumed in the STMI, together with a metric based on the envelope power signal-to-noise ratio, as assumed in the sEPSM, is required to account for all three conditions. However, a simple across audio-frequency mechanism combined with a purely temporal modulation filterbank appears to be sufficient to describe the data, i.e., a joint two-dimensional modulation filterbank might not be required.
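    The sEPSM decision metric referenced here is the envelope power signal-to-noise ratio: the envelope power of the noise alone is subtracted from that of the noisy speech and divided by the noise envelope power. A minimal single-channel sketch of that idea follows, with the modulation filterbank omitted; the function names are illustrative, not the published implementation.

        import numpy as np
        from scipy.signal import hilbert

        def envelope_power(x):
            """AC-coupled envelope power, normalized by the squared
            mean (DC) envelope, as in envelope power spectrum models."""
            env = np.abs(hilbert(x))   # Hilbert envelope
            ac = env - env.mean()      # remove the DC component
            return np.mean(ac ** 2) / env.mean() ** 2

        def snr_env(noisy_speech, noise_alone):
            """Single-channel envelope-power SNR. The metric needs
            separate access to the noisy speech and to the noise
            alone, which matters for the 'implicit streaming' point
            raised in the masking-release abstract below."""
            p_mix = envelope_power(noisy_speech)
            p_noise = envelope_power(noise_alone)
            # Speech envelope power estimated as mix minus noise,
            # floored to keep the ratio positive.
            p_speech = max(p_mix - p_noise, 1e-8)
            return p_speech / p_noise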

    Predicting masking release of lateralized speech

    No full text
    Poster presented at ISAAR 2015, in Nyborg, DK.

    Locsei et al. [2015, Speech in Noise Workshop, Copenhagen, p. 46] measured speech reception thresholds (SRTs) in anechoic conditions where the target speech and the maskers were lateralized using interaural time delays. The maskers were speech-shaped noise (SSN) and reversed babble (RB) with two, four, or eight talkers. For a given interferer type, the number of maskers presented on the target's side was varied, such that none, some, or all maskers were presented on the same side as the target. In general, SRTs did not vary significantly when at least one masker was presented on the same side as the target. The largest masking release (MR) was observed when all maskers were on the opposite side from the target. The data could be accounted for using a binaural extension of the sEPSM model [Jørgensen and Dau, J. Acoust. Soc. Am. 130(3), 1475–1487], which uses a short-term equalization-cancellation process to model binaural unmasking. The modeling results suggest that, in these conditions, explicit top-down processing, such as streaming, is not required and that the MR can be fully accounted for by bottom-up processes alone. However, the model's independent access to the noisy speech and to the noise alone could be considered a form of implicit streaming and should therefore be taken into account when describing such models as purely "bottom-up".
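    The equalization-cancellation step mentioned above equalizes one ear's signal in delay and gain so that it best matches the other ear, then subtracts it, suppressing an interaurally coherent masker. A toy full-signal sketch of the idea follows; the binaural sEPSM applies it in short-term frames within auditory channels, and all names here are illustrative.

        import numpy as np

        def ec_cancel(left, right, fs, max_delay_ms=0.7):
            """Toy equalization-cancellation: search interaural
            delays, scale the delayed right channel to best match
            the left ('equalization'), and keep the residual with
            the least power, i.e., the delay at which the dominant
            (masker) component cancels best."""
            max_lag = int(max_delay_ms * 1e-3 * fs)
            best_residual, best_power = None, np.inf
            for lag in range(-max_lag, max_lag + 1):
                shifted = np.roll(right, lag)  # circular shift: a simplification
                # Least-squares gain before cancelling
                gain = np.dot(left, shifted) / (np.dot(shifted, shifted) + 1e-12)
                residual = left - gain * shifted
                power = np.mean(residual ** 2)
                if power < best_power:
                    best_power, best_residual = power, residual
            return best_residual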