4,819 research outputs found
A study on the relationship between the intelligibility and quality of algorithmically-modified speech for normal hearing listeners
This study investigates the relationship between the intelligibility and quality of modified speech in noise and in quiet. Speech signals were processed by seven algorithms designed to increase speech intelligibility in noise without altering speech intensity. In three noise maskers, including both stationary and fluctuating noise at two signal-to-noise ratios (SNR), listeners identified keywords from unmodified or modified sentences. The intelligibility performance of each type of speech was measured as the listeners’ word recognition rate in each condition, while the quality was rated as a mean opinion score. In quiet, only the perceptual quality of each type of speech was assessed. The results suggest that when listening in noise, modification performance on improving intelligibility is more important than its potential negative impact on speech quality. However, when listening in quiet or at SNRs in which intelligibility is no longer an issue to listeners, the impact to speech quality due to modification becomes a concern
A non-intrusive method for estimating binaural speech intelligibility from noise-corrupted signals captured by a pair of microphones
A non-intrusive method is introduced to predict binaural speech intelligibility in noise directly from signals captured using a pair of microphones. The approach combines signal processing techniques in blind source separation
and localisation, with an intrusive objective intelligibility measure (OIM). Therefore, unlike classic intrusive OIMs, this method does not require a clean reference speech signal and knowing the location of the sources to operate.
The proposed approach is able to estimate intelligibility in stationary and fluctuating noises, when the noise masker is presented as a point or diffused source, and is spatially separated from the target speech source on a horizontal
plane. The performance of the proposed method was evaluated in two rooms. When predicting subjective intelligibility measured as word recognition rate, this method showed reasonable predictive accuracy with correlation coefficients above 0.82, which is comparable to that of a reference intrusive OIM in most of the conditions. The proposed approach offers a solution for fast binaural intelligibility prediction, and therefore has practical potential
to be deployed in situations where on-site speech intelligibility is a concern
Background adaptation for improved listening experience in broadcasting
The intelligibility of speech in noise can be improved by modifying the speech. But with object-based audio, there is the possibility of altering the background sound while leaving the speech unaltered. This may prove a less intrusive approach, affording good speech intelligibility without overly compromising the perceived sound quality. In this study, the technique of spectral weighting was applied to the background. The frequency-dependent weightings for adaptation were learnt by maximising a weighted combination of two perceptual objective metrics for speech intelligibility and audio quality. The balance between the two objective metrics was determined by the perceptual relationship between intelligibility and quality. A neural network was trained to provide a fast solution for real-time processing. Tested in a variety of background sounds and speech-to-background ratios (SBRs), the proposed method led to a large intelligibility gain over the unprocessed baseline. Compared to an approach using constant weightings, the proposed method was able to dynamically preserve the overall audio quality better with respect to SBR changes
In Situ Investigation of Charge Performance in Anatase TiO2 Powder for Methane Conversion by Vis–NIR Spectroscopy
The intrinsic behavior of photogenerated charges and reactions with chemicals are key for a photocatalytic process. To observe these basic steps is of great importance. Here we present a reliable and robust system to monitor these basic steps in powder photocatalysts, and more importantly to elucidate the key issue in photocatalytic methane conversion over the benchmark catalyst TiO2. Under constant excitation, the absorption signal across the NIR region was demonstrated to be dominated by photoexcited electrons, the absorption of photoexcited holes increases toward shorter wavelengths in the visible region, and the overall shapes of the photoinduced absorption spectra obtained using the system demonstrated in the present work are consistent with widely accepted transient absorption results. Next, in situ measurements provide direct experimental evidence that the initial step of methane activation over TiO2 involves oxidation by photoexcited holes. It is calculated that 90 ± 6% of photoexcited electrons are scavenged by O2 (in dry air), 61 ± 9% of photoexcited holes are scavenged by methane (10% in argon), and a similar amount of photoexcited electrons can be scavenged by O2 even when the O2 concentration is reduced by a factor of 10. The present results suggest that O2 is much more easily activated in comparison to methane over anatase TiO2, which rationalizes the much higher methane/O2 ratio frequently used in practice in comparison to that required stoichiometrically for photocatalytic production of value-added chemicals via methane oxidation with oxygen. In addition, methanol (a preferable product of methane oxidation) is much more readily oxidized than methane over anatase TiO2
Recommended from our members
Homogeneous catalysis under ultra-dilute conditions: TAML/NaClO oxidation of persistent metaldehyde
This document is the unedited Author’s version of a Submitted Work that was subsequently accepted for publication in Journal of the American Chemical Society, copyright © American Chemical Society after peer review. To access the final edited and published work see https://doi.org/10.1021/jacs.6b11145TAML activators enable homogenous oxidation catalysis where the catalyst and substrate (S) are ultra-dilute (pM–low μM) and the oxidant is very dilute (high nM–low mM). Water contamination by exceptionally persistent micropollutants (MPs), including metaldehyde (Met), provides an ideal space for determining the characteristics and utilitarian limits of this ultradilute catalysis. The low MP concentrations decrease throughout catalysis with S oxidation (kII) and catalyst inactivation (ki) competing for the active catalyst. The percentage of substrate converted (%Cvn) can be increased by discovering methods to increase kII/ki. Here we show that NaClO extends catalyst lifetime to increase the Met turnover number (TON) threefold compared with H2O2, highlighting the importance of oxidant choice as a design tool in TAML systems. Met oxidation studies (pH 7, D2O, 0.01 M phosphate, 25 °C) monitored by 1H NMR spectroscopy show benign acetic acid as the only significant product. Analysis of TAML/NaClO treated Met solutions employing successive identical catalyst doses revealed that the processes can be modeled by the recently published relationship between the initial and final [S] (S0 and S∞, respectively), the initial [catalyst] (FeTot) and kII/ki. Consequently, this study establishes that S is proportional to S0 and that the %Cvn is conserved across all catalyst doses in multicatalyst-dose processes because the rate of the kII process depends on [S] while that of the ki process does not. A general tool for determining the FeTot required to effect a desired %Cvn is presented. Examination of the dependence of TON on kII/ki and FeTot at a fixed S0 indicates that for any TAML process employing FeTot < 1 10-6 M, small catalyst doses are not more efficient than one large dose.T.J.C thanks the Heinz Endowments for funding. NMR instrumentation at CMU was partially supported by NSF (CHE-0130903 and CHE-1039870)
- …