177,431 research outputs found

    The impact of speech type on listening effort and intelligibility for native and non-native listeners

    Get PDF
    Listeners are routinely exposed to many different types of speech, including artificially-enhanced and synthetic speech, styles which deviate to a greater or lesser extent from naturally-spoken exemplars. While the impact of differing speech types on intelligibility is well-studied, it is less clear how such types affect cognitive processing demands, and in particular whether those speech forms with the greatest intelligibility in noise have a commensurately lower listening effort. The current study measured intelligibility, self-reported listening effort, and a pupillometry-based measure of cognitive load for four distinct types of speech: (i) plain i.e. natural unmodified speech; (ii) Lombard speech, a naturally-enhanced form which occurs when speaking in the presence of noise; (iii) artificially-enhanced speech which involves spectral shaping and dynamic range compression; and (iv) speech synthesized from text. In the first experiment a cohort of 26 native listeners responded to the four speech types in three levels of speech-shaped noise. In a second experiment, 31 non-native listeners underwent the same procedure at more favorable signal-to-noise ratios, chosen since second language listening in noise has a more detrimental effect on intelligibility than listening in a first language. For both native and non-native listeners, artificially-enhanced speech was the most intelligible and led to the lowest subjective effort ratings, while the reverse was true for synthetic speech. However, pupil data suggested that Lombard speech elicited the lowest processing demands overall. These outcomes indicate that the relationship between intelligibility and cognitive processing demands is not a simple inverse, but is mediated by speech type. The findings of the current study motivate the search for speech modification algorithms that are optimized for both intelligibility and listening effort.</p

    The impact of speech type on listening effort and intelligibility for native and non-native listeners

    Get PDF
    Listeners are routinely exposed to many different types of speech, including artificially-enhanced and synthetic speech, styles which deviate to a greater or lesser extent from naturally-spoken exemplars. While the impact of differing speech types on intelligibility is well-studied, it is less clear how such types affect cognitive processing demands, and in particular whether those speech forms with the greatest intelligibility in noise have a commensurately lower listening effort. The current study measured intelligibility, self-reported listening effort, and a pupillometry-based measure of cognitive load for four distinct types of speech: (i) plain i.e. natural unmodified speech; (ii) Lombard speech, a naturally-enhanced form which occurs when speaking in the presence of noise; (iii) artificially-enhanced speech which involves spectral shaping and dynamic range compression; and (iv) speech synthesized from text. In the first experiment a cohort of 26 native listeners responded to the four speech types in three levels of speech-shaped noise. In a second experiment, 31 non-native listeners underwent the same procedure at more favorable signal-to-noise ratios, chosen since second language listening in noise has a more detrimental effect on intelligibility than listening in a first language. For both native and non-native listeners, artificially-enhanced speech was the most intelligible and led to the lowest subjective effort ratings, while the reverse was true for synthetic speech. However, pupil data suggested that Lombard speech elicited the lowest processing demands overall. These outcomes indicate that the relationship between intelligibility and cognitive processing demands is not a simple inverse, but is mediated by speech type. The findings of the current study motivate the search for speech modification algorithms that are optimized for both intelligibility and listening effort

    Toward a more comprehensive understanding of the impact of masker type and signal-to-noise ratio on the pupillary response while performing a speech-in-noise test

    Get PDF
    Difficulties arising in everyday speech communication often result from the acoustical environment, which may contain interfering background noise or competing speakers. Thus, listening and understanding speech in noise can be exhausting. Two experiments are presented in the current study that further explored the impact of masker type and Signal-to-Noise Ratio (SNR) on listening effort by means of pupillometry. In both studies, pupillary responses of participants were measured while performing the Danish Hearing in Noise Test (HINT; Nielsen and Dau, 2011). The first experiment aimed to replicate and extend earlier observed effects of noise type and semantic interference on listening effort (Koelewijn et al., 2012). The impact of three different masker types, i.e. a fluctuating noise, a 1-talker masker and a 4-talker masker on listening effort was examined at a fixed speech intelligibility. In a second experiment, effects of SNR on listening effort were examined while presenting the HINT sentences across a broad range of fixed SNRs corresponding to intelligibility scores ranging from 100% to 0% correct performance. A peak pupil dilation (PPD) was calculated and a Growth Curve Analysis (GCA) was performed to examine listening effort involved in speech recognition as a function of SNR. The results of two experiments showed that the pupil dilation response is highly affected by both masker type and SNR when performing the HINT. The PPD was highest, suggesting the highest level of effort, for speech recognition in the presence of the 1-talker masker in comparison to the 4-talker babble and the fluctuating noise masker. However, the disrupting effect of one competing talker disappeared for intelligibly levels around 50%. Furthermore, it was demonstrated that the pupillary response strongly varied as a function of SNRs. Listening effort was highest for intermediate SNRs with performance accuracies ranging between 30% and 70% correct. GCA revealed time-dependent effects of the SNR on the pupillary response that were not reflected in the PPD

    Impact of Masker Type and Reverberation on Pupillary Response and Listening Effort

    Get PDF
    The thesis explores the concept of listening effort by investigating changes in pupil dilation during a speech-in-noise test with six different listening conditions. The changes in pupil size have been captured with an eye tracking device. The test group consisted of 20 volunteers. The results of the listening test revealed that maskers and reverberation had a detrimental effect on speech intelligibility. Mean and peak pupil dilation measurements for anechoic conditions displayed similar patterns, with larger pupil sizes observed for masker types with higher speech recognition thresholds, indicating increased listening effort. The impact of reverberation varied depending on the noise type. This thesis, along with previous studies, highlights the potential of pupillometry as a relevant tool providing an insight into speech processing difficulties not captured by standard diagnostic methods. It suggests that pupillometry could complement existing practices and methods in hearing evaluation. However, further research and the development of detailed guidelines for pupil data pre-processing are necessary to enhance the reliability of pupillometry in clinical settings. By doing so, this method could contribute to a better understanding of hearing challenges faced by patients on a daily basis

    The Time Constraint in Conference Interpreting: Simultaneous vs. Consecutive

    Get PDF
    The present paper focuses on the concept of time constraint in interpreting. The main aim of the study is to compare the two modes of interpreting, i.e. consecutive and simultaneous in terms of the temporal load imposed by the operations constituting each of them. The discussion centres on the issues of external pacing and processing capacity management, the two focal points of The Time Constraint. The paper also examines a range of strategies interpreters resort to in order to minimise the impact of time pressure in both CI and SI, such as EVS regulation, economy of expression, text-editing strategies, and notation techniques

    Simultaneous interpreting : walking a tightrope

    Get PDF
    Several phenomena associated with the differences in the performance of novice interpreters and semi-professionals have been discussed in the paper. Particular emphasis was placed on the occurrence of imported cognitive load which strongly influenced the performance of the subjects also in places where no intrinsic difficulty had been detected. Nevertheless, too little evidence was provided to establish a more detailed pattern of imported cognitive load, which was due to the limited number of participants in the study. It would be possible to obtain more detailed data and comments from the participants by means of interviews conducted individually with the participants. It would allow asking detailed questions to the participants, which might be a more reliable method than the immediate retrospective accounts. Moreover, in the present study such variables as gender differences, age differences and the possible influence of other foreign languages were not taken into account. Perhaps these variables might shed some light on the issue of the management of cognitive resources. Also, the corpus gathered for the present study may be used for the investigation of other aspects of the SI performance

    Cepstral analysis based on the Glimpse proportion measure for improving the intelligibility of HMM-based synthetic speech in noise

    Get PDF
    In this paper we introduce a new cepstral coefficient extraction method based on an intelligibility measure for speech in noise, the Glimpse Proportion measure. This new method aims to increase the intelligibility of speech in noise by modifying the clean speech, and has applications in scenarios such as public announcement and car navigation systems. We first explain how the Glimpse Proportion measure operates and further show how we approximated it to integrate it into an existing spectral envelope parameter extraction method commonly used in the HMM-based speech synthesis framework. We then demonstrate how this new method changes the modelled spectrum according to the characteristics of the noise and show results for a listening test with vocoded and HMM-based synthetic speech. The test indicates that the proposed method can significantly improve intelligibility of synthetic speech in speech shaped noise. Index Terms — cepstral coefficient extraction, objective measure for speech intelligibility, Lombard speech, HMM-based speech synthesis 1

    The impact of sound field systems on learning and attention in elementary school classrooms

    Get PDF
    Purpose: An evaluation of the installation and use of sound field systems (SFS) was carried out to investigate their impact on teaching and learning in elementary school classrooms. Methods: The evaluation included acoustic surveys of classrooms, questionnaire surveys of students and teachers and experimental testing of students with and without the use of SFS. Students ’ perceptions of classroom environments and objective data evaluating change in performance on cognitive and academic assessments with amplification over a six month period are reported. Results: Teachers were positive about the use of SFS in improving children’s listening and attention to verbal instructions. Over time students in amplified classrooms did not differ from those in nonamplified classrooms in their reports of listening conditions, nor did their performance differ in measures of numeracy, reading or spelling. Use of SFS in the classrooms resulted in significantly larger gains in performance in the number of correct items on the nonverbal measure of speed of processing and the measure of listening comprehension. Analysis controlling for classroom acoustics indicated that students ’ listening comprehension score
    • 

    corecore