9,786 research outputs found
Preferred levels for background ducking to produce esthetically pleasing audio for TV with clear speech
In audio production, background ducking facilitates speech intelligibility while allowing the background to fulfill its purpose, e.g., to create ambience, set the mood, or convey semantic cues. Technical details for recommended ducking practices are not currently documented in the literature. Hence, we first analyzed common practices found in TV documentaries. Second, a listening test investigated the preferences of 22 normal-hearing participants on the Loud- ness Difference (LD) between commentary and background during ducking. Highly personal preferences were observed, highlighting the importance of object-based personalization. Sta- tistically significant difference was found between non-expert and expert listeners. On average, non-experts preferred LDs that were 4 LU higher than the ones preferred by experts. A sta- tistically significant difference was also found between Commentary over Music (CoM) and Commentary over Ambience (CoA). Based on the test results, we recommend at least 10 LU difference for CoM and at least 15 LU for CoA. Moreover, a computational method based on the Binaural Distortion-Weighted Glimpse Proportion (BiDWGP) was found to match the median preferred LD for each item with good accuracy (mean absolute error = 1.97 LU ± 2.50)
Scraping sounds and disgusting noises
Thirty-four horrible sounds have been examined in an Internet-based psychoacoustic experiment. This paper presents the results for the scraping and disgusting noises used. It is not understood why some humans find certain scraping noises, such as the sound of fingernails being scraped down a blackboard, so terrible. In this experiment, the variations in ratings with age, gender and location are examined. The results for one of the scraping sounds is consistent with the hypothesis suggested by others, that the response comes from a vestigial reflex related to the warning cries of monkeys. But this was not true for the actual recording of the fingernails scraping down a blackboard. An alternative hypothesis that the response is related to an audio–haptic interaction was tested and results indicated that this idea warrants further investigation. Other possible causes of the response, drawing on work concerning dissonance, are tentatively suggested. The disgusting sounds examined included the worst sound found in the experiment, the sound of someone vomiting. However, none of the disgusting sounds tested promoted responses consistent with a ‘disgust reaction’ based purely on survival instincts. Cultural factors might be important in our response to the disgusting sounds, with the influence of manners and etiquette being suggested as a possible factor
Exploring object-based content adaptation for mobile audio
Mobile devices enable users to consume media with audio content in a wide range of contexts, with environmental noise being present in many of these. Several methods exist that aim to improve the experience of mobile listening by utilising information about the environmental noise, such as volume and dynamic range adaptation. This paper explores a fundamentally different approach to improving the mobile listening experience by using the object-based audio paradigm, where individual audio sources are mixed in response to each specific listening context. Three experimental studies, containing both quantitative and qualitative aspects, are presented which investigate whether environmental noise influences preference of background-foreground audio object balance in a mix. The results indicate that environmental noise can influence the preferred audio mix and that the nature of the adaptations made is dependent upon both audio content and user. Additionally, qualitative analysis provides an understanding of the role of environmental noise on preferred audio mix. It is believed that the content adaptation method explored in this paper is a simple yet useful tool for adapting content to suit both the context and the user
Hearing aid processing of loud speech and noise signals: Consequences for loudness perception and listening comfort.
Ordered Preference Elicitation Strategies for Supporting Multi-Objective Decision Making
In multi-objective decision planning and learning, much attention is paid to
producing optimal solution sets that contain an optimal policy for every
possible user preference profile. We argue that the step that follows, i.e,
determining which policy to execute by maximising the user's intrinsic utility
function over this (possibly infinite) set, is under-studied. This paper aims
to fill this gap. We build on previous work on Gaussian processes and pairwise
comparisons for preference modelling, extend it to the multi-objective decision
support scenario, and propose new ordered preference elicitation strategies
based on ranking and clustering. Our main contribution is an in-depth
evaluation of these strategies using computer and human-based experiments. We
show that our proposed elicitation strategies outperform the currently used
pairwise methods, and found that users prefer ranking most. Our experiments
further show that utilising monotonicity information in GPs by using a linear
prior mean at the start and virtual comparisons to the nadir and ideal points,
increases performance. We demonstrate our decision support framework in a
real-world study on traffic regulation, conducted with the city of Amsterdam.Comment: AAMAS 2018, Source code at
https://github.com/lmzintgraf/gp_pref_elici
Tissue-conducted spatial sound fields
We describe experiments using multiple cranial transducers to achieve auditory spatial perceptual impressions via bone (BC) and tissue conduction (TC), bypassing the peripheral hearing apparatus. This could be useful in cases of peripheral hearing damage or where ear-occlusion is undesirable. Previous work (e.g. Stanley and Walker 2006, MacDonald and Letowski 2006)1,2 indicated robust lateralization is feasible via tissue conduction. We have utilized discrete signals, stereo and first order ambisonics to investigate control of externalization, range, direction in azimuth and elevation, movement and spaciousness. Early results indicate robust and coherent effects. Current technological implementations are presented and potential development paths discussed
Free-field reciprocity calibration of laboratory standard (LS) microphones using a time selective technique
- …
