9,786 research outputs found

    Preferred levels for background ducking to produce esthetically pleasing audio for TV with clear speech

    In audio production, background ducking facilitates speech intelligibility while allowing the background to fulfill its purpose, e.g., to create ambience, set the mood, or convey semantic cues. Technical details for recommended ducking practices are not currently documented in the literature. Hence, we first analyzed common practices found in TV documentaries. Second, a listening test investigated the preferences of 22 normal-hearing participants on the Loudness Difference (LD) between commentary and background during ducking. Highly personal preferences were observed, highlighting the importance of object-based personalization. A statistically significant difference was found between non-expert and expert listeners. On average, non-experts preferred LDs that were 4 LU higher than the ones preferred by experts. A statistically significant difference was also found between Commentary over Music (CoM) and Commentary over Ambience (CoA). Based on the test results, we recommend a difference of at least 10 LU for CoM and at least 15 LU for CoA. Moreover, a computational method based on the Binaural Distortion-Weighted Glimpse Proportion (BiDWGP) was found to match the median preferred LD for each item with good accuracy (mean absolute error = 1.97 LU ± 2.50).

    Scraping sounds and disgusting noises

    Thirty-four horrible sounds have been examined in an Internet-based psychoacoustic experiment. This paper presents the results for the scraping and disgusting noises used. It is not understood why some humans find certain scraping noises, such as the sound of fingernails being scraped down a blackboard, so terrible. In this experiment, the variations in ratings with age, gender and location are examined. The results for one of the scraping sounds are consistent with the hypothesis suggested by others, that the response comes from a vestigial reflex related to the warning cries of monkeys. But this was not true for the actual recording of the fingernails scraping down a blackboard. An alternative hypothesis that the response is related to an audio–haptic interaction was tested and results indicated that this idea warrants further investigation. Other possible causes of the response, drawing on work concerning dissonance, are tentatively suggested. The disgusting sounds examined included the worst sound found in the experiment, the sound of someone vomiting. However, none of the disgusting sounds tested promoted responses consistent with a ‘disgust reaction’ based purely on survival instincts. Cultural factors might be important in our response to the disgusting sounds, with the influence of manners and etiquette being suggested as a possible factor.

    Exploring object-based content adaptation for mobile audio

    Mobile devices enable users to consume media with audio content in a wide range of contexts, with environmental noise being present in many of these. Several methods exist that aim to improve the experience of mobile listening by utilising information about the environmental noise, such as volume and dynamic range adaptation. This paper explores a fundamentally different approach to improving the mobile listening experience by using the object-based audio paradigm, where individual audio sources are mixed in response to each specific listening context. Three experimental studies, containing both quantitative and qualitative aspects, are presented which investigate whether environmental noise influences the preferred balance between background and foreground audio objects in a mix. The results indicate that environmental noise can influence the preferred audio mix and that the nature of the adaptations made is dependent upon both audio content and user. Additionally, qualitative analysis provides an understanding of the role of environmental noise in the preferred audio mix. It is believed that the content adaptation method explored in this paper is a simple yet useful tool for adapting content to suit both the context and the user.

    Ordered Preference Elicitation Strategies for Supporting Multi-Objective Decision Making

    In multi-objective decision planning and learning, much attention is paid to producing optimal solution sets that contain an optimal policy for every possible user preference profile. We argue that the step that follows, i.e., determining which policy to execute by maximising the user's intrinsic utility function over this (possibly infinite) set, is under-studied. This paper aims to fill this gap. We build on previous work on Gaussian processes and pairwise comparisons for preference modelling, extend it to the multi-objective decision support scenario, and propose new ordered preference elicitation strategies based on ranking and clustering. Our main contribution is an in-depth evaluation of these strategies using computer and human-based experiments. We show that our proposed elicitation strategies outperform the currently used pairwise methods, and find that users prefer ranking most. Our experiments further show that utilising monotonicity information in GPs, by using a linear prior mean at the start and virtual comparisons to the nadir and ideal points, increases performance. We demonstrate our decision support framework in a real-world study on traffic regulation, conducted with the city of Amsterdam. Comment: AAMAS 2018, Source code at https://github.com/lmzintgraf/gp_pref_elici

    On the applicability of models for outdoor sound (A)

    Tissue-conducted spatial sound fields

    We describe experiments using multiple cranial transducers to achieve auditory spatial perceptual impressions via bone conduction (BC) and tissue conduction (TC), bypassing the peripheral hearing apparatus. This could be useful in cases of peripheral hearing damage or where ear-occlusion is undesirable. Previous work (e.g. Stanley and Walker 2006, MacDonald and Letowski 2006) indicated that robust lateralization is feasible via tissue conduction. We have utilized discrete signals, stereo and first-order ambisonics to investigate control of externalization, range, direction in azimuth and elevation, movement and spaciousness. Early results indicate robust and coherent effects. Current technological implementations are presented and potential development paths discussed.

    Ultrasonic splitting of oil-in-water emulsions
