20 research outputs found

    Audibility and Interpolation of Head-Above-Torso Orientation in Binaural Technology

    Get PDF
    Head-related transfer functions (HRTFs) incorporate fundamental cues required for human spatial hearing and are often applied to auralize results obtained from room acoustic simulations. HRTFs are typically available for various directions of sound incidence and a fixed head-above-torso orientation (HATO). If-in interactive auralizations-HRTFs are exchanged according to the head rotations of a listener, the auralization result most often corresponds to a listener turning head and torso simultaneously, while-in reality-listeners usually turn their head independently above a fixed torso. In the present study, we show that accounting for HATO produces clearly audible differences, thereby suggesting the relevance of correct HATO when aiming at perceptually transparent binaural synthesis. Furthermore, we addressed the efficient representation of variable HATO in interactive acoustic simulations using spatial interpolation. Hereby, we evaluated two different approaches: interpolating between HRTFs with identical torso-to-source but different head-to-source orientations (head interpolation) and interpolating between HRTFs with the same head-to-source but different torso-to-source orientations (torso interpolation). Torso interpolation turned out to be more robust against increasing interpolation step width. In this case the median threshold of audibility for the head-above-torso resolution was about 25 degrees, whereas with head interpolation the threshold was about 10 degrees. Additionally, we tested a non-interpolation approach (nearest neighbor) as a suitable means for mobile applications with limited computational capacities

    Assessing the Authenticity of Individual Dynamic Binaural Synthesis

    Get PDF
    Binaural technology allows to capture sound fields by recording the sound pressure arriving at the listener’s ear canal entrances. If these signals are reconstructed for the same listener the simulation should be indistinguishable from the corresponding real sound field. A simulation fulfilling this premise could be termed as perceptually authentic. Authenticity has been assessed previously for static binaural resynthesis of sound sources in anechoic environments, i.e. for HRTF-based simulations not accounting for head movements of the listeners. Results indicated that simulations were still discernable from real sound fields, at least, if critical audio material was used. However, for dynamic binaural synthesis to our knowledge – and probably because this technology is even more demanding – no such study has been conducted so far. Thus, having developed a state-of-the-art system for individual dynamic auralization of anechoic and reverberant acoustical environments, we assessed its perceptual authenticity by letting subjects directly compare binaural simulations and real sound fields. To this end, individual binaural room impulses were acquired for two different source positions in a medium-sized recording studio, as well as individual headphone transfer functions. Listening tests were conducted for two different audio contents applying a most sensitive ABX test paradigm. Results showed that for speech signals many of the subjects failed to reliably detect the simulation. For pink noise pulses, however, all subjects could distinguish the simulation from reality. Results further provided evidence for future improvements.DFG, WE 4057/3-1, Simulation and Evaluation of Acoustical Environments (SEACEN

    Virtual restoration of the sound of the Hispanic Rite

    Get PDF
    The virtual acoustic reality techniques are powerful tools for the recovery of acoustical heritage of historic buildings. Through the acoustic modeling and auralization techniques itÂŽs possible to reconstruct the sound of disappeared buildings or the ones with significant modifications over the years, knowing the original geometry and the acoustic characteristics of their surfaces. This paper shows the results of a research project whose goal is the virtual recovery of the sound of the Hispanic Rite, the rite celebrated by Christians of the Iberian Peninsula before the imposition of the Roman Rite in the mid-eleventh century. For this purpose, acoustic models of a series of Pre-Romanesque churches were made. These acoustic models represent the churches in their original state, following the reconstruction hypothesis proposed by leading researchers in medieval liturgical archeology. Multichannel anechoic recordings of several pieces of the music of the Hispanic Rite have been carried out using a spherical array composed of 31 microphones. Finally, static and dynamic auralizations have been developed, involving the different liturgical configurations which were usual in this rite

    Soundscape Evaluation of a Heritage Event in an Open Environment: The Water Tribunal of the Plain of Valencia (Spain)

    Full text link
    [EN] The acoustic evaluation of indoor environments is common in the application of virtual acoustics. In addition, in the study of cultural heritage buildings, it is a useful tool, but it is not so common when describing the acoustic environment in intangible cultural heritage events and even in outdoor environments. In this paper, the acoustic environment of the Water Tribunal of the Plain of Valencia (Spain) is studied. It is analyzed from a soundscape perspective, characterizing the sound source and evaluating it within relation to the environment and evaluating its subjective response. With the research carried out, it has been obtained, on the one hand, a complete study of the acoustics of the environment of the Water Tribunal and, on the other, an enhancement of the Valencian tangible and intangible heritage.This research was partially funded by Spanish Ministry of Science, Innovation and Universities, grant number BIA2016-76957-C3-3-R and by the Universitat de Valencia with the grant UV-INV-AE-1550319.Díaz-Rubio, E.; Segura-Garcia, J.; Fayos-Jordan, R.; Cerdå, S.; Cibriån, RM.; Giménez Pérez, A. (2022). Soundscape Evaluation of a Heritage Event in an Open Environment: The Water Tribunal of the Plain of Valencia (Spain). Applied Sciences. 12(9):1-17. https://doi.org/10.3390/app1209429211712

    Data set: BRIRs for position-dynamic binaural synthesis measured in two rooms

    Get PDF
    Binaural room impulse responses were measured with a KEMAR 45BA head-and-torso-simulator. For the first data set, it was placed at different positions located on a line with a length of 2m in a 25 cm positional resolution and an azimuth resolution of 4 . Two source positions were considered in the setup, one in front of the line, one at the side. The same arrangement of source and receiver positions was realized in two different rooms, a quite dry listening laboratory and a quite reverberant seminar room. For the second data set, BRIRs and omni-directional RIRs were measured for a translation line with a length of 7.5m through the given seminar room. The data sets are valuable for realizing, testing and studying dynamic binaural walk-through scenarios in the two different rooms

    Ecological Validity of Immersive Virtual Reality (IVR) Techniques for the Perception of Urban Sound Environments

    Get PDF
    Immersive Virtual Reality (IVR) is a simulated technology used to deliver multisensory information to people under different environmental conditions. When IVR is generally applied in urban planning and soundscape research, it reveals attractive possibilities for the assessment of urban sound environments with higher immersion for human participation. In virtual sound environments, various topics and measures are designed to collect subjective responses from participants under simulated laboratory conditions. Soundscape or noise assessment studies during virtual experiences adopt an evaluation approach similar to in situ methods. This paper aims to review the approaches that are utilized to assess the ecological validity of IVR for the perception of urban sound environments and the necessary technologies during audio–visual reproduction to establish a dynamic IVR experience that ensures ecological validity. The review shows that, through the use of laboratory tests including subjective response surveys, cognitive performance tests and physiological responses, the ecological validity of IVR can be assessed for the perception of urban sound environments. The reproduction system with head-tracking functions synchronizing spatial audio and visual stimuli (e.g., head-mounted displays (HMDs) with first-order Ambisonics (FOA)-tracked binaural playback) represents the prevailing trend to achieve high ecological validity. These studies potentially contribute to the outcomes of a normalized evaluation framework for subjective soundscape and noise assessments in virtual environment

    CPX based synthesis for binaural auralization of vehicle rolling noise to an arbitrary positioned stander-by receiver

    Get PDF
    Virtual reality is becoming an important tool for studying the interaction between pedestrians and road vehicles, by allowing the analysis of potentially hazard situations without placing subjects in real risk. However, most of the current simulators are unable to accurately recreate traffic sounds that are congruent with the visual scene. This has been recognized as a fault in the virtual audio-visual scenarios used in such contexts. This study proposes a method for delivering a binaural auralization of the noise generated by a moving vehicle to an arbitrarily located moving listener (pedestrian). Building on previously developed methods, the proposal presented here integrates in a novel way a dynamic auralization engine, thus enabling real-time update of the acoustic cues in the binaural signal delivered via headphones. Furthermore, the proposed auralization routine uses Close ProXimity (CPX) tyre-road noise signal as sound source input, facilitating the quick interchangeability of source signals, and easing the noise collection procedure. Two validation experiments were carried out, one to quantitatively compare field signals with CPX-derived virtual signal recordings, and another to assess these same signals through psychoacoustic models. The latter aims to assure that the reproduction of the synthesized signal is perceptually similar to one occurring on pedestrian/vehicle interactions during situations of street crossing. Discrepancies were detected, and emphasized when the vehicle is within close distance from the receiver (pedestrian). However, the analysis indicated that these pose no hindrance to the study of vehicle–pedestrian interaction. Improvements to the method are identified and further developments are proposed.This work was supported by the ‘‘Fundação para a CiĂȘncia e a Tecnologia” [PTDC/ECM-TRA/3568/2014, SFRH/BD/131638/2017, UIDB/04029/2020] This work is part of the activities of the research project AnPeB – ‘‘ANalysis of PEdestrians Behaviour based on simulated urban environments and its incorporation in risk modelling” (PTDC/ECM TRA/3568/2014), funded by the ‘‘Promover a Produção CientĂ­fica e Desenvolvimento TecnolĂłgico e a Constituição de Redes TemĂĄti cas” (3599-PPCDT) project and supported by the ‘‘European Com munity Fund FEDER” and the doctoral scholarship SFRH/ BD/131638/2017, funded by ‘‘Fundação para a CiĂȘncia e a Tecnolo gia (FCT)”

    On the plausibility of simplified acoustic room representations for listener translation in dynamic binaural auralizations

    Get PDF
    Diese Doktorarbeit untersucht die Wahrnehmung vereinfachter akustischer RaumreprĂ€sentationen in positionsdynamischer Binauralwiedergabe fĂŒr die Hörertranslation. Die dynamische Binauralsynthese ist eine Audiowiedergabemethode zur Erzeugung rĂ€umlicher auditiver Illusionen ĂŒber Kopfhörer fĂŒr virtuelle, erweiterte und gemischte RealitĂ€t (VR/AR/MR). Dabei ist es nun eine typische Anforderung, immersive Inhalte in sechs Freiheitsgraden (6DOF) zu erkunden. Dynamische binaurale Schallfeldimitationen mit hoher physikalischer Genauigkeit zu realisieren, ist meist mit sehr hohem Rechenaufwand verbunden. FrĂŒhere psychoakustische Studien weisen jedoch darauf hin, dass Menschen eine begrenzte Empfindlichkeit gegenĂŒber den Details des Schallfelds haben, insbesondere im spĂ€ten Nachhall. Dies birgt das Potential physikalischer Vereinfachungen bei der positionsdynamischen Auralisation von RĂ€umen. Beispielsweise wurden Konzepte vorgeschlagen, die auf der perzeptiven Mixing Time oder der Hörbarkeitsschwelle von frĂŒhen Reflexionen basieren, fĂŒr welche jedoch eine grĂŒndliche psychoakustische Bewertung noch aussteht. ZunĂ€chst wurde ein Aufbau zur positionsdynamischen Raumauralisation implementiert und evaluiert. Daran untersucht die Arbeit wesentliche Systemparameter wie die erforderliche rĂ€umliche Auflösung eines Positionsrasters fĂŒr die dynamische Anpassung. Da allgemein etablierte Testmethoden zur wahrnehmungsbezogenen Bewertung von rĂ€umlichen auditiven Illusionen unter BerĂŒcksichtigung interaktiver Hörertranslation fehlten, untersucht die Arbeit verschiedene AnsĂ€tze zur Messung der PlausibilitĂ€t. Auf dieser Grundlage werden physikalische Vereinfachungen im Verlauf des Schallfeldes in positionsdynamischen binauralen Auralisationen der Raumakustik untersucht. FĂŒr die Hauptexperimente wurden binaurale Raumimpulsantworten (BRIRs) entlang einer Linie fĂŒr die Hörertranslation in einem eher trockenen Hörlabor und einem halligen Seminarraum Ă€hnlicher GrĂ¶ĂŸe gemessen. Die erstellten DatensĂ€tze enthalten Szenarien von Hörerbewegungen auf eine virtuelle Schallquelle zu, daran vorbei, davon weg oder dahinter. DarĂŒber hinaus betrachten die Untersuchungen zwei ExtremfĂ€lle der Quellenorientierung, um die Auswirkungen einer Variation der Schallquellenrichtcharakteristik zu berĂŒcksichtigen. Die BRIR-SĂ€tze werden systematisch bearbeitet und vereinfacht, um die Auswirkungen auf die Wahrnehmung zu bewerten. Insbesondere das Konzept der perzeptiven Mixing Time und manipulierte rĂ€umlich-zeitliche Muster frĂŒher Reflexionen dienten als TestfĂ€lle in den psychoakustischen Studien. Die Ergebnisse zeigen ein hohes Potential fĂŒr Vereinfachungen, unterstreichen aber auch die Relevanz der genauen Imitation prominenter frĂŒher Reflexionen. Die Ergebnisse bestĂ€tigen auch das Konzept der wahrnehmungsbezogenen Mixing Time fĂŒr die betrachteten FĂ€lle der positionsdynamischen binauralen Wiedergabe. Die Beobachtungen verdeutlichen, dass gĂ€ngige Testszenarien fĂŒr Auralisierungen, Interpolation und Extrapolation nicht kritisch genug sind, um allgemeine Schlussfolgerungen ĂŒber die Eignung der getesteten Rendering-AnsĂ€tze zu ziehen. Die Arbeit zeigt LösungsansĂ€tze auf.This thesis investigates the effect of simplified acoustic room representations in position-dynamic binaural audio for listener translation. Dynamic binaural synthesis is an audio reproduction method to create spatial auditory illusions over headphones for virtual, augmented, and mixed reality (AR/VR/MR). It has become a typical demand to explore immersive content in six degrees of freedom (6DOF). Realizing dynamic binaural sound field imitations with high physical accuracy requires high computational effort. However, previous psychoacoustic research indicates that humans have limited sensitivity to the details of the sound field. This fact bears the potential to simplify the physics in position-dynamic room auralizations. For example, concepts based on the perceptual mixing time or the audibility threshold of early reflections have been proposed. This thesis investigates the effect of simplified acoustic room representations in position-dynamic binaural audio for listener translation. First, a setup for position dynamic binaural room auralization was implemented and evaluated. Essential system parameters like the required position grid resolution for the audio reproduction were examined. Due to the lack of generally established test methods for the perceptual evaluation of spatial auditory illusions considering interactive listener translation, this thesis explores different approaches for measuring plausibility. Based on this foundation, this work examines physical impairments and simplifications in the progress of the sound field in position dynamic binaural auralizations of room acoustics. For the main experiments, sets of binaural room impulse responses (BRIRs) were measured along a line for listener translation in a relatively dry listening laboratory and a reverberant seminar room of similar size. These sets include scenarios of walking towards a virtual sound source, past it, away from it, or behind it. The consideration of two extreme cases of source orientation took into account the effects of variations in directivity. The BRIR sets were systematically impaired and simplified to evaluate the perceptual effects. Especially the concept of the perceptual mixing time and manipulated spatiotemporal patterns of early reflections served as test cases. The results reveal a high potential for simplification but also underline the relevance of accurately imitating prominent early reflections. The findings confirm the concept of the perceptual mixing time for the considered cases of position-dynamic binaural audio. The observations highlight that common test scenarios for dynamic binaural rendering approaches are not sufficiently critical to draw general conclusions about their suitability. This thesis proposes strategies to solve this

    Perceptual Evaluation of Model- and Signal-Based Predictors of the Mixing Time in Binaural Room Impulse Responses

    Get PDF
    When creating virtual acoustic environments, the computational demands can be reduced by using generic late reverberation. Beyond the “mixing time,” the diffuse reverberation no longer contains details of the specific location. Therefore, a perceptually validated model for predicting the mixing time of different spaces will be helpful. This study evaluates various predictors of the perceptual mixing time using 9 different spaces. Both model- and signal-based estimators of mixing time were examined for their ability to predict the results of a group of expert listeners. For a shoebox-shaped room, the average perceptual mixing time can be predicted by the enclosure’s ratio of volume over surface area V/S and by vV, which serve as indicators of the mean free path length and the reflection density, respectively. Moreover, the “echo density profile” by Abel and Huang (AES paper 6985) can be used to predict the perceptual mixing time from measured data
    corecore