28 research outputs found

    Automated Audio Realism Detection

    Get PDF
    User gaze locations are tracked during an artificial reality experience. Audio content that is spatialized to a target location is temporarily spatialized to a test location for a period of time and then reverts to being spatialized at the target location. The spatialized audio content is presented concurrently with display of a virtual object at the target location. The level of realism of the artificial reality experience is then evaluated using the gaze locations tracked during that period.

    Across-frequency combination of interaural time difference in bilateral cochlear implant listeners

    Get PDF
    The current study examined how cochlear implant (CI) listeners combine temporally interleaved envelope-ITD information across two sites of stimulation. When two cochlear sites jointly transmit ITD information, one possibility is that CI listeners can extract the most reliable ITD cues available; as a result, ITD sensitivity would be sustained or enhanced compared with single-site stimulation. Alternatively, mutual interference across multiple sites of ITD stimulation could make dual-site performance worse than listening to the better of the two electrode pairs. Two experiments used direct stimulation to examine how CI users integrate ITDs across two pairs of electrodes. Experiment 1 tested ITD discrimination for two stimulation sites using 100-Hz sinusoidally modulated 1000-pps-carrier pulse trains. Experiment 2 used the same stimuli ramped with 100-ms windows as a control condition with minimized onset cues. For all stimuli, performance improved monotonically with increasing modulation depth. Results show that when CI listeners are stimulated with electrode pairs at two cochlear sites, sensitivity to ITDs is similar to that seen when only the electrode pair with better sensitivity is activated. None of the listeners showed a decrement in performance from the worse electrode pair. This could be achieved either by listening to the better electrode pair or by truly integrating the information across cochlear sites.
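    As a rough illustration, the kind of stimulus used in Experiment 1 — a 100-Hz sinusoidally amplitude-modulated, 1000-pps pulse train with an ITD imposed on one ear — can be sketched as follows. The sample rate, duration, and modulation convention here are assumptions for illustration, not the study's actual parameters:

    ```python
    import numpy as np

    FS = 100_000   # sample rate in Hz (assumed; high enough to place 1000-pps pulses)
    PPS = 1000     # carrier pulse rate, pulses per second
    FMOD = 100     # sinusoidal modulation frequency in Hz
    DUR = 0.3      # stimulus duration in seconds (assumed)

    def modulated_pulse_train(mod_depth, itd_s=0.0):
        """100-Hz sinusoidally amplitude-modulated 1000-pps pulse train.

        mod_depth in [0, 1] scales the envelope between (1 - mod_depth) and 1;
        itd_s shifts every pulse in time, which is how an interaural time
        difference would be imposed on one ear's signal.
        """
        sig = np.zeros(int(DUR * FS))
        for t in np.arange(0, DUR, 1 / PPS) + itd_s:
            i = int(round(t * FS))
            if 0 <= i < sig.size:
                # envelope sampled at the pulse time
                sig[i] = (1 - mod_depth) + mod_depth * 0.5 * (1 + np.sin(2 * np.pi * FMOD * t))
        return sig

    left = modulated_pulse_train(mod_depth=0.5)                  # reference ear
    right = modulated_pulse_train(mod_depth=0.5, itd_s=200e-6)   # 200-us ITD lag
    ```

    Increasing `mod_depth` deepens the envelope dips, consistent with the monotonic improvement with modulation depth reported above.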

    DYNAMIC TORSO REFLECTION FILTERING FOR INTERACTIVE BINAURAL SPATIAL AUDIO BASED ON BIOLOGICALLY CONSTRAINED IMU DRIFT COMPENSATION

    Get PDF
    An audio system uses biologically constrained IMU drift compensation for spatial audio rendering to drive a dynamic filtering process that better reproduces the acoustic effects of head-on-torso orientation on the HRTF.

    Accurate Sound Localization in Reverberant Environments Is Mediated by Robust Encoding of Spatial Cues in the Auditory Midbrain

    Get PDF
    In reverberant environments, acoustic reflections interfere with the direct sound arriving at a listener's ears, distorting the spatial cues for sound localization. Yet, human listeners have little difficulty localizing sounds in most settings. Because reverberant energy builds up over time, the source location is represented relatively faithfully during the early portion of a sound, but this representation becomes increasingly degraded later in the stimulus. We show that the directional sensitivity of single neurons in the auditory midbrain of anesthetized cats follows a similar time course, although onset dominance in temporal response patterns results in more robust directional sensitivity than expected, suggesting a simple mechanism for improving directional sensitivity in reverberation. In parallel behavioral experiments, we demonstrate that human lateralization judgments are consistent with predictions from a population rate model decoding the observed midbrain responses, suggesting a subcortical origin for robust sound localization in reverberant environments.
    National Institutes of Health (U.S.) (Grant R01 DC002258); National Institutes of Health (U.S.) (Grant R01 DC05778-02); National Institutes of Health (U.S.) (Eaton-Peabody Laboratory Core Grant P30 DC005209); National Institutes of Health (U.S.) (Grant T32 DC0003
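    A population rate model is beyond the scope of an abstract, but the basic cross-correlation readout of ITD that such models build on can be sketched as a toy estimator (sample rate and signals are assumptions, not the authors' model):

    ```python
    import numpy as np

    FS = 48_000  # sample rate in Hz (assumed)

    def estimate_itd(left, right, max_lag_s=1e-3):
        """Locate the peak of the interaural cross-correlation within
        +/- max_lag_s. A positive result means the right-ear signal
        lags the left-ear signal."""
        max_lag = int(max_lag_s * FS)
        lags = np.arange(-max_lag, max_lag + 1)
        corr = [np.dot(left, np.roll(right, -lag)) for lag in lags]
        return lags[int(np.argmax(corr))] / FS

    rng = np.random.default_rng(0)
    direct = rng.standard_normal(FS // 10)   # 100 ms of "direct sound"
    lagged = np.roll(direct, 24)             # right ear delayed by 24 samples
    itd = estimate_itd(direct, lagged)       # recovers 24 / 48000 s = 500 us
    ```

    In a reverberant tail, the right-ear signal would contain reflections uncorrelated with the direct sound, flattening the correlation peak; restricting the readout to the onset portion, as the midbrain's onset dominance effectively does, keeps the estimate robust.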

    An auditory-visual tradeoff in susceptibility to clutter

    Get PDF
    Sensory cortical mechanisms combine auditory or visual features into perceived objects. This is difficult in noisy or cluttered environments. Knowing that individuals vary greatly in their susceptibility to clutter, we wondered whether there might be a relation between an individual's auditory and visual susceptibilities to clutter. In auditory masking, background sound makes spoken words unrecognizable. When masking arises due to interference at central auditory processing stages, beyond the cochlea, it is called informational masking. A strikingly similar phenomenon in vision, called visual crowding, occurs when nearby clutter makes a target object unrecognizable, despite being resolved at the retina. We here compare susceptibilities to auditory informational masking and visual crowding in the same participants. Surprisingly, across participants, we find a negative correlation (R = -0.7) between susceptibility to informational masking and crowding: Participants who have low susceptibility to auditory clutter tend to have high susceptibility to visual clutter, and vice versa. This reveals a tradeoff in the brain between auditory and visual processing.
    R01 DC019126 - NIDCD NIH HHS; R01 EY027964 - NEI NIH HHS
    Accepted manuscript
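    The reported statistic is an across-participant Pearson correlation. With entirely synthetic susceptibility scores — invented to mimic the sign of the reported effect, not the study's data — the computation looks like:

    ```python
    import numpy as np

    # Synthetic, illustrative susceptibility scores (higher = more susceptible);
    # these numbers are NOT from the study.
    masking_susceptibility = np.array([0.9, 0.8, 0.7, 0.5, 0.4, 0.3, 0.2])
    crowding_susceptibility = np.array([0.2, 0.3, 0.5, 0.4, 0.7, 0.8, 0.9])

    # Pearson correlation across participants; a negative r means listeners
    # less susceptible to auditory clutter tend to be more susceptible to
    # visual clutter
    r = np.corrcoef(masking_susceptibility, crowding_susceptibility)[0, 1]
    ```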

    FORUM: Remote testing for psychological and physiological acoustics

    Get PDF
    Acoustics research involving human participants typically takes place in specialized laboratory settings. Listening studies, for example, may present controlled sounds using calibrated transducers in sound-attenuating or anechoic chambers. In contrast, remote testing takes place outside of the laboratory in everyday settings (e.g., participants' homes). Remote testing could provide greater access to participants, larger sample sizes, and opportunities to characterize performance in typical listening environments, at the cost of reduced control of environmental conditions, less precise calibration, and inconsistency in attentional state and/or response behaviors. The Acoustical Society of America Technical Committee on Psychological and Physiological Acoustics launched the Task Force on Remote Testing (https://tcppasa.org/remotetesting/) in May 2020 with goals of surveying approaches and platforms available to support remote testing and identifying challenges and considerations for prospective investigators. The results of this task force survey were made available online in the form of a set of Wiki pages and summarized in this report. This report outlines the state of the art of remote testing in auditory-related research as of August 2021, based on the Wiki and a literature search of papers published in this area since 2020, and provides three case studies to demonstrate feasibility in practice.

    Increased reliance on temporal coding when target sound is softer than the background

    No full text
    Abstract Everyday environments often contain multiple concurrent sound sources that fluctuate over time. Normally hearing listeners can benefit from high signal-to-noise ratios (SNRs) in energetic dips of temporally fluctuating background sound, a phenomenon called dip-listening. Specialized mechanisms of dip-listening exist across the entire auditory pathway. Both the instantaneous fluctuating and the long-term overall SNR shape dip-listening. An unresolved issue regarding cortical mechanisms of dip-listening is how target perception remains invariant to overall SNR, specifically, across different tone levels with an ongoing fluctuating masker. Equivalent target detection over both positive and negative overall SNRs (SNR invariance) is reliably achieved in highly trained listeners. Dip-listening is correlated with the ability to resolve temporal fine structure, which involves temporally varying spike patterns. Thus, the current work tests the hypothesis that at negative SNRs, neuronal readout mechanisms need to increasingly rely on decoding strategies based on temporal spike patterns, as opposed to spike count. Recordings from chronically implanted electrode arrays in core auditory cortex of trained, awake Mongolian gerbils engaged in a tone detection task in 10-Hz amplitude-modulated background sound reveal that rate-based decoding is not SNR-invariant, whereas temporal coding is informative at both negative and positive SNRs.
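    The distinction between overall SNR and the instantaneous SNR inside masker dips can be made concrete with a toy calculation; the carrier frequencies and levels here are assumptions for illustration, not the study's stimuli:

    ```python
    import numpy as np

    FS = 10_000                    # sample rate in Hz (assumed)
    t = np.arange(FS) / FS         # one second of time samples

    # 10-Hz amplitude-modulated masker and a softer, steady target tone
    masker_env = 0.5 * (1 + np.sin(2 * np.pi * 10 * t))  # envelope with 10-Hz dips
    masker = masker_env * np.sin(2 * np.pi * 400 * t)    # 400-Hz carrier (assumed)
    target = 0.2 * np.sin(2 * np.pi * 1000 * t)          # softer 1-kHz tone (assumed)

    def snr_db(sig, noise):
        return 10 * np.log10(np.mean(sig**2) / np.mean(noise**2))

    overall = snr_db(target, masker)              # long-term SNR: negative

    dips = masker_env < 0.1                       # samples inside the masker dips
    in_dips = snr_db(target[dips], masker[dips])  # instantaneous SNR: positive
    ```

    A rate (spike-count) code tracking this overall level relation would confound target level with masker level, whereas a temporal code locked to the target's periodicity can remain informative even when the overall SNR is negative.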

    Interaural Level Differences Do Not Suffice for Restoring Spatial Release from Masking in Simulated Cochlear Implant Listening

    Get PDF
    Spatial release from masking refers to a benefit for speech understanding that occurs when a target talker and a masker talker are spatially separated; in those cases, speech intelligibility for target speech is typically higher than when both talkers are at the same location. In cochlear implant listeners, spatial release from masking is much reduced or absent compared with normal hearing listeners. Perhaps this reduced spatial release occurs because cochlear implant listeners cannot effectively attend to spatial cues. Three experiments examined factors that may interfere with deploying spatial attention to a target talker masked by another talker. To simulate cochlear implant listening, stimuli were vocoded with two unique features. First, we used 50-Hz low-pass filtered speech envelopes and noise carriers, strongly reducing the possibility of temporal pitch cues; second, co-modulation was imposed on target and masker utterances to enhance perceptual fusion between the two sources. Stimuli were presented over headphones. Experiments 1 and 2 presented high-fidelity spatial cues with unprocessed and vocoded speech. Experiment 3 maintained faithful long-term average interaural level differences but presented scrambled interaural time differences with vocoded speech. Results show a robust spatial release from masking in Experiments 1 and 2, and a greatly reduced spatial release in Experiment 3. Faithful long-term average interaural level differences were insufficient for producing spatial release from masking. This suggests that appropriate interaural time differences are necessary for restoring spatial release from masking, at least for a situation where there are few viable alternative segregation cues.
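    The vocoding manipulation described above — discard temporal fine structure, keep only a slow (below ~50 Hz) envelope on a noise carrier — can be sketched in a single band as follows. The sample rate and the crude moving-average "low-pass" are assumptions, and the study's actual vocoder will differ:

    ```python
    import numpy as np

    FS = 16_000  # sample rate in Hz (assumed)

    def slow_envelope(x, cutoff_hz=50.0):
        """Crude envelope extraction: rectify, then smooth with a moving
        average whose length roughly matches a `cutoff_hz` low-pass.
        A real vocoder would use a proper filter design."""
        win = int(FS / cutoff_hz)
        return np.convolve(np.abs(x), np.ones(win) / win, mode="same")

    def noise_vocode(speech, rng):
        """Replace the fine structure of `speech` with a noise carrier,
        keeping only the slow amplitude envelope."""
        return slow_envelope(speech) * rng.standard_normal(speech.size)

    # A 4-Hz amplitude-modulated 200-Hz tone stands in for a speech-like signal
    t = np.arange(FS) / FS
    signal = 0.5 * (1 + np.sin(2 * np.pi * 4 * t)) * np.sin(2 * np.pi * 200 * t)
    vocoded = noise_vocode(signal, np.random.default_rng(0))
    ```

    The slow envelope survives this manipulation while the carrier — and with it any temporal pitch cue — becomes noise.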

    The effect of auditory spatial layout in a divided attention task

    Get PDF
    The effect of spatial separation on the ability of listeners to report keywords from two simultaneous talkers was examined. The talkers were presented with equal intensity at a clearly audible level, and were designed to have little spectral overlap in order to reduce energetic interference. The two talkers were presented in a virtual auditory environment with various angular separations around references of −45°, 0°, or 45° azimuth. In Experiment 1, the virtual space was created using head-related transfer functions (HRTFs), which contained natural energy variations as a function of location. In Experiment 2, these energy variations were removed and the virtual space was created using only interaural time differences (ITDs). Overall, performance did not vary dramatically but depended on spatial separation, reference direction, and type of simulation. Around the 0° reference azimuth, performance in the HRTF condition tended to first increase and then decrease with increasing separation. This effect was greatly reduced in the ITD condition and thus appears to be related primarily to energy variations at the two ears. For sources around the ±45° reference azimuths, there was an advantage to separating the two sources in both HRTF and ITD conditions, suggesting that perceived spatial separation is advantageous in a divided attention task, at least for lateral sources.
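    The ITD-only simulation of Experiment 2 amounts to delaying one ear's signal while leaving levels and spectra untouched; a minimal sketch (sample rate and stimulus are assumed for illustration):

    ```python
    import numpy as np

    FS = 44_100  # sample rate in Hz (assumed)

    def apply_itd(mono, itd_s):
        """Render a mono signal with only an interaural time difference:
        one ear is delayed, levels stay identical, and no spectral (HRTF)
        cues are introduced. Positive itd_s delays the right ear, placing
        the source toward the left."""
        d = int(round(abs(itd_s) * FS))
        delayed = np.concatenate([np.zeros(d), mono])[: mono.size]
        if itd_s >= 0:
            return np.stack([mono, delayed])   # rows: [left, right]
        return np.stack([delayed, mono])

    t = np.arange(FS // 10) / FS               # 100 ms
    tone = np.sin(2 * np.pi * 500 * t)
    stereo = apply_itd(tone, 300e-6)           # ~300-us ITD, source toward left
    ```

    HRTF-based rendering adds frequency-dependent interaural level differences and spectral shaping on top of this pure time shift, which is exactly the energy variation the HRTF condition contributes over the ITD-only condition.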