96 research outputs found
The effect of an active transcutaneous bone conduction device on spatial release from masking
Objective: The aim was to quantify the effect of the experimental active transcutaneous Bone Conduction Implant (BCI) on spatial release from masking (SRM) in subjects with bilateral or unilateral conductive and mixed hearing loss. Design: Measurements were performed in a sound booth with five loudspeakers at 0\ub0, +/−30\ub0 and +/−150\ub0 azimuth. Target speech was presented frontally, and interfering speech from either the front (co-located) or surrounding (separated) loudspeakers. SRM was calculated as the difference between the separated and the co-located speech recognition threshold (SRT). Study Sample: Twelve patients (aged 22–76 years) unilaterally implanted with the BCI were included. Results: A positive SRM, reflecting a benefit of spatially separating interferers from target speech, existed for all subjects in unaided condition, and for nine subjects (75%) in aided condition. Aided SRM was lower compared to unaided in nine of the subjects. There was no difference in SRM between patients with bilateral and unilateral hearing loss. In aided situation, SRT improved only for patients with bilateral hearing loss. Conclusions: The BCI fitted unilaterally in patients with bilateral or unilateral conductive/mixed hearing loss seems to reduce SRM. However, data indicates that SRT is improved or maintained for patients with bilateral and unilateral hearing loss, respectively
Growing up with one ear : central auditory structure and function in unilateral ear canal atresia
The following thesis aims to give more insight into the functional and structural response of the central auditory system to congenital unilateral ear canal atresia (UCA) and the accompanying asymmetric hearing with conductive hearing loss on the atretic side.
There is clear evidence that unilateral hearing loss, including UCA, has a negative impact on sound localization ability and perception of speech in noise. There is a spread in performance within the group, and the reason for this is not well known. In paper I of this thesis we examined sound localization with eye tracking and perception of speech in a cocktail party setting, in participants with congenital unilateral ear canal atresia, who had no hearing aids before age 12 (n=12) and compared to normal hearing references. Results show that the level of hearing loss on the atretic ear was associated with sound localization ability but not to speech perception.
In the second study, participants with UCA (n=17) underwent MRI-scanning of the brain with diffusion weighted imaging (DWI). A method is described how to segment the white matter bundle between the medial geniculate body of the thalamus and the primary auditory cortex, the acoustic radiation (AR). Methods to define the AR are previously described in high resolution diffusion weighted imaging (DWI) scans but is very time consuming or has problems with including more structures around the primary auditory cortex (PAC). An algorithm was trained to quickly segment the core of the AR in individual clinical scans. The white matter tract was also assessed with measurements of fractional anisotropy (FA), but no differences were found between UCA and normal hearing (NH) controls.
The third study describes the measurements of the grey matter of the primary auditory cortex of the Heschl’s gyrus in the same participants as in paper II. Thickness and volume of the Heschl’s gyrus were compared within the groups of UCA and controls, and between the groups. A difference in thickness was found between the left and right side (right thicker than left, corrected p=0.0012) in UCA, whereas controls had symmetric thickness. Volumes and total thickness were not different compared to controls.
Rat brains from 12 months old rats with a surgically constructed left-sided ear canal atresia were examined in study IV. DWI was acquired in a research camera for rodents, 9.4 T magnetic field and a prolonged scanning time. Tractography and FA measurements were obtained both from whole brains and from tracts between auditory regions of interest (ROIs) using two different software. FA was generally higher in UCA rats than in controls. The AR was asymmetric in FA (left<right) in UCA, whereas FA was symmetric in controls. The FA was found to be lower at the left connection (same side as hearing loss) cochlear nucleus -inferior colliculus compared to the right side in UCA, while it was symmetric in controls. This finding (CN-IC) aligns with previous histology findings in ferrets with unilateral conductive hearing loss
Effects of Coordinated Bilateral Hearing Aids and Auditory Training on Sound Localization
This thesis has two main objectives: 1) evaluating the benefits of the bilateral coordination of the hearing aid Digital Signal Processing (DSP) features by measuring and comparing the auditory performance with and without the activation of this coordination, and 2) evaluating the benefits of acclimatization and auditory training on such auditory performance and, determining whether receiving training in one aspect of auditory performance (sound localization) would generalize to an improvement in another aspect of auditory performance (speech intelligibility in noise), and to what extent. Two studies were performed. The first study evaluated the speech intelligibility in noise and horizontal sound localization abilities in HI listeners using hearing aids that apply bilateral coordination of WDRC. A significant improvement was noted in sound localization with bilateral coordination on when compared to off, while speech intelligibility in noise did not seem to be affected. The second study was an extension of the first study, with a suitable period for acclimatization provided and then the participants were divided into training and control groups. Only the training group received auditory training. The training group performance was significantly better than the control group performance in some conditions, in both the speech intelligibility and the localization tasks. The bilateral coordination did not have significant effects on the results of the second study.
This work is among the early literature to investigate the impact of bilateral coordination in hearing aids on the users’ auditory performance. Also, this work is the first to demonstrate the effect of auditory training in sound localization on the speech intelligibility performance
Objective and Subjective Evaluation of Binaural Beamformers in Hearing Aids
Hearing aids use a variety of noise reduction techniques to enhance the experience of hearing impaired listeners. One of these techniques is beamforming, which typically aims to preserve sounds coming from the front of the user and suppresses those from the sides and back. Recently, hearing aids have begun employing a wireless connection between the left and right hearing aids in order to augment the directionality of the beamformers, called binaural beamformers. However, the effect of these binaural beamformers on perceived quality and intelligibility has not been thoroughly tested. This thesis investigated the benchmarking of hearing aids which utilize binaural beamforming algorithms using behavioural testing and computational models. Speech recordings from bilateral pairs of several popular hearing aids were obtained across different processing conditions, and in different noisy and reverberant environments. The quality of these recordings was evaluated subjectively by thirteen hearing impaired adults. In addition, computational predictors of perceived quality and intelligibility were extracted from the left and right hearing aid recordings. Objective and subjective analyses revealed that binaural beamforming has a generally positive effect on quality and intelligibility that was dependent on the directionality of the speech and noise. The ear recording with the better predicted quality score was also found to correlate better with the subjective quality ratings than the average of left and right ear predicted scores. A new weighting function that optimally combines the monaural computational metrics was developed, which was shown to be especially effective in environments where speech and/or noise sources are asymmetrically positioned
Spatial auditory display for acoustics and music collections
PhDThis thesis explores how audio can be better incorporated into how people access
information and does so by developing approaches for creating three-dimensional audio
environments with low processing demands. This is done by investigating three research
questions.
Mobile applications have processor and memory requirements that restrict the
number of concurrent static or moving sound sources that can be rendered with binaural
audio. Is there a more e cient approach that is as perceptually accurate as the traditional
method? This thesis concludes that virtual Ambisonics is an ef cient and accurate means
to render a binaural auditory display consisting of noise signals placed on the horizontal
plane without head tracking. Virtual Ambisonics is then more e cient than convolution
of HRTFs if more than two sound sources are concurrently rendered or if movement of
the sources or head tracking is implemented.
Complex acoustics models require signi cant amounts of memory and processing. If
the memory and processor loads for a model are too large for a particular device, that
model cannot be interactive in real-time. What steps can be taken to allow a complex
room model to be interactive by using less memory and decreasing the computational
load? This thesis presents a new reverberation model based on hybrid reverberation
which uses a collection of B-format IRs. A new metric for determining the mixing
time of a room is developed and interpolation between early re
ections is investigated.
Though hybrid reverberation typically uses a recursive lter such as a FDN for the late
reverberation, an average late reverberation tail is instead synthesised for convolution
reverberation.
Commercial interfaces for music search and discovery use little aural information
even though the information being sought is audio. How can audio be used in
interfaces for music search and discovery? This thesis looks at 20 interfaces and
determines that several themes emerge from past interfaces. These include using a two
or three-dimensional space to explore a music collection, allowing concurrent playback of
multiple sources, and tools such as auras to control how much information is presented. A
new interface, the amblr, is developed because virtual two-dimensional spaces populated
by music have been a common approach, but not yet a perfected one. The amblr is also
interpreted as an art installation which was visited by approximately 1000 people over 5
days. The installation maps the virtual space created by the amblr to a physical space
Realising the head-shadow benefit to cochlear implant users
Cochlear implant (CI) users struggle to understand speech in noise. They suffer from elevated hearing thresholds and, with practically no binaural unmasking, they rely heavily on better-ear listening and lip reading. Traditional measures of spatial release from masking (SRM) quantify the speech reception threshold (SRT) improvement due to the azimuthal separation of speech and interferers when directly facing the speech source. The Jelfs et al. (2011) model of SRM predicts substantial benefits of orienting the head away from the target speech.
Audio-only and audio-visual (AV) SRTs in normally hearing (NH) listeners and CI users confirmed model predictions of speech-facing SRM and head-orientation benefit (HOB). The lip-reading benefit (LRB) was not disrupted by a modest 30° orientation. When attending to speech with a gradually diminishing speech-to-noise-ratio (SNR), CI users were found to make little spontaneous use of their available HOB. Following a simple instruction to explore their HOB, CI users immediately reached as much as 5 dB lower SNRs. AV speech presentation significantly inhibited head movements (it nearly eradicated CI users’ spontaneous head turns), but had a limited impact on the SNRs reached post-instruction, compared to audio-only presentation. NH listeners age-matched to our CI participants made more spontaneous head turns in the free-head experiment but were poorer than CI users at exploiting their HOB post-instruction, despite their exhibiting larger objective HOB. NH listeners’ and CI users’ LRB measured 3 and 5 dB, respectively.
Our findings both dispel the erroneous beliefs held by CI professionals that facing the speech constitutes an optimal listening strategy (whether for lip-reading or to optimise the use of microphone directionality) and pave the way to obvious translational applications
Neural architecture for echo suppression during sound source localization based on spiking neural cell models
Zusammenfassung
Diese Arbeit untersucht die biologischen Ursachen des psycho-akustischen
Präzedenz Effektes, der Menschen in die Lage versetzt, akustische Echos während
der Lokalisation von Schallquellen zu unterdrücken. Sie enthält ein Modell zur
Echo-Unterdrückung während der Schallquellenlokalisation, welches in technischen
Systemen zur Mensch-Maschine Interaktion eingesetzt werden kann.
Die Grundlagen dieses Modells wurden aus eigenen elektrophysiologischen
Experimenten an der Mongolischen Wüstenrennmaus gewonnen. Die dabei erstmalig an
der Wüstenrennmaus erzielten Ergebnisse, zeigen ein besonderes Verhalten
spezifischer Zellen im Dorsalen Kern des Lateral Lemniscus, einer dedizierten
Region des auditorischen Hirnstammes. Die dort sichtbare Langzeithemmung scheint
die Grundlage für die Echounterdrückung in höheren auditorischen Zentren zu
sein. Das entwickelte Model war in der Lage dieses Verhalten nachzubilden, und
legt die Vermutung nahe, dass eine starke und zeitlich präzise Hyperpolarisation
der zugrundeliegende physiologische Mechanismus dieses Verhaltens ist.
Die entwickelte Neuronale Modellarchitektur modelliert das Innenohr und fünf
wesentliche Kerne des auditorischen Hirnstammes in ihrer Verbindungsstruktur und
internen Dynamik. Sie stellt einen neuen Typus neuronaler Modellierung dar, der
als Spike-Interaktionsmodell (SIM) bezeichnet wird. SIM nutzen die präzise
räumlich-zeitliche Interaktion einzelner Aktionspotentiale (Spikes) für die
Kodierung und Verarbeitung neuronaler Informationen. Die Basis dafür bilden
Integrate-and-Fire Neuronenmodelle sowie Hebb'sche Synapsen, welche um speziell
entwickelte dynamische Kernfunktionen erweitert wurden. Das Modell ist in der
Lage, Zeitdifferenzen von 10 mykrosekunden zu detektieren und basiert auf den
Prinzipien der zeitlichen und räumlichen Koinzidenz sowie der präzisen lokalen
Inhibition.
Es besteht ausschließlich aus Elementen einer eigens entwickelten Neuronalen
Basisbibliothek (NBL) die speziell für die Modellierung verschiedenster Spike-
Interaktionsmodelle entworfen wurde. Diese Bibliothek erweitert die kommerziell
verfügbare dynamische Simulationsumgebung von MATLAB/SIMULINK um verschiedene
Modelle von Neuronen und Synapsen, welche die intrinsischen dynamischen
Eigenschaften von Nervenzellen nachbilden. Die Nutzung dieser Bibliothek
versetzt sowohl den Ingenieur als auch den Biologen in die Lage, eigene,
biologisch plausible, Modelle der neuronalen Informationsverarbeitung ohne
detaillierte Programmierkenntnisse zu entwickeln. Die grafische Oberfläche
ermöglicht strukturelle sowie parametrische Modifikationen und ist in der Lage,
den Zeitverlauf mikroskopischer Zellpotentiale aber auch makroskopischer
Spikemuster während und nach der Simulation darzustellen.
Zwei grundlegende Elemente der Neuronalen Basisbibliothek wurden zur
Implementierung als spezielle analog-digitale Schaltungen vorbereitet.
Erste Silizium Implementierungen durch das Team des DFG Graduiertenkollegs GRK
164 konnten die Möglichkeit einer vollparallelen on line Verarbeitung von
Schallsignalen nachweisen. Durch Zuhilfenahme des im GRK entwickelten
automatisierten Layout Generators wird es möglich, spezielle Prozessoren zur
Anwendung biologischer Verarbeitungsprinzipien in technischen Systemen zu
entwickeln. Diese Prozessoren unterscheiden sich grundlegend von den klassischen
von Neumann Prozessoren indem sie räumlich und zeitlich verteilte Spikemuster,
anstatt sequentieller binärer Werte zur Informationsrepräsentation nutzen. Sie
erweitern das digitale Kodierungsprinzip durch die Dimensionen des Raumes (2
dimensionale Nachbarschaft) der Zeit (Frequenz, Phase und Amplitude) sowie der
zeitlichen Dynamik analoger Potentialverläufe.
Diese Dissertation besteht aus sieben Kapiteln, welche den verschiedenen
Bereichen der Computational Neuroscience gewidmet sind.
Kapitel 1 beschreibt die Motivation dieser Arbeit welche aus der Absicht rühren,
biologische Prinzipien der Schallverarbeitung zu erforschen und für technische
Systeme während der Interaktion mit dem Menschen nutzbar zu machen. Zusätzlich
werden fünf Gründe für die Nutzung von Spike-Interaktionsmodellen angeführt
sowie deren neuartiger Charakter beschrieben.
Kapitel 2 führt die biologischen Prinzipien der Schallquellenlokalisation und
den psychoakustischen Präzedenz Effekt ein. Aktuelle Hypothesen zur Entstehung
dieses Effektes werden anhand ausgewählter experimenteller Ergebnisse
verschiedener Forschungsgruppen diskutiert.
Kapitel 3 beschreibt die entwickelte Neuronale Basisbibliothek und führt die
einzelnen neuronalen Simulationselemente ein. Es erklärt die zugrundeliegenden
mathematischen Funktionen der dynamischen Komponenten und beschreibt deren
generelle Einsetzbarkeit zur dynamischen Simulation spikebasierter Neuronaler
Netzwerke.
Kapitel 4 enthält ein speziell entworfenes Modell des auditorischen Hirnstammes
beginnend mit den Filterkaskaden zur Simulation des Innenohres, sich fortsetzend
über mehr als 200 Zellen und 400 Synapsen in 5 auditorischen Kernen bis zum
Richtungssensor im Bereich des auditorischen Mittelhirns. Es stellt die
verwendeten Strukturen und Parameter vor und enthält grundlegende Hinweise zur
Nutzung der Simulationsumgebung.
Kapitel 5 besteht aus drei Abschnitten, wobei der erste Abschnitt die
Experimentalbedingungen und Ergebnisse der eigens durchgeführten Tierversuche
beschreibt. Der zweite Abschnitt stellt die Ergebnisse von 104 Modellversuchen
zur Simulationen psycho-akustischer Effekte dar, welche u.a. die Fähigkeit des
Modells zur Nachbildung des Präzedenz Effektes testen. Schließlich beschreibt
der letzte Abschnitt die Ergebnisse der 54 unter realen Umweltbedingungen
durchgeführten Experimente. Dabei kamen Signale zur Anwendung, welche in
normalen sowie besonders stark verhallten Räumen aufgezeichnet wurden.
Kapitel 6 vergleicht diese Ergebnisse mit anderen biologisch motivierten und
technischen Verfahren zur Echounterdrückung und Schallquellenlokalisation und
führt den aktuellen Status der Hardwareimplementierung ein.
Kapitel 7 enthält schließlich eine kurze Zusammenfassung und einen Ausblick auf
weitere Forschungsobjekte und geplante Aktivitäten.
Diese Arbeit möchte zur Entwicklung der Computational Neuroscience beitragen,
indem sie versucht, in einem speziellen Anwendungsfeld die Lücke zwischen
biologischen Erkenntnissen, rechentechnischen Modellen und Hardware Engineering
zu schließen. Sie empfiehlt ein neues räumlich-zeitliches Paradigma der
dynamischen Informationsverarbeitung zur Erschließung biologischer Prinzipien
der Informationsverarbeitung für technische Anwendungen.This thesis investigates the biological background of the psycho-acoustical precedence effect, enabling humans to suppress echoes during the localization of sound sources. It provides a technically feasible and biologically plausible model for sound source localization under echoic conditions, ready to be used by technical systems during man-machine interactions.
The model is based upon own electro-physiological experiments in the mongolian gerbil. The first time in gerbils obtained results reveal a special behavior of specific cells of the dorsal nucleus of the lateral lemniscus (DNLL) - a distinct region in the auditory brainstem. The explored persistent inhibition effect of these cells seems to account for the base of echo suppression at higher auditory centers. The developed model proved capable to duplicate this behavior and suggests, that a strong and timely precise hyperpolarization is the basic mechanism behind this cell behavior. The developed neural architecture models the inner ear as well as five major nuclei of the auditory brainstem in their connectivity and intrinsic dynamics. It represents a new type of neural modeling described as Spike Interaction Models (SIM). SIM use the precise spatio-temporal interaction of single spike events for coding and processing of neural information. Their basic elements are Integrate-and-Fire Neurons and Hebbian synapses, which have been extended by specially designed dynamic transfer functions. The model is capable to detect time differences as small as 10 mircrosecondes and employs the principles of coincidence detection and precise local inhibition for auditory processing. It consists exclusively of elements of a specifically designed Neural Base Library (NBL), which has been developed for multi purpose modeling of Spike Interaction Models. This library extends the commercially available dynamic simulation environment of MATLAB/SIMULINK by different models of neurons and synapses simulating the intrinsic dynamic properties of neural cells. The usage of this library enables engineers as well as biologists to design their own, biologically plausible models of neural information processing without the need for detailed programming skills. Its graphical interface provides access to structural as well as parametric changes and is capable to display the time course of microscopic cell parameters as well as macroscopic firing pattern during simulations and thereafter. Two basic elements of the Neural Base Library have been prepared for implementation by specialized mixed analog-digital circuitry. First silicon implementations were realized by the team of the DFG Graduiertenkolleg GRK 164 and proved the possibility of fully parallel on line processing of sounds. By using the automated layout processor under development in the Graduiertenkolleg, it will be possible to design specific processors in order to apply theprinciples of distributed biological information processing to technical systems. These processors differ from classical von Neumann processors by the use of spatio temporal spike pattern instead of sequential binary values. They will extend the digital coding principle by the dimensions of space (spatial neighborhood), time (frequency, phase and amplitude) as well as the dynamics of analog potentials and introduce a new type of information processing.
This thesis consists of seven chapters, dedicated to the different areas of computational neuroscience.
Chapter 1: provides the motivation of this study arising from the attempt to investigate the biological principles of sound processing and make them available to technical systems interacting with humans under real world conditions. Furthermore, five reasons to use spike interaction models are given and their novel characteristics are discussed.
Chapter 2: introduces the biological principles of sound source localization and the precedence effect. Current hypothesis on echo suppression and the underlying principles of the precedence effect are discussed by reference to a small selection of physiological and psycho-acoustical experiments.
Chapter 3: describes the developed neural base library and introduces each of the designed neural simulation elements. It also explains the developed mathematical functions of the dynamic compartments and describes their general usage for dynamic simulation of spiking neural networks.
Chapter 4: introduces the developed specific model of the auditory brainstem, starting from the filtering cascade in the inner ear via more than 200 cells and 400 synapses in five auditory regions up to the directional sensor at the level of the auditory midbrain. It displays the employed parameter sets and contains basic hints for the set up and configuration of the simulation environment.
Chapter 5: consists of three sections, whereas the first one describes the set up and results of the own electro-physiological experiments. The second describes the results of 104 model simulations, performed to test the
models ability to duplicate psycho-acoustical effects like the precedence effect. Finally, the last section of this chapter contains the results of 54 real world experiments using natural sound signals, recorded under normal as well as highly reverberating conditions.
Chapter 6: compares the achieved results to other biologically motivated and technical models for echo suppression and sound source localization and introduces the current status of silicon implementation.
Chapter 7: finally provides a short summary and an outlook toward future research subjects and areas of investigation.
This thesis aims to contribute to the field of computational neuroscience by bridging the gap between biological
investigation, computational modeling and silicon engineering in a specific field of application. It suggests a new spatio-temporal paradigm of information processing in order to access the capabilities of biological systems for technical applications
- …