82 research outputs found
Sensitivity of the human auditory cortex to acoustic degradation of speech and non-speech sounds
The perception of speech is usually an effortless and reliable process even in highly adverse listening conditions. In addition to external sound sources, the intelligibility of speech can be reduced by degradation of the structure of speech signal itself, for example by digital compression of sound. This kind of distortion may be even more detrimental to speech intelligibility than external distortion, given that the auditory system will not be able to utilize sound source-specific acoustic features, such as spatial location, to separate the distortion from the speech signal. The perceptual consequences of acoustic distortions on speech intelligibility have been extensively studied. However, the cortical mechanisms of speech perception in adverse listening conditions are not well known at present, particularly in situations where the speech signal itself is distorted. The aim of this thesis was to investigate the cortical mechanisms underlying speech perception in conditions where speech is less intelligible due to external distortion or as a result of digital compression.
In the studies of this thesis, the intelligibility of speech was varied either by digital compression or addition of stochastic noise. Cortical activity related to the speech stimuli was measured using magnetoencephalography (MEG). The results indicated that degradation of speech sounds by digital compression enhanced the evoked responses originating from the auditory cortex, whereas addition of stochastic noise did not modulate the cortical responses. Furthermore, it was shown that if the distortion was presented continuously in the background, the transient activity of auditory cortex was delayed. On the perceptual level, digital compression reduced the comprehensibility of speech more than additive stochastic noise. In addition, it was also demonstrated that prior knowledge of speech content enhanced the intelligibility of distorted speech substantially, and this perceptual change was associated with an increase in cortical activity within several regions adjacent to auditory cortex.
In conclusion, the results of this thesis show that the auditory cortex is very sensitive to the acoustic features of the distortion, while at later processing stages, several cortical areas reflect the intelligibility of speech. These findings suggest that the auditory system rapidly adapts to the variability of the auditory environment, and can efficiently utilize previous knowledge of speech content in deciphering acoustically degraded speech signals.Puheen havaitseminen on useimmiten vaivatonta ja luotettavaa myös erittäin huonoissa kuunteluolosuhteissa. Puheen ymmärrettävyys voi kuitenkin heikentyä ympäristön häiriölähteiden lisäksi myös silloin, kun puhesignaalin rakennetta muutetaan esimerkiksi pakkaamalla digitaalista ääntä. Tällainen häiriö voi heikentää ymmärrettävyyttä jopa ulkoisia häiriöitä voimakkaammin, koska kuulojärjestelmä ei pysty hyödyntämään äänilähteen ominaisuuksia, kuten äänen tulosuuntaa, häiriön erottelemisessa puheesta. Akustisten häiriöiden vaikutuksia puheen havaitsemiseen on tutkttu laajalti, mutta havaitsemiseen liittyvät aivomekanismit tunnetaan edelleen melko puutteelisesti etenkin tilanteissa, joissa itse puhesignaali on laadultaan heikentynyt. Tämän väitöskirjan tavoitteena oli tutkia puheen havaitsemisen aivomekanismeja tilanteissa, joissa puhesignaali on vaikeammin ymmärrettävissä joko ulkoisen äänilähteen tai digitaalisen pakkauksen vuoksi.
Väitöskirjan neljässä osatutkimuksessa lyhyiden puheäänien ja jatkuvan puheen ymmärrettävyyttä muokattiin joko digitaalisen pakkauksen kautta tai lisäämällä puhesignaaliin satunnaiskohinaa. Puheärsykkeisiin liittyvää aivotoimintaa tutkittiin magnetoenkefalografia-mittauksilla. Tutkimuksissa havaittiin, että kuuloaivokuorella syntyneet herätevasteet voimistuivat, kun puheääntä pakattiin digitaalisesti. Sen sijaan puheääniin lisätty satunnaiskohina ei vaikuttanut herätevasteisiin. Edelleen, mikäli puheäänien taustalla esitettiin jatkuvaa häiriötä, kuuloaivokuoren aktivoituminen viivästyi häiriön intensiteetin kasvaessa. Kuuntelukokeissa havaittiin, että digitaalinen pakkaus heikentää puheäänien ymmärrettävyyttä voimakkaammin kuin satunnaiskohina. Lisäksi osoitettiin, että aiempi tieto puheen sisällöstä paransi merkittävästi häiriöisen puheen ymmärrettävyyttä, mikä heijastui aivotoimintaan kuuloaivokuoren viereisillä aivoalueilla siten, että ymmärrettävä puhe aiheutti suuremman aktivaation kuin heikosti ymmärrettävä puhe.
Väitöskirjan tulokset osoittavat, että kuuloaivokuori on erittäin herkkä puheäänien akustisille häiriöille, ja myöhemmissä prosessoinnin vaiheissa useat kuuloaivokuoren viereiset aivoalueet heijastavat puheen ymmärrettävyyttä. Tulosten mukaan voi olettaa, että kuulojärjestelmä mukautuu nopeasti ääniympäristön vaihteluihin muun muassa hyödyntämällä aiempaa tietoa puheen sisällöstä tulkitessaan häiriöistä puhesignaalia
Pitch Enumeration: Failure to Subitize in Audition
Background: Subitizing involves recognition mechanisms that allow effortless enumeration of up to four visual objects, however despite ample resolution experimental data suggest that only one pitch can be reliably enumerated. This may be due to the grouping of tones according to harmonic relationships by recognition mechanisms prior to fine pitch processing. Poorer frequency resolution of auditory information available to recognition mechanisms may lead to unrelated tones being grouped, resulting in underestimation of pitch number. Methods, Results and Conclusion: We tested whether pitch enumeration is better for chords of full harmonic complex tones, where grouping errors are less likely, than for complexes with fewer and less accurately tuned harmonics. Chords of low familiarity were used to mitigate the possibility that participants would recognize the chord itself and simply recall the number of pitches. We found that accuracy of pitch enumeration was less than the visual system overall, and underestimation of pitch number increased for stimuli containing fewer harmonics. We conclude that harmonically related tones are first grouped at the poorer frequency resolution of the auditory nerve, leading to poor enumeration of more than one pitch
Insights on the Neuromagnetic Representation of Temporal Asymmetry in Human Auditory Cortex.
Communication sounds are typically asymmetric in time and human listeners are highly sensitive to this short-term temporal asymmetry. Nevertheless, causal neurophysiological correlates of auditory perceptual asymmetry remain largely elusive to our current analyses
and models. Auditory modelling and animal electrophysiological recordings suggest that perceptual asymmetry results from the presence of multiple time scales of temporal integration, central to the auditory periphery. To test this hypothesis we recorded auditory evoked fields (AEF) elicited by asymmetric sounds in humans. We found a strong correlation between perceived tonal salience of ramped and damped sinusoids and the AEFs, as quantified by the amplitude of the N100m dynamics. The N100m amplitude increased with stimulus
half-life time, showing a maximum difference between the ramped and damped stimulus for a modulation half-life time of 4 ms which is greatly reduced at 0.5 ms and 32 ms. This behaviour of the N100m closely parallels psychophysical data in a manner that: i) longer
half-life times are associated with a stronger tonal percept, and ii) perceptual differences between damped and ramped are maximal at 4 ms half-life time. Interestingly, differences in evoked fields were significantly stronger in the right hemisphere, indicating some degree of hemispheric specialisation. Furthermore, the N100m magnitude was successfully
explained by a pitch perception model using multiple scales of temporal integration of auditory
nerve activity patterns. This striking correlation between AEFs, perception, and model predictions suggests that the physiological mechanisms involved in the processing of pitch evoked by temporal asymmetric sounds are reflected in the N100m
Function and Assembly of a Chromatin-Associated RNase P that Is Required for Efficient Transcription by RNA Polymerase I
Human RNase P has been initially described as a tRNA processing enzyme, consisting of H1 RNA and at least ten distinct protein subunits. Recent findings, however, indicate that this catalytic ribonucleoprotein is also required for transcription of small noncoding RNA genes by RNA polymerase III (Pol III). Notably, subunits of human RNase P are localized in the nucleolus, thus raising the possibility that this ribonucleoprotein complex is implicated in transcription of rRNA genes by Pol I.By using biochemical and reverse genetic means we show here that human RNase P is required for efficient transcription of rDNA by Pol I. Thus, inactivation of RNase P by targeting its protein subunits for destruction by RNA interference or its H1 RNA moiety for specific cleavage causes marked reduction in transcription of rDNA by Pol I. However, RNase P restores Pol I transcription in a defined reconstitution system. Nuclear run on assays reveal that inactivation of RNase P reduces the level of nascent transcription by Pol I, and more considerably that of Pol III. Moreover, RNase P copurifies and associates with components of Pol I and its transcription factors and binds to chromatin of the promoter and coding region of rDNA. Strikingly, RNase P detaches from transcriptionally inactive rDNA in mitosis and reassociates with it at G1 phase through a dynamic and stepwise assembly process that is correlated with renewal of transcription.Our findings reveal that RNase P activates transcription of rDNA by Pol I through a novel assembly process and that this catalytic ribonucleoprotein determines the transcription output of Pol I and Pol III, two functionally coordinated transcription machineries
Function and Assembly of a Chromatin-Associated RNase P that Is Required for Efficient Transcription by RNA Polymerase I
Background: Human RNase P has been initially described as a tRNA processing enzyme, consisting of H1 RNA and at least ten distinct protein subunits. Recent findings, however, indicate that this catalytic ribonucleoprotein is also required for transcription of small noncoding RNA genes by RNA polymerase III (Pol III). Notably, subunits of human RNase P are localized in the nucleolus, thus raising the possibility that this ribonucleoprotein complex is implicated in transcription of rRNA genes by Pol I. Methodology/Principal Findings: By using biochemical and reverse genetic means we show here that human RNase P is required for efficient transcription of rDNA by Pol I. Thus, inactivation of RNase P by targeting its protein subunits for destruction by RNA interference or its H1 RNA moiety for specific cleavage causes marked reduction in transcription of rDNA by Pol I. However, RNase P restores Pol I transcription in a defined reconstitution system. Nuclear run on assays reveal that inactivation of RNase P reduces the level of nascent transcription by Pol I, and more considerably that of Pol III. Moreover, RNase P copurifies and associates with components of Pol I and its transcription factors and binds to chromatin of the promoter and coding region of rDNA. Strikingly, RNase P detaches from transcriptionally inactive rDNA in mitosis and reassociates with it at G1 phase through a dynamic and stepwise assembly process that is correlated with renewal of transcription
Auditory temporal processing in healthy aging: a magnetoencephalographic study
<p>Abstract</p> <p>Background</p> <p>Impaired speech perception is one of the major sequelae of aging. In addition to peripheral hearing loss, central deficits of auditory processing are supposed to contribute to the deterioration of speech perception in older individuals. To test the hypothesis that auditory temporal processing is compromised in aging, auditory evoked magnetic fields were recorded during stimulation with sequences of 4 rapidly recurring speech sounds in 28 healthy individuals aged 20 – 78 years.</p> <p>Results</p> <p>The decrement of the N1m amplitude during rapid auditory stimulation was not significantly different between older and younger adults. The amplitudes of the middle-latency P1m wave and of the long-latency N1m, however, were significantly larger in older than in younger participants.</p> <p>Conclusion</p> <p>The results of the present study do not provide evidence for the hypothesis that auditory temporal processing, as measured by the decrement (short-term habituation) of the major auditory evoked component, the N1m wave, is impaired in aging. The differences between these magnetoencephalographic findings and previously published behavioral data might be explained by differences in the experimental setting between the present study and previous behavioral studies, in terms of speech rate, attention, and masking noise. Significantly larger amplitudes of the P1m and N1m waves suggest that the cortical processing of individual sounds differs between younger and older individuals. This result adds to the growing evidence that brain functions, such as sensory processing, motor control and cognitive processing, can change during healthy aging, presumably due to experience-dependent neuroplastic mechanisms.</p
Interaction between the neuromagnetic responses to sound energy onset and pitch onset suggests common generators
The pitch-onset response (POR) is a negative component of the auditory evoked field which is elicited when the temporal fine structure of a continuous noise is regularized to produce a pitch perception without altering the gross spectral characteristics of the sound. Previously, we showed that the latency of the POR is inversely related to the pitch value and its amplitude is correlated with the salience of the pitch, suggesting that the underlying generators are part of a pitch-processing network [Krumbholz, K., Patterson, R.D., Seither-Preisler, A., Lammertmann, C. & Lütkenhöner, B. (2003) Cereb. Cortex,13, 765-772]. The source of the POR was located near the medial part of Heschl's gyrus. The present study was designed to determine whether the POR originates from the same generators as the energy-onset response (EOR) represented by the N100m/P200m complex. The EOR to the onset of a noise, and the POR to a subsequent transition from noise to pitch, were recorded as the time interval between the noise onset and the transition varied from 500 to 4000 ms. The mean amplitude of the POR increased by approximately 5.9 nA.m with each doubling of the time between noise onset and transition. This suggests an interaction between the POR and the EOR, which may be based on common neural generators
- …