
    Relative Pitch Perception and the Detection of Deviant Tone Patterns

    Most people are able to recognise familiar tunes even when they are played in a different key. It is assumed that this depends on a general capacity for relative pitch perception: the ability to recognise the pattern of inter-note intervals that characterises the tune. However, when healthy adults are required to detect rare deviant melodic patterns in a sequence of randomly transposed standard patterns, they perform close to chance. Musically experienced participants perform better than naïve participants, but even they find the task difficult, despite the fact that musical education includes training in interval recognition. To understand the source of this difficulty, we designed an experiment to explore the relative influence of the size of within-pattern intervals and between-pattern transpositions on detecting deviant melodic patterns. We found that task difficulty increases when patterns contain large intervals (5-7 semitones) rather than small intervals (1-3 semitones). While task difficulty increases substantially when transpositions are introduced, the effect of transposition size (large vs. small) is weaker. Widening the range of permissible intervals also makes the task more difficult. Furthermore, providing an initial exact repetition followed by subsequent transpositions does not improve performance. Although musical training correlates with task performance, we found no evidence that violations of musical intervals important in Western music (i.e. the perfect fifth or fourth) are more easily detected. In summary, relative pitch perception does not appear to be amenable to simple explanations based exclusively on invariant physical ratios.
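
    A minimal sketch, assuming melodies coded as MIDI note numbers (the melodies and function name are illustrative, not the study's stimuli), of why a transposed tune preserves its inter-note interval pattern while a deviant pattern does not:

        # Melodies as MIDI note numbers; relative pitch as the sequence of
        # inter-note intervals in semitones, which transposition preserves.
        def intervals(notes):
            return [b - a for a, b in zip(notes, notes[1:])]

        standard = [60, 64, 67, 65]              # C4 E4 G4 F4
        transposed = [n + 5 for n in standard]   # same tune, a fourth higher
        deviant = [60, 64, 66, 65]               # one note altered

        assert intervals(standard) == intervals(transposed)  # both [4, 3, -2]
        assert intervals(standard) != intervals(deviant)     # pattern deviates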

    Effect of stimulus type and pitch salience on pitch-sequence processing

    Using a same-different discrimination task, it has been shown that discrimination performance for sequences of complex tones varying just detectably in pitch is less dependent on sequence length (1, 2, or 4 elements) when the tones contain resolved harmonics than when they do not [Cousineau, Demany, and Pressnitzer (2009). J. Acoust. Soc. Am. 126, 3179-3187]. This effect had been attributed to the activation of automatic frequency-shift detectors (FSDs) by the shifts in resolved harmonics. The present study provides evidence against this hypothesis by showing that the sequence-processing advantage found for complex tones with resolved harmonics is not found for pure tones or other sounds thought to activate FSDs (narrow bands of noise and wide-band noises eliciting pitch sensations due to interaural phase shifts). The present results also indicate that for pitch sequences, processing performance is largely unrelated to pitch salience per se: for a fixed level of discriminability between sequence elements, sequences of elements with salient pitches are not necessarily better processed than sequences of elements with less salient pitches. An ideal-observer model for the same-different binary-sequence discrimination task is also developed in the present study. The model allows the computation of d' for this task using numerical methods.
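
    The paper's ideal-observer model is not specified in the abstract; the toy Monte Carlo sketch below (an assumption, not the paper's model) only illustrates how a sensitivity index for a same-different decision can be estimated numerically: simulate noisy observations, apply a criterion to their difference, and z-transform the resulting hit and false-alarm rates.

        import numpy as np
        from statistics import NormalDist

        z = NormalDist().inv_cdf              # probability -> z-score
        rng = np.random.default_rng(0)
        n, sigma, delta = 100_000, 1.0, 1.0   # trials, noise SD, pitch shift

        # "Same" trials share one mean; on "different" trials means differ by delta.
        d_same = rng.normal(0, sigma, n) - rng.normal(0, sigma, n)
        d_diff = rng.normal(delta, sigma, n) - rng.normal(0, sigma, n)

        crit = delta / 2                      # illustrative fixed criterion
        hits = float(np.mean(np.abs(d_diff) > crit))
        fas = float(np.mean(np.abs(d_same) > crit))
        print(z(hits) - z(fas))               # z(H) - z(F) sensitivity index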

    Dimension-specific attention directs learning and listening on auditory training tasks

    The relative contributions of bottom-up versus top-down sensory inputs to auditory learning are not well established. In our experiment, listeners were instructed to perform either a frequency discrimination (FD) task ("FD-train group") or an intensity discrimination (ID) task ("ID-train group") during training on a set of physically identical tones that were impossible to discriminate consistently above chance, allowing us to vary top-down attention whilst keeping bottom-up inputs fixed. A third, control group did not receive any training. Only the FD-train group improved on an FD probe following training, whereas all groups improved on the ID probe. However, only the ID-train group also showed changes in performance accuracy as a function of interval with training on the ID task. These findings suggest that top-down, dimension-specific attention can direct auditory learning, even when this learning is not reflected in conventional performance measures of threshold change.

    The contribution of visual information to the perception of speech in noise with and without informative temporal fine structure

    Understanding what is said in demanding listening situations is assisted greatly by looking at the face of a talker. Previous studies have observed that normal-hearing listeners can benefit from this visual information when a talker’s voice is presented in background noise. These benefits have also been observed in quiet listening conditions in cochlear-implant users, whose device does not convey the informative temporal fine structure cues in speech, and when normal-hearing individuals listen to speech processed to remove these informative temporal fine structure cues. The current study (1) characterised the benefits of visual information when listening in background noise; and (2) used sine-wave vocoding to compare the size of the visual benefit when speech is presented with or without informative temporal fine structure. The accuracy with which normal-hearing individuals reported words in spoken sentences was assessed across three experiments. The availability of visual information and informative temporal fine structure cues was varied within and across the experiments. The results showed a visual benefit with both open- and closed-set tests of speech perception. The size of the benefit increased when informative temporal fine structure cues were removed. This finding suggests that visual information may play an important role in the ability of cochlear-implant users to understand speech in many everyday situations. Models of audio-visual integration were able to account for the additional benefit of visual information when speech was degraded, and suggested that auditory and visual information was being integrated in a similar way in all conditions. The modelling results were consistent with the notion that audio-visual benefit is derived from the optimal combination of auditory and visual sensory cues.
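
    "Optimal combination" of sensory cues is usually formalised as maximum-likelihood cue integration, in which each cue is weighted by its reliability. The sketch below shows that standard textbook formalisation (an assumption here; the paper's exact models may differ).

        # Maximum-likelihood cue combination: each cue gives a noisy Gaussian
        # estimate; the optimal combined estimate weights each cue by its
        # inverse variance, and combined variance beats either cue alone.
        def combine(mu_a, var_a, mu_v, var_v):
            w_a = (1 / var_a) / (1 / var_a + 1 / var_v)  # auditory weight
            mu = w_a * mu_a + (1 - w_a) * mu_v           # weighted mean
            var = 1 / (1 / var_a + 1 / var_v)            # reduced variance
            return mu, var

        # In noise the auditory cue is unreliable (high variance), so the
        # visual cue dominates the combined percept.
        print(combine(mu_a=0.4, var_a=4.0, mu_v=0.6, var_v=1.0))  # ≈ (0.56, 0.8)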

    Individual Differences in Sound-in-Noise Perception Are Related to the Strength of Short-Latency Neural Responses to Noise

    Important sounds can be easily missed or misidentified in the presence of extraneous noise. We describe an auditory illusion in which a continuous ongoing tone becomes inaudible during a brief, non-masking noise burst more than one octave away, which is unexpected given the frequency resolution of human hearing. Participants strongly susceptible to this illusory discontinuity did not perceive illusory auditory continuity (in which a sound subjectively continues during a burst of masking noise) when the noises were short, yet did so at longer noise durations. Participants who were not prone to illusory discontinuity showed robust early electroencephalographic responses at 40–66 ms after noise burst onset, whereas those prone to the illusion lacked these early responses. These data suggest that short-latency neural responses to auditory scene components reflect subsequent individual differences in the parsing of auditory scenes.

    Enhanced Syllable Discrimination Thresholds in Musicians

    Speech processing inherently relies on the perception of specific, rapidly changing spectral and temporal acoustic features. Advanced acoustic perception is also integral to musical expertise, and accordingly several studies have demonstrated a significant relationship between musical training and superior processing of various aspects of speech. Speech and music appear to overlap in spectral and temporal features; however, it remains unclear which of these acoustic features, crucial for speech processing, are most closely associated with musical training. The present study examined the perceptual acuity of musicians to the acoustic components of speech necessary for intra-phonemic discrimination of synthetic syllables. We compared musicians and non-musicians on discrimination thresholds for three synthetic speech-syllable continua that varied in their spectral and temporal discrimination demands, specifically voice onset time (VOT) and amplitude-envelope cues in the temporal domain. Musicians demonstrated superior discrimination only for syllables that required resolution of temporal cues. Furthermore, performance on the temporal syllable continua positively correlated with the length and intensity of musical training. These findings support one potential mechanism by which musical training may selectively enhance speech perception, namely the reinforcement of temporal acuity and/or perception of amplitude rise time, and they have implications for the translation of musical training to long-term linguistic abilities.
    Funding: Grammy Foundation; William F. Milton Fund
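
    A minimal sketch of the amplitude rise-time cue mentioned above: two otherwise identical tones whose envelopes rise at different rates (all parameter values are illustrative; this is not the study's syllable synthesis).

        import numpy as np

        def tone_with_rise(f0=500.0, dur=0.3, rise=0.015, fs=44_100):
            """Sine tone with a linear onset ramp of `rise` seconds."""
            t = np.arange(int(dur * fs)) / fs
            env = np.minimum(t / rise, 1.0)   # linear rise, then sustain
            return env * np.sin(2 * np.pi * f0 * t)

        fast = tone_with_rise(rise=0.005)     # abrupt onset
        slow = tone_with_rise(rise=0.060)     # gradual onset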

    A Corticothalamic Circuit Model for Sound Identification in Complex Scenes

    The identification of the sound sources present in the environment is essential for the survival of many animals. However, these sounds are not presented in isolation, as natural scenes consist of a superposition of sounds originating from multiple sources. The identification of a source under these circumstances is a complex computational problem that is readily solved by most animals. We present a model of the thalamocortical circuit that performs level-invariant recognition of auditory objects in complex auditory scenes. The circuit identifies the objects present from a large dictionary of possible elements and operates reliably for real sound signals with multiple concurrently active sources. The key model assumption is that the activities of some cortical neurons encode the difference between the observed signal and an internal estimate. Reanalysis of awake auditory cortex recordings revealed neurons with patterns of activity corresponding to such an error signal.
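
    A toy sketch of that key assumption: some units carry the residual between the observed signal and an internal estimate built from a dictionary of candidate sources. The dimensions, learning rate, and gradient-descent inference below are assumptions for illustration, not the paper's circuit.

        import numpy as np

        rng = np.random.default_rng(1)
        D = rng.normal(size=(64, 10))            # dictionary of 10 candidate sources
        true = np.array([0, 1, 0, 0, 2, 0, 0, 0, 0, 0], float)
        x = D @ true                             # scene: sources 1 and 4 superposed

        a = np.zeros(10)                         # inferred source activations
        for _ in range(1000):
            error = x - D @ a                    # "error" units: signal minus estimate
            a += 0.005 * (D.T @ error)           # feedback step shrinks the residual
            a = np.maximum(a, 0.0)               # firing rates stay non-negative

        print(np.round(a, 2))                    # ≈ true: the active sources recovered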

    Pitch Comparisons between Electrical Stimulation of a Cochlear Implant and Acoustic Stimuli Presented to a Normal-hearing Contralateral Ear

    Four cochlear-implant users with normal hearing in the unimplanted ear compared the pitches of electrical and acoustic stimuli presented to the two ears. Comparisons were either between 1,031-pps pulse trains and pure tones, or between 12- and 25-pps electric pulse trains and bandpass-filtered acoustic pulse trains of the same rate. Three methods were used: pitch adjustment, constant stimuli, and interleaved adaptive procedures. For all methods, we showed that the results can be strongly influenced by non-sensory biases arising from the range of acoustic stimuli presented, and we proposed a series of checks that should be made to alert the experimenter to such biases. We then showed that the results of comparisons that survived these checks do not deviate consistently from the predictions of a widely used cochlear frequency-to-place formula or of a computational cochlear model. We also demonstrate that substantial range effects occur with other widely used experimental methods, even for normal-hearing listeners.
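
    The abstract does not name the frequency-to-place formula; Greenwood's (1990) human map is the widely used candidate, so the sketch below uses its constants as an assumption.

        import math

        A, a, k = 165.4, 2.1, 0.88   # Greenwood constants for the human cochlea

        def place_to_freq(x):
            """Characteristic frequency (Hz) at relative distance x (0=apex, 1=base)."""
            return A * (10 ** (a * x) - k)

        def freq_to_place(f):
            """Inverse map: relative cochlear position for frequency f in Hz."""
            return math.log10(f / A + k) / a

        print(round(place_to_freq(0.5), 1))     # mid-cochlea, roughly 1.7 kHz
        print(round(freq_to_place(1000.0), 3))  # 1 kHz sits ~40% along the cochlea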

    Finding Your Mate at a Cocktail Party: Frequency Separation Promotes Auditory Stream Segregation of Concurrent Voices in Multi-Species Frog Choruses

    Vocal communication in crowded social environments is a difficult problem for both humans and nonhuman animals. Yet many important social behaviors require listeners to detect, recognize, and discriminate among signals in a complex acoustic milieu comprising the overlapping signals of multiple individuals, often of multiple species. Humans exploit a relatively small number of acoustic cues to segregate overlapping voices (as well as other mixtures of concurrent sounds, like polyphonic music). By comparison, we know little about how nonhuman animals are adapted to solve similar communication problems. One important cue enabling source segregation in human speech communication is frequency separation between concurrent voices: differences in frequency promote perceptual segregation of overlapping voices into separate “auditory streams” that can be followed through time. In this study, we show that frequency separation (ΔF) also enables frogs to segregate concurrent vocalizations, such as those routinely encountered in mixed-species breeding choruses. We presented female gray treefrogs (Hyla chrysoscelis) with a pulsed target signal (simulating an attractive conspecific call) in the presence of a continuous stream of distractor pulses (simulating an overlapping, unattractive heterospecific call). When the ΔF between target and distractor was small (e.g., ≤3 semitones), females exhibited low levels of responsiveness, indicating a failure to recognize the target as an attractive signal when the distractor had a similar frequency. Subjects became increasingly responsive to the target, as indicated by shorter latencies for phonotaxis, as the ΔF between target and distractor increased (e.g., ΔF = 6–12 semitones). These results support the conclusion that gray treefrogs, like humans, can exploit frequency separation as a perceptual cue to segregate concurrent voices in noisy social environments. The ability of these frogs to segregate concurrent voices based on frequency separation may involve ancient hearing mechanisms for source segregation shared with humans and other vertebrates.
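
    For reference, the ΔF measure in semitones is 12·log2 of the frequency ratio; the example frequencies below are illustrative, not the study's stimuli.

        import math

        def semitones(f1, f2):
            """Frequency separation between f1 and f2 in semitones."""
            return abs(12 * math.log2(f1 / f2))

        print(round(semitones(2400, 2200), 2))  # ~1.5 st: small ΔF, voices fuse
        print(round(semitones(2400, 1200), 2))  # 12 st (an octave): voices segregate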