24 research outputs found

    Semantic Object Prediction and Spatial Sound Super-Resolution with Binaural Sounds

    Full text link
    Humans can robustly recognize and localize objects by integrating visual and auditory cues. While machines can now do the same with images, less work has been done with sounds. This work develops an approach for dense semantic labelling of sound-making objects, based purely on binaural sounds. We propose a novel sensor setup and record a new audio-visual dataset of street scenes with eight professional binaural microphones and a 360-degree camera. The co-existence of visual and audio cues is leveraged for supervision transfer. In particular, we employ a cross-modal distillation framework that consists of a vision `teacher' method and a sound `student' method -- the student is trained to produce the same results as the teacher. This way, the auditory system can be trained without human annotations. We also propose two auxiliary tasks, namely a) a novel Spatial Sound Super-resolution task to increase the spatial resolution of sounds, and b) dense depth prediction of the scene. We then formulate the three tasks into one end-to-end trainable multi-tasking network aiming to boost the overall performance. Experimental results on the dataset show that 1) our method achieves promising results for semantic prediction and the two auxiliary tasks; 2) the three tasks are mutually beneficial, with joint training achieving the best performance; and 3) the number and orientations of the microphones both matter. The data and code will be released to facilitate research in this new direction.
    Comment: Project page: https://www.trace.ethz.ch/publications/2020/sound_perception/index.htm
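    The cross-modal distillation described in this abstract lends itself to a short illustration. Below is a minimal sketch, assuming a hypothetical AudioStudent network and precomputed, frozen teacher logits; it is illustrative only and not the authors' released code, whose actual architectures and losses may differ. The idea: a vision `teacher' supplies soft per-pixel labels that supervise the binaural-sound `student', so no human annotations are needed.

```python
# Minimal sketch of cross-modal distillation (teacher-student supervision
# transfer). The network is a hypothetical stand-in, not the paper's model.
import torch
import torch.nn as nn
import torch.nn.functional as F

class AudioStudent(nn.Module):
    """Toy student: maps a binaural spectrogram to per-pixel class logits."""
    def __init__(self, n_classes=10):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(2, 32, 3, padding=1), nn.ReLU(),
            nn.Conv2d(32, n_classes, 1),
        )

    def forward(self, spec):
        return self.net(spec)

def distillation_loss(student_logits, teacher_logits, T=1.0):
    """KL divergence between softened teacher and student distributions."""
    p_teacher = F.softmax(teacher_logits / T, dim=1)
    log_p_student = F.log_softmax(student_logits / T, dim=1)
    return F.kl_div(log_p_student, p_teacher, reduction="batchmean") * T * T

student = AudioStudent()
optimizer = torch.optim.Adam(student.parameters(), lr=1e-4)

# One training step on dummy data: the frozen vision teacher's predictions
# on the 360-degree image serve as soft labels for the audio student.
spec = torch.randn(4, 2, 64, 64)              # binaural spectrograms
teacher_logits = torch.randn(4, 10, 64, 64)   # precomputed teacher output
optimizer.zero_grad()
loss = distillation_loss(student(spec), teacher_logits.detach())
loss.backward()
optimizer.step()
```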

    Behavioral and molecular genetics of reading-related AM and FM detection thresholds

    Get PDF
    Auditory detection thresholds for certain frequencies of both amplitude-modulated (AM) and frequency-modulated (FM) dynamic auditory stimuli are associated with reading ability in typically developing and dyslexic readers. We present the first behavioral and molecular genetic characterization of these two auditory traits. Participants from two extant extended-family datasets completed reading tasks and psychoacoustic tasks to determine 2 Hz FM and 20 Hz AM sensitivity thresholds. Univariate heritabilities were significant for both AM (h2 = 0.20) and FM (h2 = 0.29). Bayesian posterior probability of linkage (PPL) analysis found loci for AM (12q, PPL = 81%) and FM (10p, PPL = 32%; 20q, PPL = 65%). Bivariate heritability analyses revealed that FM is genetically correlated with reading, whereas AM is not. Bivariate PPL analysis indicates that the FM loci (10p, 20q) are not also associated with reading.
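    To make the heritability figures above concrete: a narrow-sense heritability of h2 = 0.29 means roughly 29% of the trait variance is attributable to additive genetic effects. The sketch below illustrates this with the textbook midparent-offspring regression estimator on simulated data; it is an illustrative stand-in, not the family-based variance-components or Bayesian PPL machinery the study actually used, and all data are synthetic.

```python
# Sketch: narrow-sense heritability via midparent-offspring regression.
# The slope of offspring trait on midparent trait estimates h^2.
import numpy as np

rng = np.random.default_rng(0)
n_families, h2_true = 2000, 0.3

# Simulate an additive trait: phenotype = genetic value + environment,
# with total phenotypic variance standardized to 1.
g_mother = rng.normal(0, np.sqrt(h2_true), n_families)
g_father = rng.normal(0, np.sqrt(h2_true), n_families)
env = lambda n: rng.normal(0, np.sqrt(1 - h2_true), n)
mother = g_mother + env(n_families)
father = g_father + env(n_families)

# Offspring inherit the average parental genetic value plus segregation
# noise, which restores the genetic variance to h^2 in the child generation.
g_child = 0.5 * (g_mother + g_father) + rng.normal(
    0, np.sqrt(h2_true / 2), n_families)
child = g_child + env(n_families)

midparent = 0.5 * (mother + father)
slope = np.cov(midparent, child)[0, 1] / np.var(midparent, ddof=1)
print(f"estimated h^2 ~ {slope:.2f} (true {h2_true})")
```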

    Sensory theories of developmental dyslexia: three challenges for research.

    Get PDF
    Recent years have seen the publication of a range of new theories suggesting that the basis of dyslexia might be sensory dysfunction. In this Opinion article, the evidence for and against several prominent sensory theories of dyslexia is closely scrutinized. Contrary to the causal claims being made, my analysis suggests that many proposed sensory deficits might result from the effects of reduced reading experience on the dyslexic brain. I therefore suggest that longitudinal studies of sensory processing, beginning in infancy, are required to successfully identify the neural basis of developmental dyslexia. Such studies could have a powerful impact on remediation.
    This is the accepted manuscript. The final version is available from NPG at http://www.nature.com/nrn/journal/v16/n1/abs/nrn3836.html

    Semantic Object Prediction and Spatial Sound Super-Resolution with Binaural Sounds

    No full text
    Humans can robustly recognize and localize objects by integrating visual and auditory cues. While machines can now do the same with images, less work has been done with sounds. This work develops an approach for dense semantic labelling of sound-making objects, based purely on binaural sounds. We propose a novel sensor setup and record a new audio-visual dataset of street scenes with eight professional binaural microphones and a 360-degree camera. The co-existence of visual and audio cues is leveraged for supervision transfer. In particular, we employ a cross-modal distillation framework that consists of a vision `teacher' method and a sound `student' method -- the student is trained to produce the same results as the teacher. This way, the auditory system can be trained without human annotations. We also propose two auxiliary tasks, namely a) a novel Spatial Sound Super-resolution task to increase the spatial resolution of sounds, and b) dense depth prediction of the scene. We then formulate the three tasks into one end-to-end trainable multi-tasking network aiming to boost the overall performance. Experimental results on the dataset show that 1) our method achieves good results on all three tasks; 2) the three tasks are mutually beneficial, with joint training achieving the best performance; and 3) the number and orientations of the microphones both matter.
    ISSN: 0302-9743; eISSN: 1611-3349