Search CORE

7 research outputs found

A machine-hearing system exploiting head movements for binaural sound localisation in reverberant conditions

Author: Brown G.J.
Ma N.
May T.
Wierstorf H.
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2015
Field of study

This paper is concerned with machine localisation of multiple active speech sources in reverberant environments using two (binaural) microphones. Such conditions typically present a problem for `classical' binaural models. Inspired by the human ability to utilise head movements, the current study investigated the influence of different head movement strategies on binaural sound localisation. A machine-hearing system that exploits a multi-step head rotation strategy for sound localisation was found to produce the best performance in simulated reverberant acoustic space. This paper also reports the public release of a free binaural room impulse responses (BRIRs) database that allows the simulation of head rotation used in this study

Crossref

White Rose Research Online

Online Research Database In Technology

A metric for predicting binaural speech intelligibility in stationary noise and competing speech maskers

Author: Bruno M. Fazenda
Cooke M.
Durlach N. I.
Falk T.
IEC 60268-16:2011
Mapp P.
Martin Cooke
Rothauser E. H.
Sauert B.
Sonnenscheinn D.
Taal C. H.
Tang Y.
Trevor J. Cox
Wierstorf H.
Yan Tang
Zurek P. M.
Publication venue: 'Acoustical Society of America (ASA)'
Publication date: 21/09/2016
Field of study

One criterion in the design of binaural sound scenes in audio production is the extent to which the intended speech message is correctly understood. Object-based audio broadcasting systems have permitted sound editors to gain more access to the metadata (e.g., intensity and location) of each sound source, providing better control over speech intelligibility. The current study describes and evaluates a binaural distortion-weighted glimpse proportion metric -- BiDWGP -- which is motivated by better-ear glimpsing and binaural masking level differences. BiDWGP predicts intelligibility from two alternative input forms: either binaural recordings or monophonic recordings from each sound source along with their locations. Two listening experiments were performed with stationary noise and competing speech, one in the presence of a single masker, the other with multiple maskers, for a variety of spatial conﬁgurations. Overall, BiDWGP with both input forms predicts listener keyword scores with correlations of 0.95 and 0.91 for single- and multi-masker conditions, respectively. When considering masker type separately, correlations rise to 0.95 and above for both types of maskers. Predictions using the two input forms are very similar, suggesting that BiDWGP can be applied to the design of sound scenes where only individual sound sources and their locations are available

University of Salford Institutional Repository

Crossref

Computing interaural differences through finite element modeling of idealized human heads

Author: Algazi V. R.
Brad Rakerd
Duda R. O.
Tingli Cai
Wierstorf H.
William M. Hartmann
Publication venue: 'Acoustical Society of America (ASA)'
Publication date
Field of study

Crossref

Software and data for the paper and presentation: Estimating the Loudness Balance of Musical Mixtures using Audio Source Separation

Author: Hummersone C. (5585873)
Mason R. D. (5585867)
Plumbley M. D. (5585870)
Ward D. (5585861)
Wierstorf H. (5585864)
Publication venue
Publication date
Field of study

All code and data used to produce the paper and presentation for the paper: D. Ward, H. Wierstorf, R. D. Mason, M. D. Plumbley, and C. Hummersone, “Estimating the Loudness Balance of Musical Mixtures using Audio Source Separation,” in 3rd Workshop on Intelligent Music Production, Salford, UK, 2017. The pdf of the paper can be accessed at http://epubs.surrey.ac.uk/841966/ Also, see the Musical Audio Repurposing using Source Separation (MARuSS) website: https://cvssp.github.io/maruss-website/ This publication corresponds to revision 7:ff82ab47b8f9 of the code repository which is maintained at: https://code.soundsoftware.ac.uk/hg/wimp17-ward-et-al</p

FigShare

The boomRoom

Author: Berkhout A
Blauert J.
Brungart D. S.
Daniel J.
de Vries D.
Fohl W.
Geier M.
Leslie G.
Melchior F.
Spors S.
Völk F.
Wierstorf H.
Zotter F.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date
Field of study

Crossref

The Effect of Nonlinear Amplitude Growth on the Speech Perception Benefits Provided by a Single-Sided Vocoder

Author: Buechner A.
Buss E.
Erbele I. D.
Grantham D. W.
Jessica M. Wess
Joshua G. W. Bernstein
Kawano A.
Moore B. C. J.
van Hoesel R. J. M.
Wierstorf H.
Zurek P. M.
Publication venue: 'American Speech Language Hearing Association'
Publication date
Field of study

Crossref

Abstracts of the Fourth Joint Annual Conference, Experimental and Clinical Short Papers Meetings of the British Society of Audiology

Author: Ahmmed A.U.
Anderson K
Clark J.G.
Cone-Wesson B.
David N Furness
Davis A
Davis A.
Dillon H.
English K.M
Ferguson M.
Gardner M.Y
Hines J
Hobson J
Janse E.
Keith R.W
Killion M.C
Kline P
Laudanski J.
Lindstone H.A.
Litovsky RY.
Lovett B.J.
MacMillan N.A.
McAlpine D.
McKenna L
Morillon B.
NICE clinical guideline 60
Oliver B.R.
Ostreicher H.J.
Pearson C.
Renfrew C.E.
Roberts J.C.
Roizen N
Scollie S
Scovel T
Smoski W.J.
Smoski W.J.
Stone M.A.
Thai-Van H
van Besouw R.M.
Weschler D
Wierstorf H.
Publication venue: 'Informa UK Limited'
Publication date
Field of study

Crossref