Location of Repository

The development of a design tool for 5-speaker surround sound decoders

By David J. Moore

Abstract

This thesis presents the development of a software-based decoder design tool (DDT) for producing Ambisonic decoders optimised for playback over 5-speaker layouts. The research specifically focuses on developing decoders for irregular layouts with loudspeakers at a constant radial distance from the central listening position. It was motivated by the desire to provide better surround sound over the standard ITU 5-speaker layout for listeners in the sweet spot and off-centre positions. A wide-ranging literature review is presented revealing the need for such work.\ud The DDT employs the Tabu Search algorithm to seek improved decoder parameters according to a multi-objective fitness function. The fitness function encapsulates criteria from psychoacoustic models as a set of objectives. In order to ensure the objectives were treated equally a method known as „range-removal‟ was used for the first time in Ambisonic decoder design. A companion technique termed „importance‟ allows the systematic prioritisation of range-removed objectives giving a designer control over desired decoder criteria.\ud Additional elements exist in the DDT that can be turned on or off in different combinations. They include: a novel component for producing decoders with even performance by angle, a novel component for producing performance that correlates with the pattern of human spatial resolution estimated in previous Minimum Audible Angle experiments, and the ability to produce frequency dependent or independent decoders of different orders. Moreover, the user of the DDT can optimise performance for a single listener or multiple distributed listeners. To make the DDT as interactive as possible searches can optionally run on a High Performance Computer.\ud This thesis also details the extensive testing of Ambisonic decoders for the ITU layout. Decoders have been assessed subjectively in listening tests and objectively using binaural measurements which has verified the methods developed in this research and the DDT‟s concept. Furthermore, decoders derived by the DDT have been compared to existing decoders and the results show they give equal or better performance.\ud The development of a fully-functioning DDT which incorporates techniques for range-removal, importance, even performance by angle, minimum audible angle, off-centre listeners and their use in any combination represent the key outcomes of this work

Topics: M1, T1
OAI identifier: oai:eprints.hud.ac.uk:9050

Suggested articles

Preview

Citations

  1. (2000). 5.1 Surround Sound: Up and Running.
  2. (1962). 73(Part A - Technical),
  3. (2005). A Binaural Model to Predict Position and Extension of Spatial Images Created with Standard Sound Recording Techniques. Presented at the 119 th Audio Engineering Society Convention,
  4. (1999). A Comparative Study of 3-D Audio Encoding and Rendering Techniques.
  5. (2007). A Comparative Study of the Performance of Spatialization Techniques for a Distributed Audience in a Concert Hall Environment.
  6. (1997). A comparison of heuristic search algorithms for molecular docking.
  7. (2002). A comparison of the performance of different metaheuristics on the timetabling problem.
  8. (2007). A composite-neighborhood tabu search approach to the traveling tournament problem.
  9. (1989). A Computer Model of Binaural Localization for Stereo Imaging Measurement. Presented at the 87 th Audio Engineering Society Convention,
  10. (1982). A Computer Model of Binaural Localization. Presented at the 72 th Audio Engineering Society Convention,
  11. (2008). A Method for Estimating Interaural Time Difference for Binaural Synthesis. Presented at the 125 th Audio Engineering Society Convention
  12. (1992). A model of head-related transfer functions based on principal components analysis and minimum-phase reconstruction.
  13. (1985). A New Approach to the Assessment of Stereophonic Sound System Performance.
  14. (2005). A Study of Multi-objective Optimization Methods for Engineering Applications.
  15. (2004). A Symposium on Multichannel Audio for Radio Broadcasters.
  16. (1994). A Tabu Search Heuristic for the Vehicle Routing Problem.
  17. (1995). A Tutorial on Tabu Search,
  18. (1998). A Wavefield Approach to Multichannel Sound. Presented at the 104 th Audio Engineering Society Convention,
  19. (1997). Acoustics - Measurement of the Reverberation Time of Rooms with Reference to other Acoustical Parameters, Geneva, Switzerland: International Organisation for Standardisation,
  20. (2001). Acoustics and Psychoacoustics,
  21. (1984). Acuity of Sound Localisation: a Topography of Auditory Space. 1. Normal Hearing Conditions.
  22. (2004). Advanced Multichannel Audio Systems with Superior Impression of Presence and Reality. Presented at the 116 th Audio Engineering Society Convention,
  23. (2007). Advancements in Impulse Reponse Measurements by Sine Sweeps. Presented at the 122 th Audio Engineering Society Convention,
  24. (2001). Ambiophonics: Achieving Psychological Realism in Music Recording and Reproduction. Presented at the 111 th Audio Engineering Society Convention,
  25. (1992). Ambisonic Decoders for HDTV. Presented at the 92 nd Audio Engineering Society Convention
  26. (1995). Ambisonic Sound for Us. Presented at the 99 th Audio Engineering Society Convention,
  27. (1998). Ambisonics Encoding of Other Audio Formats for Multiple Listening Conditions. Presented at the 105 th Audio Engineering Society Convention,
  28. (1985). Ambisonics in Multichannel Broadcasting and Video.
  29. (2008). An Efficient Method to Generate Particle Sounds in Wave Field Synthesis. Presented at the 125 th Audio Engineering Society Convention,
  30. (2003). An Introduction to the Psychology of Hearing,
  31. (2004). An Investigation into the real-time Manipulation and Control of ThreeDimentional Sound Fields.
  32. (2002). An Objective Model of Localisation in Binaural Sound Reproduction Systems.
  33. (1993). An Overview of Evolutionary Algorithms for Parameter Optimization.
  34. (1999). Analyzing Virtual Sound Source Attributes Using a Binaural Auditory Model.
  35. (1987). Anomalies of Wavefront Reconstruction in Stereo and Surround Sound Reproduction. Presented at the 83 rd Audio Engineering Society Convention,
  36. (1999). Auditory Localization of Nearby Sources. Head Related Transfer Functions.
  37. (1972). Auditory Localization. In
  38. (2003). Auditory Spatial Resolution in Horizontal, Vertical, and Diagonal Planes.
  39. (2008). AURORA Plug-ins. Available at: http://www.auroraplugins.com/Aurora_XP/index.htm [Accessed
  40. (2008). Available at: http://gravity.phy.umassd.edu/ps3.html [Accessed
  41. (2009). Available at: http://interface.cipic.ucdavis.edu/CIL_html/CIL_HRTF_database.htm [Accessed
  42. (1996). Binaural Technique: Do we need Individual Recordings?
  43. (2001). Boundary element method calculation of individual head-related transfer function. I. Rigid model calculation.
  44. (2001). Boundary element method calculation of individual head-related transfer function. II. Impedance effects and comparisons to real measurements.
  45. (2004). CIPIC Interface Laboratory: HRTF Database. The CIPIC HRTF Database.
  46. (2008). ClearSpeed - Home. Available at: http://www.clearspeed.com/ [Accessed
  47. (1977). Coincident microphone simulation covering three dimensional space and yielding various directional outputs.
  48. (1987). Computer Control of Ambisonic Soundfields. Presented at the 82 nd Audio Engineering Society Convention,
  49. (2003). Continuous Surround Panning for 5-Speaker Reproduction. Presented at the 24
  50. (2002). Contribution of spectral cues to human sound localization.
  51. (1977). Creating Source Elevation Illusions by Spectral Manipulation.
  52. (2005). Cues for Localisation in the Horizontal Plane. Presented at the th Audio Engineering Society Convention,
  53. (2006). Decoding Second Order Ambisonics to 5.1 Surround Systems. Presented at the 121 st Audio Engineering Society Convention,
  54. (1989). Detection and discrimination of spectral peaks and notches at 1 and 8 kHz.
  55. (1989). Detection and Discrimination of Spectral Peaks and Notches at 1 kHz and 8 kHz.
  56. (2002). Determination of the Relative Hierarchy of Audible Cues
  57. (1972). Discrete-Matrix Multichannel Stereo.
  58. (1967). Effect of Induced Head Movements on Localization of Direction of Sounds.
  59. (2001). Effective Heuristics for the GPS Survey Network of Malta: Simulated Annealing and Tabu Search Techniques.
  60. (2005). Effects of Uncertainty on auditory spatial resolution in the horizontal plane.
  61. (2007). Examination of Multichannel Sound-Field Recomposition Utilizing Frequency-Dependent Interaural Cross Correlation (FIACC).
  62. (1992). Experience with large area 3-D Ambisonic Sound Systems.
  63. (2008). Exploiting Human Spatial Resolution in Surround Sound Decoder Design. Presented at the 125 th Audio Engineering Society Convention,
  64. (2003). Further Investigations of High Order Ambisonics and Wavefield Synthesis for Holophonic Sound Imaging. Presented at the 114 th Audio Engineering Society Convention,
  65. (2004). Further Study of Sound Field Coding with Higher Order Ambisonics. Presented at the 116 th Audio Engineering Society Convention,
  66. (1992). General Metatheory of Auditory Localisation. Presented at the 92 nd Audio Engineering Society Convention,
  67. (1998). Genetic Algorithms + Data Structures = Evolution Programs,
  68. (2000). Grid computing: the European Data Grid Project.
  69. (2008). Has Ambisonics Finally Come of Age?
  70. (1991). HDTV Sound Systems: How many Channels? Presented at
  71. (2007). Head-Related Transfer Function Calculation Using Boundary Element Method. Presented at the 122 th Audio Engineering Society Convention,
  72. (1995). Headphone System with out-of-head Localisation Applying Dynamic HRTF (Head Related Transfer Function).
  73. (2003). History of Spatial Coding.
  74. (2007). Hollywood sound for Cricklewood Money. Presented at the 22 psychoacoustics to audio),
  75. (2008). Home Audio Division. CEA: Home Audio Division - Home Audio Division. Available at:
  76. (2003). HRTF personalization using anthropometric measurements.
  77. (1998). Implementation of Transaural Systems in Software on a PC. Presented at the th Audio Engineering Society Convention,
  78. (1972). Interaural Crosscorrelation for Multichannel Loudspeaker Reproduction.
  79. (2001). Introduction to Head-Related Transfer Functions (HRTFs): Representations of HRTFs in Time, Frequency, and Space.
  80. (1985). Latency, and Listener-Search Behaviour in Localization in the Horizonal and Vertical Planes.
  81. (2001). Localisation of Amplitude Panned Virtual Sources 1: Stereophonic Panning.
  82. (2006). Localization in Horizontal-Only Ambisonic Systems. Presented at the 121 st Audio Engineering Society Convention,
  83. (2001). Localization of Amplitude-Panned Virtual Sources II: Two- and ThreeDimensional Panning.
  84. (1957). Localization of High-Frequency Tones.
  85. (2002). Localization of Lateral Phantom Images in a 5-Channel System with and without Simulated Early Reflections. Presented at the 113 th Audio Engineering Society Convention,
  86. (1977). Localization of Lateral Phantom Sources.
  87. (2005). Localization of virtual sources in multichannel audio reproduction. Speech and Audio Processing,
  88. (1993). Localization using Non-Individualized Head-Related Transfer Functions.
  89. (1994). Measuring a Dummy Head in Search of Pinna Cues.
  90. (2008). Methods for Sharing Stereo and Multichannel Recordings Among Planetariums. Presented at the 124 th Audio Engineering Society Convention Amsterdam, The Netherlands,
  91. (2000). Minimum Audible Angles in the Horizontal and Vertical Planes: Effects of Stimulas Onset Asynchrony and Burst Duration.
  92. (2007). Monophonic Source Localization for a Distributed Audience in a Small Concert Hall.
  93. (2009). MRC Institute of Hearing Research. Available at: http://www.ihr.mrc.ac.uk/products/index.php?page=matlab [Accessed
  94. (1994). Multichannel stereophonic sound system with and without accompanying picture,
  95. (2008). New Dimensions for Ambisonics. Presented at the 124 th Audio Engineering Society Convention,
  96. (1999). Objective Measures of Spaciousness and Envelopment. Presented at the th International Audio Engineering Society Conference,
  97. (1907). On Our Perception of Sound Direction.
  98. (1989). On the Minimum Audible Angle - A Descision Theory Approach.
  99. (1958). On the Minimum Audible Angle.
  100. (2002). On the Theory of the Second-Order Soundfield
  101. (1983). Optimization by simulated annealing.
  102. (1992). Optimum Reproduction Matrices for Multispeaker Stereo.
  103. (2000). Perceptual coding of digital audio.
  104. (2006). Perceptual Recalibration in Human Sound Localization: Learning to Remediate Front-Back Reversals.
  105. (1961). Phasor Analysis of Some Stereophonic Phenomena.
  106. (1974). Pinna Reflections as Cues for Localization.
  107. (1980). Practical Periphony: The Reproduction of Full-Sphere Sound. Presented at the 65 th Audio Engineering Society Convention,
  108. (1998). Presented at the 105 th Audio Engineering Society Convention,
  109. (2006). Presented at the 120 th Audio Engineering Society Convention,
  110. (2006). Presented at the 121 st Audio Engineering Society Convention,
  111. (2008). Presented at the 125 th Audio Engineering Society Convention,
  112. (1993). Presented at the 94 th Audio Engineering Society Convention,
  113. (1973). Problem of Localization in the Median Plane: Effect of Pinnae Cavity Occlusion.
  114. (1988). Prospects for Transaural Recording. Presented at the 85 th Audio Engineering Society Convention,
  115. (2007). Psychoacoustics in Sound Reinforcement and PA System Design. Presented at the 22 nd UK Audio Engineering Society Conference, Illusions in Sound: the application of psychoacoustics to audio.
  116. (2003). Radio in 5.1 - The True Experience. Presented at the 24
  117. (1997). Reactive search, a history-based heuristic for MAX-SAT.
  118. (1999). Recent Advances in AI Planning,
  119. (2001). Reproduction of a plane-wave sound field using an array of loudspeakers. Speech and Audio Processing,
  120. (1999). Resolution of front--back ambiguity in spatial hearing by listener and source movement.
  121. (2007). Robust Two-Dimensional Surround Sound Reproduction for Nonuniform Loudspeaker Layouts.
  122. (2005). Shelf Filters for Ambisonic Decoders, Available at: http://www.ambisonia.com/Members/ricardo/shelfs.zip/view,
  123. (2000). Simultaneous Measurement of Impulse Response and Distortion with a Swept Sine. Presented at the 108 th Audio Engineering Society Convention,
  124. (1999). Solution of the Cumulative Assignment Problem With a Well-Structured TabuSearch Method.
  125. (1959). Some Measurements on the Effects of Interchannel Intensity and Time Differences in Two Channel Systems.
  126. (1999). Sound Source Localization in a Five-Channel Surround Sound Reproduction System. Presented at the 107 th Audio Engineering Society Convention
  127. (1937). Sound-transmission, sound-recording, and sound-reproducing system. Available at:
  128. (2008). Sound: Past, Present and Future. Available at: http://www.dolby.com/uploadedFiles/English_(US)/Professional/Technical_Libra ry/Technologies/Dolby_Surround/2_Surround_Past.Present.pdf [Accessed
  129. (2007). Spatial Audio Quality Evaluation: Comparing Transaural,
  130. (2001). Spatial Audio, Focal Press,
  131. (2001). Spatial Hearing: The Psychophysics of Human Sound Localization,
  132. (1981). Spatial impression due to early lateral reflections in concert halls: the derivation of a physical measure.
  133. (2007). Spatial Sound Reproduction with Directional Audio Coding.
  134. (2008). Statistics in a Nutshell: A Desktop Quick Reference (In a Nutshell,
  135. (1940). Stereophonic Sound Production.
  136. (1998). Surround Sound Apparatus.
  137. (1974). Surround-sound psychoacoustics.
  138. (2004). Survey of multi-objective optimization methods for engineering.
  139. (1989). Tabu Search - Part 1.
  140. (1990). Tabu Search - Part 2.
  141. (1997). The "Stereo Dipole" - A Virtual Source Imaging System Using Two Closely Spaced Loudspeakers.
  142. (2001). The analysis of multi-channel sound reproduction algorithms using
  143. (2006). The Computational and Storage Potential of Volunteer Computing.
  144. (2007). The Design and Detailed Analysis of First Order Ambisonic Decoders for the ITU layout. Presented at the 122 th Audio Engineering Society Convention,
  145. (2003). The Design and Optimisation of Surround Sound Decoders Using Heuristic Methods.
  146. (2008). The Design of Ambisonic Decoders for the ITU 5.1 Layout with Even Performance Characteristics. Presented at the 124 th Audio Engineering Society Convention,
  147. (2007). The Design of Improved First Order Ambisonic Decoders by the Application of Range-Removal and Importance in a Heuristic Search Algorithm.
  148. (1992). The Dominant Role of Low-Frequency Interaural Time Differences in Sound Localization.
  149. (2007). The Effect of Head Diffraction on Stereo Localisation in the Mid-Frequency Range. Presented at the 122 nd Audio Engineering Society Convention,
  150. (2003). The Effect of Head Motion on the Accuracy of Sound Localization.
  151. (2007). The Generation of Panning Laws for Irregular Speaker Arrays using Heuristic Methods.
  152. (1936). The Localization of Actual Sources of Sound.
  153. (1993). The New Sound Format "3/2-Stereo". Presented at the 10 th International Audio Engineering Society Conference,
  154. (1999). The normalized interaural correlation: Accounting for NoS pi thresholds obtained with Gaussian and ``low-noise'' masking noise.
  155. (2009). The Potential of High Peformance Computing in Audio Engineering. Presented at the 126 th Audio Engineering Society Convention,
  156. (1949). The Precedence Effect in Sound Localization.
  157. (1999). The Precedence Effect.
  158. (2003). The Significance of Phase as an Auditory Cue. Presented at the 114 th Audio Engineering Society Convention,
  159. (2002). Theory and design of high order sound field microphones using spherical microphone array.
  160. (2005). Theory and design of sound field reproduction in reverberant rooms.
  161. (2005). Three-Dimensional Surround Sound Systems Based on Spherical Harmonics.
  162. (1995). Time-domain modeling of peripheral auditory processing: A modular architecture and a software platform.
  163. (2008). Towards Ultra-High Resolution Models of Climate and Weather.
  164. (1991). Useful AI tools-a review of heuristic search methods.
  165. (1994). Virtual Imaging Capabilities of Surround Sound Systems.
  166. (1998). Virtual Sound Source Positioning and Mixing
  167. (1992). Wave Front Synthesis: A New Direction in Electroacoustics. Presented at the 93 rd Audio Engineering Society Convention,

To submit an update or takedown request for this paper, please submit an Update/Correction/Removal Request.