Search CORE

2 research outputs found

Temporal quantization of spatial information using directional clustering for multichannel audio coding

Author: Hiroshi Saruwatari
Keisuke Masatoki
Kiyohiro Shikano
Shigeki Miyabe
Toshiyuki Nomura
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 02/03/2023
Field of study

Institutional Repositories DataBase (IRDB)

Temporal quantization of spatial information using directional clustering for multichannel audio coding

Author: Hiroshi Saruwatari
Keisuke Masatoki
Kiyohiro Shikano
Shigeki Miyabe
Toshiyuki Nomura
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/10/2009
Field of study

WASPAA2009: IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, October 10-15, 2009, New Paltz, NewYork, USA,Binaural cue coding, which is a representing low bit-rate coding of multichannel audio, generates large distortion when the audio data have complex spatial image, such as symphony. Such distortion caused by the low frequency resolution of spatial information because BCC quantizes the parameters of localization. In this paper we propose a new coding framework by quantizing the spatial information temporally. The single-channel sum signal is panned to the multiple channels by selecting the prototypes of the spatial filter. Optimization of the prototypes with minimum coding error is given by a k-means-like clustering of the angles whose centroids are given by the first principal components of the covariances in the classes. The efficiency of the proposed coding with high quality is verified both in the objective and subjective evaluations

NAIST Academic Repository