Joint Optimization of the Frequency-domain and Time-domain Transformations in Deriving Generalized Static and Dynamic MFCCs

By Yiu-pong Lai, Student Member, Manhung Siu, Senior Member and Brian Mak

Abstract

Traditionally, static mel-frequency cepstral coefficients (MFCCs) are derived by discrete cosine transformation (DCT), and dynamic MFCCs are derived by linear regression. Their derivation may be generalized as a frequency-domain transformation of the log filter-bank energies (FBEs) followed by a time-domain transformation. In the past, these two transformations have usually been estimated or optimized separately. In this paper, we consider sequences of log FBEs as a set of spectrogram images, and investigate an image compression technique to jointly optimize the two transformations so that the reconstruction error of the spectrogram images is minimized; there is an efficient algorithm that solves the optimization problem. The framework allows extension to other optimization costs as well.

Index Terms: low-rank approximation of matrices, time-frequency representation, mel-frequency cepstral coefficients, discrete cosine transform
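The two-stage view described in the abstract can be illustrated with a minimal sketch of the traditional pipeline only (not the paper's joint optimization): a DCT matrix applied on the left of a block of log FBEs acts along the frequency axis, and a matrix applied on the right acts along the time axis. The function names, the unnormalized DCT-II, and the 5-frame regression window are assumptions made for illustration; they are not taken from the paper.

```python
import math

def dct_matrix(num_ceps, num_filters):
    """Unnormalized DCT-II basis; each row acts across the filter-bank axis."""
    return [[math.cos(math.pi * k * (2 * m + 1) / (2 * num_filters))
             for m in range(num_filters)]
            for k in range(num_ceps)]

def delta_weights(half_window):
    """Least-squares slope weights over 2 * half_window + 1 frames."""
    denom = 2 * sum(t * t for t in range(1, half_window + 1))
    return [t / denom for t in range(-half_window, half_window + 1)]

def matmul(a, b):
    """Plain matrix product of two lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)]
            for row in a]

def static_and_delta(log_fbe, num_ceps, half_window):
    """Map a (num_filters x num_frames) block of log FBEs to cepstra.

    Assumes num_frames == 2 * half_window + 1. Returns a
    (num_ceps x 2) matrix: column 0 holds the static coefficients of
    the centre frame, column 1 holds the delta coefficients.
    """
    num_frames = 2 * half_window + 1
    # Frequency-domain transformation: left-multiply by the DCT matrix.
    freq_transform = dct_matrix(num_ceps, len(log_fbe))
    cepstra = matmul(freq_transform, log_fbe)
    # Time-domain transformation: right-multiply by a matrix whose first
    # column selects the centre frame and whose second column regresses.
    slope = delta_weights(half_window)
    time_transform = [[1.0 if t == half_window else 0.0, slope[t]]
                      for t in range(num_frames)]
    return matmul(cepstra, time_transform)
```

Calling `static_and_delta(block, 3, 2)` on a 4-filter, 5-frame block returns a 3 x 2 matrix of static and delta coefficients. Written this way, both stages are fixed matrices on either side of the spectrogram block, which is the pair of transformations the abstract proposes to optimize jointly instead of fixing in advance.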

Year: 2016
OAI identifier: oai:CiteSeerX.psu:10.1.1.953.5231
Provided by: CiteSeerX