A Feature Learning Siamese Model for Intelligent Control of the Dynamic
  Range Compressor

Fazekas, György; Sheng, Di

Publication date: 1 May 2019

Abstract

This paper proposes a siamese DNN model that learns the characteristics of the audio dynamic range compressor (DRC), a widely used non-linear audio signal conditioning technique in music production, speech communication and broadcasting. The model enables an intelligent control system that uses audio examples to configure the DRC. Several alternative siamese DNN architectures are proposed to learn feature embeddings that can characterise the subtle effects of dynamic range compression. These models are compared with each other as well as with handcrafted features proposed in previous work. An evaluation of the relationship between the DNN hyperparameters and the DRC parameters is also provided. The best model produces a universal feature embedding capable of predicting multiple DRC parameters simultaneously, which is a significant improvement over our previous research. The feature embedding outperforms handcrafted audio features when predicting DRC parameters for both mono-instrument audio loops and polyphonic music pieces.

Comment: 8 pages, accepted in IJCNN 201
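The siamese idea in the abstract can be illustrated with a minimal sketch. This is not the authors' architecture: the layer sizes, the random untrained weights, the four-output head and all function names below are assumptions made only for illustration. The sketch shows the one property that defines a siamese model, namely that both inputs (for example, features of a reference audio clip and of a target clip) pass through the same encoder with shared weights before a head maps the joint embedding to parameter predictions.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical sizes: input feature length, hidden units, embedding size,
# and number of predicted DRC parameters (all assumed, not from the paper).
N_IN, N_HIDDEN, N_EMBED, N_PARAMS = 128, 64, 16, 4


def init_encoder(n_in, n_hidden, n_embed):
    """Create random (untrained) weights for a two-layer encoder."""
    return {
        "W1": rng.normal(0.0, 0.1, (n_in, n_hidden)),
        "b1": np.zeros(n_hidden),
        "W2": rng.normal(0.0, 0.1, (n_hidden, n_embed)),
        "b2": np.zeros(n_embed),
    }


def encode(enc, x):
    """Map a batch of input features to embeddings (ReLU hidden layer)."""
    h = np.maximum(0.0, x @ enc["W1"] + enc["b1"])
    return h @ enc["W2"] + enc["b2"]


def siamese_predict(enc, head, x_ref, x_target):
    """Encode both inputs with the SAME encoder, then predict parameters."""
    e_ref = encode(enc, x_ref)      # branch 1
    e_tgt = encode(enc, x_target)   # branch 2, shared weights
    joint = np.concatenate([e_ref, e_tgt], axis=-1)
    return joint @ head["W"] + head["b"]


enc = init_encoder(N_IN, N_HIDDEN, N_EMBED)
head = {
    "W": rng.normal(0.0, 0.1, (2 * N_EMBED, N_PARAMS)),
    "b": np.zeros(N_PARAMS),
}

# Stand-in inputs: random vectors in place of real audio features.
x_ref = rng.normal(size=(8, N_IN))
x_tgt = rng.normal(size=(8, N_IN))

pred = siamese_predict(enc, head, x_ref, x_tgt)
print(pred.shape)
```

In a trained system the encoder output would play the role of the learned feature embedding, and the head would be fitted to known DRC settings; here both are random, so the predictions carry no meaning beyond their shape.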


Last time updated on 10/08/2021