68 research outputs found

    A Web Audio Node for the Fast Creation of Natural Language Interfaces for Audio Production

    Audio production involves the use of tools such as reverberators, compressors, and equalizers to transform raw audio into a state ready for public consumption. These tools are in wide use by both musicians and expert audio engineers. The typical interfaces for these tools expose low-level signal parameters as controls for the audio effect. These parameters often have unintuitive names such as “feedback” or “low-high” that have little meaning to many people, making the tools difficult to learn and use. Such low-level interfaces are also common throughout audio production interfaces built on the Web Audio API. Recent work in bridging the semantic gap between verbal descriptions of audio effects (e.g. “underwater”, “warm”, “bright”) and low-level signal parameters has resulted in provably better interfaces for a population of laypeople. In that work, a vocabulary of hundreds of descriptive terms was crowdsourced, along with their mappings to audio effect settings for reverberation and equalization. In this paper, we present a Web Audio node that lets web developers leverage this vocabulary to easily create web-based audio effects tools that use natural language interfaces. Our Web Audio node and additional documentation can be accessed at https://interactiveaudiolab.github.io/audealize_api
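
    The core idea of the abstract above can be illustrated with a minimal sketch: a crowdsourced table maps descriptive terms to effect settings, and an interface looks settings up by word rather than by low-level parameter. The vocabulary words and EQ gain values below are illustrative placeholders, not the actual Audealize data.

```python
# Minimal sketch of a descriptor-to-EQ lookup in the spirit of the
# crowdsourced vocabulary described above. Words and gains are
# illustrative placeholders, not the actual Audealize data.

# Each descriptor maps to per-band gains in dB (toy 5-band EQ).
VOCABULARY = {
    "warm":    [4.0, 2.0, 0.0, -2.0, -4.0],
    "bright":  [-4.0, -2.0, 0.0, 2.0, 4.0],
    "muffled": [2.0, 0.0, -3.0, -6.0, -9.0],
}

def eq_for_descriptor(word):
    """Return EQ band gains for a descriptive term, or None if unknown."""
    return VOCABULARY.get(word.strip().lower())

print(eq_for_descriptor("Warm"))    # case-insensitive lookup
print(eq_for_descriptor("gritty"))  # term outside the vocabulary -> None
```

    A natural-language interface built this way only needs a text box: the user types a word, the table supplies the signal parameters.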

    A Semantic Approach To Autonomous Mixing


    From my pen to your ears: automatic production of radio plays from unstructured story text

    A radio play is a form of drama that exists in the acoustic domain and is usually consumed over broadcast radio. In this paper a method is proposed that, given a story in the form of unstructured text, produces a radio play that tells this story. First, information about characters, acting lines, and environments is retrieved from the text. The extracted information serves to generate a production script which can be used either by producers of radio drama, or subsequently used to automatically generate the radio play as an audio file. The system is evaluated in two parts: precision, recall, and F1 scores are computed for the information retrieval stage, while multi-stimulus listening tests are used for subjective evaluation of the generated audio
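
    The objective half of the evaluation described above can be sketched as set-based precision, recall, and F1 over the extracted items. The character names below are hypothetical examples, not data from the paper.

```python
# Sketch of precision/recall/F1 for an information-retrieval stage,
# e.g. comparing extracted character names against a reference list.
# The names used in the example are hypothetical.

def precision_recall_f1(extracted, reference):
    """Compute set-based precision, recall, and F1 score."""
    extracted, reference = set(extracted), set(reference)
    tp = len(extracted & reference)  # items both extracted and correct
    precision = tp / len(extracted) if extracted else 0.0
    recall = tp / len(reference) if reference else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if precision + recall else 0.0)
    return precision, recall, f1

# Example: the system found 3 characters, 2 correct, and missed 1.
p, r, f = precision_recall_f1({"Alice", "Bob", "Narrator"},
                              {"Alice", "Bob", "Carol"})
print(round(p, 3), round(r, 3), round(f, 3))  # 0.667 0.667 0.667
```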

    Word Embeddings for Automatic Equalization in Audio Mixing

    In recent years, machine learning has been widely adopted to automate the audio mixing process. Automatic mixing systems have been applied to various audio effects such as gain adjustment, equalization, and reverberation. These systems can be controlled through visual interfaces, audio examples, knobs, or semantic descriptors. Using semantic descriptors or textual information to control these systems is an effective way for artists to communicate their creative goals. In this paper, we explore the novel idea of using word embeddings to represent semantic descriptors. Word embeddings are generally obtained by training neural networks on large corpora of written text. These embeddings serve as the input layer of the neural network to create a translation from words to EQ settings. Using this technique, the machine learning model can also generate EQ settings for semantic descriptors that it has not seen before. We compare the EQ settings of humans with the predictions of the neural network to evaluate the quality of the predictions. The results show that the embedding layer enables the neural network to understand semantic descriptors. We observed that models with embedding layers perform better than those without, but still not as well as human labels

    Word Embeddings for Automatic Equalization in Audio Mixing

    In recent years, machine learning has been widely adopted to automate the audio mixing process. Automatic mixing systems have been applied to various audio effects such as gain adjustment, stereo panning, equalization, and reverberation. These systems can be controlled through visual interfaces, audio examples, knobs, or semantic descriptors. Using semantic descriptors or textual information to control these systems is an effective way for artists to communicate their creative goals. Furthermore, artists sometimes use non-technical words that may not be understood by the mixing system, or even by a mixing engineer. In this paper, we explore the novel idea of using word embeddings to represent semantic descriptors. Word embeddings are generally obtained by training neural networks on large corpora of written text. These embeddings serve as the input layer of the neural network to create a translation from words to EQ settings. Using this technique, the machine learning model can also generate EQ settings for semantic descriptors that it has not seen before. We perform experiments to demonstrate the feasibility of this idea. In addition, we compare the EQ settings of humans with the predictions of the neural network to evaluate the quality of the predictions. The results show that the embedding layer enables the neural network to understand semantic descriptors. We observed that models with embedding layers perform better than those without, but not as well as human labels
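
    The key property claimed above — that a model consuming word embeddings can handle descriptors it never saw in training — can be sketched with a toy embedding space. Here a nearest-neighbour lookup by cosine similarity stands in for the neural network, and the vectors and EQ gains are illustrative placeholders, not trained values from the paper.

```python
# Sketch: an unseen descriptor still has an embedding, and that embedding
# lies near related words, so a prediction can be made for it. Vectors
# and EQ settings below are toy placeholders, not trained values.
import math

# Toy 3-d word embeddings (a real system would use pretrained vectors).
EMBEDDINGS = {
    "warm":   [0.9, 0.1, 0.0],
    "bright": [0.0, 0.9, 0.1],
    "mellow": [0.8, 0.2, 0.1],  # never paired with an EQ during "training"
}

# Descriptors with known EQ settings (low/mid/high band gains in dB).
TRAINED_EQ = {
    "warm":   [3.0, 0.0, -3.0],
    "bright": [-3.0, 0.0, 3.0],
}

def cosine(u, v):
    """Cosine similarity between two vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm

def predict_eq(word):
    """Predict EQ gains for any descriptor that has an embedding."""
    emb = EMBEDDINGS[word]
    nearest = max(TRAINED_EQ, key=lambda w: cosine(emb, EMBEDDINGS[w]))
    return TRAINED_EQ[nearest]

print(predict_eq("mellow"))  # unseen word resolves to the "warm" setting
```

    A trained embedding-to-EQ network generalizes for the same underlying reason: nearby points in embedding space map to nearby outputs.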

    PnP Maxtools: Autonomous Parameter Control in MaxMSP Utilizing MIR Algorithms

    This research presents a new approach to computer automation through the implementation of novel real-time music information retrieval algorithms developed for this project. It documents the development of the PnP.Maxtools package, a set of open-source objects designed within the popular programming environment MaxMSP. The package is a set of pre/post-processing filters, objective and subjective timbral descriptors, audio effects, and other objects that are designed to be used together to compose music or improvise without the use of external controllers or hardware. The PnP.Maxtools package objects are designed to be used quickly and easily in a 'plug and play' style, with as few initial arguments as possible. The package is designed to take incoming audio from a microphone, analyze it, and use the analysis to control an audio effect on the incoming signal in real time. In this way, the audio content has a real musical and analogous relationship with the resulting musical transformations, while the control parameters become more multifaceted and better able to serve the needs of artists. The term Reflexive Automation is introduced to describe this unsupervised relationship between the content of the sound being analyzed and the analogous, automatic control over a specific musical parameter. A set of compositions is also presented that demonstrates ideal usage of the object categories for creating reflexive systems and achieving fully autonomous control over musical parameters
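
    The analyze-then-control loop described above can be sketched in a few lines: a feature extracted from the incoming signal directly drives an effect parameter, with no external controller. The choice of feature (per-frame RMS) and the mapping to a wet/dry mix are illustrative assumptions, not the PnP.Maxtools internals.

```python
# Minimal sketch of "reflexive automation": the audio's own content
# (here, its RMS level per frame) controls an effect parameter (here,
# a wet/dry mix). Feature and mapping are illustrative choices only.
import math

def frame_rms(frame):
    """Root-mean-square level of one audio frame."""
    return math.sqrt(sum(s * s for s in frame) / len(frame))

def rms_to_wet(rms, floor=0.0, ceil=1.0):
    """Map RMS to a wet/dry mix: louder input, wetter output."""
    return max(floor, min(ceil, rms))

def process(frames):
    """For each frame, derive the control value from the audio itself."""
    return [rms_to_wet(frame_rms(f)) for f in frames]

quiet = [0.1, -0.1, 0.1, -0.1]
loud  = [0.8, -0.8, 0.8, -0.8]
print(process([quiet, loud]))  # louder frame -> higher wet mix
```

    Because the control signal is computed from the audio being processed, the mapping stays "analogous" to the material in exactly the sense the abstract describes.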

    Analysis of Peer Reviews in Music Production

