Search CORE

18 research outputs found

Modal Synthesis of Weapon Sounds

Author: Audio Engineering Society Conference: 61st International Conference: Audio for Games
Mengual L
MOFFAT DJ
Reiss JD
Publication venue
Publication date: 24/05/2016
Field of study

Leveraging Neural Representations for Audio Manipulation

Author: 154th Convention of the Audio Engineering Society
Hawley S
Steinmetz C
Publication venue
Publication date: 10/04/2023
Field of study

We investigate applying audio manipulations using pretrained neural network-based autoencoders as an alternative to traditional signal processing methods, since the former may provide greater semantic or perceptual organization. To establish the potential of this approach, we first establish if representations from these models encode information about manipulations. We carry out experiments and produce visualizations using representations from two different pretrained autoencoders. Our findings indicate that, while some information about audio manipulations is encoded, this information is both limited and encoded in a non-trivial way. This is supported by our attempts to visualize these representations, which demonstrated that trajectories of representations for common manipulations are typically nonlinear and content dependent, even for linear signal manipulations. As a result, it is not yet clear how these pretrained autoencoders can be used to manipulate audio signals, however, our results indicate this may be due to the lack of disentanglement with respect to common audio manipulations

Queen Mary Research Online

An Audio-Driven System for Real-Time Music Visualisation

Author: Audio Engineering Society Convention 150
Barthet M
Graf M
Opara HC
Publication venue: 'American Society for Aesthetics'
Publication date: 24/06/2021
Field of study

Computer-generated visualisations can accompany recorded or live music to create novel audiovisual experiences for audiences. We present a system to streamline the creation of audio-driven visualisations based on audio feature extraction and mapping interfaces. Its architecture is based on three modular software components: backend (audio plugin), frontend (3D game-like environment), and middleware (visual mapping interface). We conducted a user evaluation comprising two stages. Results from the first stage (34 participants) indicate that music visualisations generated with the system were significantly better at complementing the music than a baseline visualisation. Nine participants took part in the second stage involving interactive tasks. Overall, the system yielded a Creativity Support Index above average (68.1) and a System Usability Scale index (58.6) suggesting that ease of use can be improved. Thematic analysis revealed that participants enjoyed the system’s synchronicity and expressive capabilities, but found technical problems and difficulties understanding the audio feature terminology

Queen Mary Research Online

Alignment and Timeline Construction for Incomplete Analogue Audience Recordings of Historical Live Music Concerts

Author: 147th Audio Engineering Society International Convention 2019
Sandler MB
Thalmann F
Wilmering T
Publication venue
Publication date: 08/10/2019
Field of study

Analogue recordings pose specific problems during automatic alignment, such as distortion due to physical degradation, or differences in tape speed during recording, copying, and digitisation. Oftentimes, recordings are incomplete, exhibiting gaps with different lengths. In this paper we propose a method to align multiple digitised analogue recordings of same concerts of varying quality and song segmentations. The process includes the automatic construction of a reference concert timeline. We evaluate alignment methods on a synthetic dataset and apply our algorithm to real-world data

Queen Mary Research Online