A Convolutional-Attentional Neural Framework for Structure-Aware Performance-Score Synchronization

Agrawal, R; Dixon, S; Wolff, D

A Convolutional-Attentional Neural Framework for Structure-Aware Performance-Score Synchronization

Authors: R Agrawal
S Dixon
D Wolff
Publication date: 1 January 2022
Publisher: 'Institute of Electrical and Electronics Engineers (IEEE)'
Doi

Abstract

Performance-score synchronization is an integral task in signal processing, which entails generating an accurate mapping between an audio recording of a performance and the corresponding musical score. Traditional synchronization methods compute alignment using knowledge-driven and stochastic approaches, and are typically unable to generalize well to different domains and modalities. We present a novel data-driven method for structure-aware performance-score synchronization. We propose a convolutional-attentional architecture trained with a custom loss based on time-series divergence. We conduct experiments for the audio-to-MIDI and audio-to-image alignment tasks pertained to different score modalities. We validate the effectiveness of our method via ablation studies and comparisons with state-of-the-art alignment approaches. We demonstrate that our approach outperforms previous synchronization methods for a variety of test settings across score modalities and acoustic conditions. Our method is also robust to structural differences between the performance and score sequences, which is a common limitation of standard alignment approaches.Comment: Published in IEEE Signal Processing Letters, Volume 29, December 202

Similar works

Full text

Open in the Core reader

Download PDF

Available Versions

Supporting member

Queen Mary Research Online

oai:qmro.qmul.ac.uk:123456789/...

Last time updated on 24/04/2022