Melodic Phrase Segmentation By Deep Neural Networks

Guan, Yixing; Qiu, Yiqin; Xia, Gus; Zhang, Zheng; Zhao, Jinyu

Authors: Yixing Guan, Yiqin Qiu, Gus Xia, Zheng Zhang, Jinyu Zhao
Publication date: 14 November 2018

Abstract

Automated melodic phrase detection and segmentation is a classical task in content-based music information retrieval and a key step towards automated music structure analysis. However, traditional methods still cannot satisfy practical requirements. In this paper, we explore and adapt various neural network architectures to see whether they generalize to the symbolic representation of music and produce satisfactory melodic phrase segmentation. The main obstacle to applying deep-learning methods to phrase detection is the sparse labeling of training sets. We propose two tailored label-engineering schemes, with corresponding training techniques for different neural networks, so that decisions are made at the sequence level. Experimental results show that the CNN-CRF architecture performs best, offering finer segmentation and faster training, while CNN, Bi-LSTM-CNN and Bi-LSTM-CRF are acceptable alternatives.
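
To make the sparse labeling problem concrete, here is a minimal sketch in Python. It assumes a per-note binary labeling in which only the last note of each phrase is positive. The phrase lengths are made-up illustrative values, and the `widen` helper is a hypothetical way of densifying labels. The abstract does not specify the authors' two label-engineering schemes, so this is not their method.

```python
from typing import List


def boundary_labels(phrase_lengths: List[int]) -> List[int]:
    """Per-note labels: 1 on the last note of each phrase, 0 elsewhere."""
    labels: List[int] = []
    for n in phrase_lengths:
        labels.extend([0] * (n - 1) + [1])
    return labels


def positive_rate(labels: List[int]) -> float:
    """Fraction of notes that carry a positive (boundary) label."""
    return sum(labels) / len(labels)


def widen(labels: List[int], radius: int) -> List[int]:
    """Hypothetical densification: also mark notes within `radius` of a boundary."""
    n = len(labels)
    out = [0] * n
    for i, y in enumerate(labels):
        if y:
            for j in range(max(0, i - radius), min(n, i + radius + 1)):
                out[j] = 1
    return out


# Made-up melody: four phrases of 8, 10, 6 and 8 notes (illustrative only).
phrase_lengths = [8, 10, 6, 8]
labels = boundary_labels(phrase_lengths)

print(f"notes: {len(labels)}, boundaries: {sum(labels)}")
print(f"positive rate: {positive_rate(labels):.3f}")
print(f"positive rate after widening by 1: {positive_rate(widen(labels, 1)):.3f}")
```

With only one positive label per phrase, a classifier that predicts "no boundary" everywhere is already correct on most notes, which is why some form of label engineering is needed before training.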
