'Institute of Electrical and Electronics Engineers (IEEE)'
Doi
Abstract
In this paper we present a new approach for the generation of
multi-instrument symbolic music driven by musical emotion. The principal
novelty of our approach centres on conditioning a state-of-the-art transformer
based on continuous-valued valence and arousal labels. In addition, we provide
a new large-scale dataset of symbolic music paired with emotion labels in terms
of valence and arousal. We evaluate our approach in a quantitative manner in
two ways, first by measuring its note prediction accuracy, and second via a
regression task in the valence-arousal plane. Our results demonstrate that our
proposed approaches outperform conditioning using control tokens which is
representative of the current state of the art.‘la Caixa’’ Foundation under Grant 100010434 and Grant LCF/BQ/DI19/1173003 - FCT—Foundation for Science and Technology, I.P., through the Project MERGE through the National Funds (PIDDAC) through the Portuguese State Budget under Grant PTDC/CCI-COM/3171/2021 - European Social Fund through the Regional Operational Program Centro 2020 Project CISUC under Grant UID/CEC/00326/2020info:eu-repo/semantics/publishedVersio