Towards evaluating multiple predominant melody annotations in jazz recordings

Abeßer, Jakob; Balke, Stefan; Dittmar, Christian; Driedger, Jonathan; Müller, Meinard

Towards evaluating multiple predominant melody annotations in jazz recordings

Authors: Jakob Abeßer
Stefan Balke
Christian Dittmar
Jonathan Driedger
Meinard Müller
Publication date
Publisher

Abstract

Melody estimation algorithms are typically evaluated by separately assessing the task of voice activity detection and fundamental frequency estimation. For both subtasks, computed results are typically compared to a single human reference annotation. This is problematic since different human experts may differ in how they specify a predominant melody, thus leading to a pool of equally valid reference annotations. In this paper, we address the problem of evaluating melody extraction algorithms within a jazz music scenario. Using four human and two automatically computed annotations, we discuss the limitations of standard evaluation measures and introduce an adaptation of Fleiss’ kappa that can better account for multiple reference annotations. Our experiments not only highlight the behavior of the different evaluation measures, but also give deeper insights into the melody extraction task

Similar works

Full text

Available Versions

Fraunhofer-ePrints

oai:fraunhofer.de:N-435939

Last time updated on 21/07/2017