Search CORE

6 research outputs found

Melody Transcription From Music Audio: Approaches and Evaluation

Author: Andreas F. Ehmann
Daniel P. W. Ellis
Emilia Gómez
Beesuan Ong
Graham E. Poliner
Sebastian Streich
Publication venue: 'Columbia University Libraries/Information Services'
Publication date: 01/01/2007

Although the process of analyzing an audio recording of a music performance is complex and difficult even for a human listener, there are limited forms of information that may be tractably extracted and yet still enable interesting applications. We discuss melody--roughly, the part a listener might whistle or hum--as one such reduced descriptor of music audio, and consider how to define it, and what use it might be. We go on to describe the results of full-scale evaluations of melody transcription systems conducted in 2004 and 2005, including an overview of the systems submitted, details of how the evaluations were conducted, and a discussion of the results. For our definition of melody, current systems can achieve around 70% correct transcription at the frame level, including distinguishing between the presence or absence of the melody. Melodies transcribed at this level are readily recognizable, and show promise for practical applications.

Columbia University Academic Commons

Melody Transcription From Music Audio: Approaches and Evaluation

Author: Andreas F. Ehmann
Beesuan Ong
Daniel P. W. Ellis
Emilia Gómez
Graham E. Poliner
Sebastian Streich
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'

Crossref

Audio Engineering Society Convention Paper Presented at the 121st Convention

Author: Beesuan Ong
Emilia Gómez
Perfecto Herrera

This convention paper has been reproduced from the author's advance manuscript, without editing, corrections, or consideration by the Review Board. The AES takes no responsibility for the contents. Additional papers may be obtained by sending reques

CiteSeerX

Personal communication with A. Agogino

Author: Andreas F. Ehmann
Beesuan Ong
Daniel P. W. Ellis
Emilia Gómez
Graham E. Poliner
Sebastian Streich

Although the process of analyzing an audio recording of a music performance is complex and difficult even for a human listener, there are limited forms of information that may be tractably extracted and yet still enable interesting applications. We discuss melody – roughly, the part a listener might whistle or hum – as one such reduced descriptor of music audio, and consider how to define it, and what use it might be. We go on to describe the results of full-scale evaluations of melody transcription systems conducted in 2004 and 2005, including an overview of the systems submitted, details of how the evaluations were conducted, and a discussion of the results. For our definition of melody, current systems can achieve around 70% correct transcription at the frame level, including distinguishing between the presence or absence of the melody. Melodies transcribed at this level are readily recognizable, and show promise for practical applications.

CiteSeerX

ISMIR 2004 audio description contest

Author: Beesuan Ong
Emilia Gómez
Fabien Gouyon
Nicolas Wack
Pedro Cano
Perfecto Herrera
Sebastian Streich
Xavier Serra

Contest. We first detail the contest organization, evaluation metrics, data and infrastructure. We then provide the details and results of each contest in turn. Published papers and algorithm source codes are given when originally available. We finally discuss some aspects of these contests and propose ways to organize future, improved, audio description contests. This work is licenced under the Creative Common

CiteSeerX