Search CORE

1,741 research outputs found

Score-Informed Source Separation for Musical Audio Recordings [An overview]

Author: Ewert S
Mueller M
Pardo B
Plumbley MD
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2014
Field of study

(c) 2014 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other users, including reprinting/ republishing this material for advertising or promotional purposes, creating new collective works for resale or redistribution to servers or lists, or reuse of any copyrighted components of this work in other works

CiteSeerX

Crossref

Queen Mary Research Online

Surrey Research Insight

Robust Joint Alignment of Multiple Versions of a Piece of Music

Author: Dixon S
Ewert S
Wang S
Publication venue
Publication date: 09/01/2016
Field of study

Large music content libraries often comprise multiple versions of a piece of music. To establish a link between different versions, automatic music alignment methods map each position in one version to a corresponding position in another version. Due to the leeway in interpreting a piece, any two versions can differ significantly, for example, in terms of local tempo, articulation, or playing style. For a given pair of versions, these differences can be significant such that even state-of-the-art methods fail to identify a correct alignment. In this paper, we present a novel method that increases the robustness for difficult to align cases. Instead of aligning only pairs of versions as done in previous methods, our method aligns multiple versions in a joint manner. This way, the alignment can be computed by comparing each version not only with one but with several versions, which stabilizes the comparison and leads to an increase in alignment robustness. Using recordings from the Mazurka Project, the alignment error for our proposed method was 14% lower on average compared to a state-of-the-art method, with significantly less outliers (standard deviation 53% lower).Comment: International Society for Music Information Retrieval Conference (ISMIR

arXiv.org e-Print Archive

CiteSeerX

Queen Mary Research Online

Identifying Missing and Extra Notes in Piano Recordings Using Score-Informed Dictionary Learning

Author: Dixon S
Ewert S
Wang S
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 12/07/2017
Field of study

Crossref

Queen Mary Research Online

USING SCORE-INFORMED CONSTRAINTS FOR NMF-BASED SOURCE SEPARATION

Author: Ewert S
IEEE
Mueller M
Publication venue
Publication date: 28/04/2016
Field of study

Queen Mary Research Online

The Audio Degradation Toolbox and its Application to Robustness Evaluation

Author: Ewert S
International Society for Music Information Retrieval Conference (ISMIR 2013)
MAUCH M
Publication venue
Publication date: 01/01/2013
Field of study

We introduce the Audio Degradation Toolbox (ADT) for the controlled degradation of audio signals, and propose its usage as a means of evaluating and comparing the robustness of audio processing algorithms. Music recordings encountered in practical applications are subject to varied, sometimes unpredictable degradation. For example, audio is degraded by low-quality microphones, noisy recording environments, MP3 compression, dynamic compression in broadcasting or vinyl decay. In spite of this, no standard software for the degradation of audio exists, and music processing methods are usually evaluated against clean data. The ADT fills this gap by providing Matlab scripts that emulate a wide range of degradation types. We describe 14 degradation units, and how they can be chained to create more complex, `real-world' degradations. The ADT also provides functionality to adjust existing ground-truth, correcting for temporal distortions introduced by degradation. Using four different music informatics tasks, we show that performance strongly depends on the combination of method and degradation applied. We demonstrate that specific degradations can reduce or even reverse the performance difference between two competing methods. ADT source code, sounds, impulse responses and definitions are freely available for download

Queen Mary Research Online

COMPENSATING FOR ASYNCHRONIES BETWEEN MUSICAL VOICES IN SCORE-PERFORMANCE ALIGNMENT

Author: Dixon S
Ewert S
IEEE
Wang S
Publication venue
Publication date: 28/04/2016
Field of study

Queen Mary Research Online

Notentext-Informierte Quellentrennung für Musiksignale

Author: Driedger J
Ewert S
Müller M
Publication venue
Publication date: 28/04/2016
Field of study

codedemo: http://www.audiolabs-erlangen.de/resources/2013-ACMMM-AudioDecomp/codedemo: http://www.audiolabs-erlangen.de/resources/2013-ACMMM-AudioDecomp/codedemo: http://www.audiolabs-erlangen.de/resources/2013-ACMMM-AudioDecomp/codedemo: http://www.audiolabs-erlangen.de/resources/2013-ACMMM-AudioDecomp

Queen Mary Research Online

STRUCTURED DROPOUT FOR WEAK LABEL AND MULTI-INSTANCE LEARNING AND ITS APPLICATION TO SCORE-INFORMED SOURCE SEPARATION

Author: Ewert S
IEEE
Sandler MB
Publication venue
Publication date: 08/08/2017
Field of study

Queen Mary Research Online

Improving Time-Scale Modification of Music Signals Using Harmonic-Percussive Separation

Author: Driedger J
Ewert S
Mueller M
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2014
Field of study

A major problem in time-scale modification (TSM) of music signals is that percussive transients are often perceptually degraded. To prevent this degradation, some TSM approaches try to explicitly identify transients in the input signal and to handle them in a special way. However, such approaches are problematic for two reasons. First, errors in the transient detection have an immediate influence on the final TSM result and, second, a perceptual transparent preservation of transients is by far not a trivial task. In this paper we present a TSM approach that handles transients implicitly by first separating the signal into a harmonic component as well as a percussive component which typically contains the transients. While the harmonic component is modified with a phase vocoder approach using a large frame size, the noise-like percussive component is modified with a simple time-domain overlap-add technique using a short frame size, which preserves the transients to a hig h degree without any explicit transient detection

Crossref

Fraunhofer-ePrints

Queen Mary Research Online