Search CORE

211 research outputs found

Drum extraction in single channel audio signals using multi-layer non negative matrix factor deconvolution

Author: Kowalski Matthieu
Laroche Clément
Papadopoulos Hélène
Richard Gael
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 05/03/2017
Field of study

International audienceIn this paper, we propose a supervised multilayer factorization method designed for harmonic/percussive source separation and drum extraction. Our method decomposes the audio signals in sparse orthogonal components which capture the harmonic content, while the drum is represented by an extension of non negative matrix factorization which is able to exploit time-frequency dictionaries to take into account non stationary drum sounds. The drum dictionaries represent various real drum hits and the decomposition has more physical sense and allows for a better interpretation of the results. Experiments on real music data for a harmonic/percussive source separation task show that our method outperforms other state of the art algorithms. Finally, our method is very robust to non stationary harmonic sources that are usually poorly decomposed by existing methods

HAL-CentraleSupelec

Crossref

INRIA a CCSD electronic archive server

HAL-CEA

HAL-Rennes 1

Sound Source Separation

Author: Evangelista G
Marchand S
Plumbley MD
Vincent E
Publication venue: 'Wiley'
Publication date: 01/01/2011
Field of study

This is the author's accepted pre-print of the article, first published as G. Evangelista, S. Marchand, M. D. Plumbley and E. Vincent. Sound source separation. In U. Zölzer (ed.), DAFX: Digital Audio Effects, 2nd edition, Chapter 14, pp. 551-588. John Wiley & Sons, March 2011. ISBN 9781119991298. DOI: 10.1002/9781119991298.ch14file: Proof:e\EvangelistaMarchandPlumbleyV11-sound.pdf:PDF owner: markp timestamp: 2011.04.26file: Proof:e\EvangelistaMarchandPlumbleyV11-sound.pdf:PDF owner: markp timestamp: 2011.04.2

HAL-CentraleSupelec

INRIA a CCSD electronic archive server

Queen Mary Research Online

Surrey Research Insight

HAL-Rennes 1

Adapted NMFD update procedure for removing double hits in drum mixture decompositions

Author: De Bie Tijl
De Boom Cedric
Vande Veire Len
Publication venue
Publication date: 01/01/2020
Field of study

Ghent University Academic Bibliography

Non-negative mixtures

Author: Bro R
Cichocki A
Plumbley MD
Publication venue: 'Elsevier BV'
Publication date: 01/01/2010
Field of study

This is the author's accepted pre-print of the article, first published as M. D. Plumbley, A. Cichocki and R. Bro. Non-negative mixtures. In P. Comon and C. Jutten (Ed), Handbook of Blind Source Separation: Independent Component Analysis and Applications. Chapter 13, pp. 515-547. Academic Press, Feb 2010. ISBN 978-0-12-374726-6 DOI: 10.1016/B978-0-12-374726-6.00018-7file: Proof:p\PlumbleyCichockiBro10-non-negative.pdf:PDF owner: markp timestamp: 2011.04.26file: Proof:p\PlumbleyCichockiBro10-non-negative.pdf:PDF owner: markp timestamp: 2011.04.2

CiteSeerX

Copenhagen University Research Information System

Queen Mary Research Online

Surrey Research Insight

Unsupervised Learning for Monaural Source Separation Using Maximization–Minimization Algorithm with Time–Frequency Deconvolution

Author: Bouridane Ahmed
Chin Cheng Siong
Gao Bin
Ling Bingo Wing-Kuen
Woo Wai Lok
Publication venue: 'MDPI AG'
Publication date: 27/04/2018
Field of study

This paper presents an unsupervised learning algorithm for sparse nonnegative matrix factor time–frequency deconvolution with optimized fractional β -divergence. The β -divergence is a group of cost functions parametrized by a single parameter β . The Itakura–Saito divergence, Kullback–Leibler divergence and Least Square distance are special cases that correspond to β=0, 1, 2 , respectively. This paper presents a generalized algorithm that uses a flexible range of β that includes fractional values. It describes a maximization–minimization (MM) algorithm leading to the development of a fast convergence multiplicative update algorithm with guaranteed convergence. The proposed model operates in the time–frequency domain and decomposes an information-bearing matrix into two-dimensional deconvolution of factor matrices that represent the spectral dictionary and temporal codes. The deconvolution process has been optimized to yield sparse temporal codes through maximizing the likelihood of the observations. The paper also presents a method to estimate the fractional β value. The method is demonstrated on separating audio mixtures recorded from a single channel. The paper shows that the extraction of the spectral dictionary and temporal codes is significantly more efficient by using the proposed algorithm and subsequently leads to better source separation performance. Experimental tests and comparisons with other factorization methods have been conducted to verify its efficacy

Multidisciplinary Digital Publishing Institute

Northumbria Research Link

Sigmoidal NMFD : convolutional NMF with saturating activations for drum mixture decomposition

Author: De Bie Tijl
De Boom Cedric
Vande Veire Len
Publication venue: 'MDPI AG'
Publication date: 01/01/2021
Field of study

In many types of music, percussion plays an essential role to establish the rhythm and the groove of the music. Algorithms that can decompose the percussive signal into its constituent components would therefore be very useful, as they would enable many analytical and creative applications. This paper describes a method for the unsupervised decomposition of percussive recordings, building on the non-negative matrix factor deconvolution (NMFD) algorithm. Given a percussive music recording, NMFD discovers a dictionary of time-varying spectral templates and corresponding activation functions, representing its constituent sounds and their positions in the mix. We observe, however, that the activation functions discovered using NMFD do not show the expected impulse-like behavior for percussive instruments. We therefore enforce this behavior by specifying that the activations should take on binary values: either an instrument is hit, or it is not. To this end, we rewrite the activations as the output of a sigmoidal function, multiplied with a per-component amplitude factor. We furthermore define a regularization term that biases the decomposition to solutions with saturated activations, leading to the desired binary behavior. We evaluate several optimization strategies and techniques that are designed to avoid poor local minima. We show that incentivizing the activations to be binary indeed leads to the desired impulse-like behavior, and that the resulting components are better separated, leading to more interpretable decompositions

Multidisciplinary Digital Publishing Institute

Ghent University Academic Bibliography

Directory of Open Access Journals

An review of automatic drum transcription

Author: Dittmar Christian
Hockman Jason
Lerch Alexander
Muller Meinard
Southall Carl
Vogl Richard
Widmer Gerhard
Wu Chih-Wei
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 26/04/2018
Field of study

In Western popular music, drums and percussion are an important means to emphasize and shape the rhythm, often deﬁning the musical style. If computers were able to analyze the drum part in recorded music, it would enable a variety of rhythm-related music processing tasks. Especially the detection and classiﬁcation of drum sound events by computational methods is considered to be an important and challenging research problem in the broader ﬁeld of Music Information Retrieval. Over the last two decades, several authors have attempted to tackle this problem under the umbrella term Automatic Drum Transcription(ADT).This paper presents a comprehensive review of ADT research, including a thorough discussion of the task-speciﬁc challenges, categorization of existing techniques, and evaluation of several state-of-the-art systems. To provide more insights on the practice of ADT systems, we focus on two families of ADT techniques, namely methods based on Nonnegative Matrix Factorization and Recurrent Neural Networks. We explain the methods’ technical details and drum-speciﬁc variations and evaluate these approaches on publicly available datasets with a consistent experimental setup. Finally, the open issues and under-explored areas in ADT research are identiﬁed and discussed, providing future directions in this ﬁel

Birmingham City University Open Access Repository

BCU Open Access

Single-channel source separation using non-negative matrix factorization

Author: Schmidt Mikkel Nørgaard
Publication venue: Technical University of Denmark, DTU Informatics, Building 321
Publication date: 01/01/2009
Field of study

Online Research Database In Technology

Guitar String Separation Using Non-Negative Matrix Factorization and Factor Deconvolution

Author: Audio Mostly 2019
Pauwels J
Sandler M
Senvaityte D
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 18/09/2019
Field of study

Queen Mary Research Online