Search CORE

25 research outputs found

Adversarial Semi-Supervised Audio Source Separation applied to Singing Voice Extraction

Author: Dixon Simon
Ewert Sebastian
Stoller Daniel
Publication venue
Publication date: 06/04/2018
Field of study

The state of the art in music source separation employs neural networks trained in a supervised fashion on multi-track databases to estimate the sources from a given mixture. With only few datasets available, often extensive data augmentation is used to combat overfitting. Mixing random tracks, however, can even reduce separation performance as instruments in real music are strongly correlated. The key concept in our approach is that source estimates of an optimal separator should be indistinguishable from real source signals. Based on this idea, we drive the separator towards outputs deemed as realistic by discriminator networks that are trained to tell apart real from separator samples. This way, we can also use unpaired source and mixture recordings without the drawbacks of creating unrealistic music mixtures. Our framework is widely applicable as it does not assume a specific network architecture or number of sources. To our knowledge, this is the first adoption of adversarial training for music source separation. In a prototype experiment for singing voice separation, separation performance increases with our approach compared to purely supervised training.Comment: 5 pages, 2 figures, 1 table. Final version of manuscript accepted for 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Implementation available at https://github.com/f90/AdversarialAudioSeparatio

arXiv.org e-Print Archive

Crossref

Learning to separate vocals from polyphonic mixtures via ensemble methods and structured output prediction

Author: De Bie Tijl
Mcvicar Matt
Santos-Rodriguez Raul
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 19/05/2016
Field of study

Crossref

Explore Bristol Research

The 2015 Signal Separation Evaluation Campaign

Author: Ito Nobutaka
Kitamura Daichi
Liutkus Antoine
Ono Nobutaka
Rafii Zafar
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2015
Field of study

International audienceIn this paper, we report the 2015 community-based Signal Separation Evaluation Campaign (SiSEC 2015). This SiSEC consists of four speech and music datasets including two new datasets: " Professionally produced music recordings " and " Asynchronous recordings of speech mixtures ". Focusing on them, we overview the campaign specifications such as the tasks, datasets and evaluation criteria. We also summarize the performance of the submitted systems

Crossref

INRIA a CCSD electronic archive server

HAL-Rennes 1