22 research outputs found

Deep Karaoke: Extracting Vocals from Musical Mixtures Using a Convolutional Deep Neural Network

Author: D Pressnitzer
E Vincent
F Abrard
JH McDermott
MJ Terrell
N Ding
Y Wang
Publication date: 01/01/2015

Identification and extraction of singing voice from within musical mixtures is a key challenge in source separation and machine audition. Recently, deep neural networks (DNNs) have been used to estimate 'ideal' binary masks for carefully controlled cocktail party speech separation problems. However, it is not yet known whether these methods are capable of generalizing to the discrimination of voice and non-voice in the context of musical mixtures. Here, we trained a convolutional DNN (of around a billion parameters) to provide probabilistic estimates of the ideal binary mask for separation of vocal sounds from real-world musical mixtures. We contrast our DNN results with more traditional linear methods. Our approach may be useful for automatic removal of vocal sounds from musical mixtures for 'karaoke' type applications.
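The ideal binary mask mentioned in this abstract can be illustrated with a toy sketch. It assumes magnitude spectrograms stored as nested lists (rows are frequency bins, columns are time frames) and assumes, for simplicity only, that the mixture magnitude is the sum of the two source magnitudes. The function names `ideal_binary_mask`, `binarize`, and `apply_mask` are hypothetical and are not taken from the paper; the network itself is not reproduced here.

```python
def ideal_binary_mask(vocal_mag, backing_mag):
    """Mask is 1 where the vocal magnitude exceeds the accompaniment, else 0."""
    return [
        [1 if v > b else 0 for v, b in zip(v_row, b_row)]
        for v_row, b_row in zip(vocal_mag, backing_mag)
    ]


def binarize(prob_mask, threshold=0.5):
    """Turn probabilistic mask estimates (e.g. model outputs) into a binary mask."""
    return [[1 if p >= threshold else 0 for p in row] for row in prob_mask]


def apply_mask(mixture_mag, mask):
    """Keep only the time-frequency cells that the mask assigns to the vocal."""
    return [
        [m * k for m, k in zip(m_row, k_row)]
        for m_row, k_row in zip(mixture_mag, mask)
    ]


# Toy 2x2 spectrograms (assumed values, not real audio).
vocal = [[0.9, 0.1], [0.2, 0.7]]
backing = [[0.3, 0.5], [0.6, 0.1]]
mixture = [[v + b for v, b in zip(vr, br)] for vr, br in zip(vocal, backing)]

mask = ideal_binary_mask(vocal, backing)   # [[1, 0], [0, 1]]
vocal_estimate = apply_mask(mixture, mask)
```

In a learned system, the oracle mask would be unavailable at test time; a model would output per-cell probabilities, which `binarize` thresholds before masking.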

arXiv.org e-Print Archive

Crossref

University of Surrey

Surrey Research Insight

Evaluations on underdetermined blind source separation in adverse environments using time-frequency masking

Author: A Cichocki
AK Jain
AW Rix
BR Jipkate
CM Coviello
D Arthur
E Vincent
EA Lehmann
EC Cherry
F Abrard
G Hamerly
G Li
H Li
H Sawada
H Sun
I Jafari
ITU-T
J Bezdek
J Han
JB MacQueen
JM Pena
L Di Persia
L Rabiner
L Zhu
M Kühne
M Mandel
MI Mandel
N Mitianoudis
O Yılmaz
P Georgiev
P Smaragdis
PC Loizou
R Huber
R Lippmann
R Roy
RJ Hathaway
S Araki
S Godsill
S Theodoridis
T Melia
T Velmurugan
V Emiya
VG Reju
W Fisher
X-Y Wang
Y Hu
Y Izumi
Z Shi
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2013

The successful implementation of speech processing systems in the real world depends on their ability to handle adverse acoustic conditions with undesirable factors such as room reverberation and background noise. In this study, an extension to the established multiple sensors degenerate unmixing estimation technique (MENUET) algorithm for blind source separation is proposed, based on fuzzy c-means clustering, to improve separation ability in underdetermined situations using a nonlinear microphone array. Rather than testing blind source separation ability solely under reverberant conditions, this paper also includes a variety of simulated and real-world noisy environments. The results show encouraging separation ability and improved perceptual quality of the separated sources under such adverse conditions. This not only establishes the proposed methodology as a credible improvement to the system, but also implies further applicability in areas such as noise suppression in adverse acoustic environments.

Crossref

Springer - Publisher Connector

espace@Curtin

A novel underdetermined source recovery algorithm based on k-sparse component analysis

Author: A Aissa-El-Bey
A Hyvärinen
B Liu
Bahador Makkiabadi
D Coppersmith
D Needell
D Peng
DL Donoho
E Berg Van Den
Ehsan Eqlimi
F Abrard
F Marvasti
Fahimeh Mohagheghian
H Mohimani
H Zayyani
Hassan Khajehpour
I Takigawa
IF Gorodnitsky
J Wen
JA Tropp
M Elad
M Niknazar
M Zibulevsky
Nasser Samadzadehaghdam
P Bofill
P Comon
P Georgiev
R Gribonval
R Mises
S Araki
Saeid Sanei
SF Cotter
SG Mallat
SS Chen
Y Li
Y Saad
Z He
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 03/08/2018

Sparse component analysis (SCA) is a popular method for addressing underdetermined blind source separation in array signal processing applications. We are motivated by problems that arise in applications where the sources are densely sparse (i.e. the number of active sources is high and very close to the number of sensors). The separation performance of current underdetermined source recovery (USR) solutions, including the relaxation and greedy families, degrades as the mixing system dimension decreases and the sparsity level (k) increases. In this paper, we present a k-SCA-based algorithm that is suitable for USR in low-dimensional mixing systems. Assuming the sources are at most (m−1)-sparse, where m is the number of mixtures, the proposed method is capable of recovering the sources from the mixtures, given the mixing matrix, using a subspace detection framework. Simulation results show that the proposed algorithm achieves better separation performance in k-SCA conditions compared to state-of-the-art USR algorithms such as basis pursuit, L1-norm minimization, smoothed L0, focal underdetermined system solver and orthogonal matching pursuit.
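One simple reading of "subspace detection" with a known mixing matrix can be sketched for the smallest interesting case, m = 3 mixtures and at most m−1 = 2 active sources per sample. Each noiseless mixture sample then lies in the plane spanned by two mixing columns, so the sketch picks the pair of columns whose plane is closest to the sample and solves a 2x2 least-squares problem on that pair. This is an illustrative assumption, not the authors' algorithm; `recover_sample` and the toy mixing columns are hypothetical.

```python
from itertools import combinations


def cross(u, v):
    """Cross product of two 3-vectors."""
    return [
        u[1] * v[2] - u[2] * v[1],
        u[2] * v[0] - u[0] * v[2],
        u[0] * v[1] - u[1] * v[0],
    ]


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def recover_sample(columns, x):
    """Estimate a source vector with at most 2 nonzeros from one sample x in R^3.

    columns: list of n mixing vectors (the columns of the mixing matrix).
    """
    best_pair, best_dist = None, float("inf")
    for i, j in combinations(range(len(columns)), 2):
        normal = cross(columns[i], columns[j])
        norm = dot(normal, normal) ** 0.5
        if norm == 0.0:
            continue  # parallel columns span a line, not a plane
        dist = abs(dot(normal, x)) / norm  # distance from x to the plane
        if dist < best_dist:
            best_pair, best_dist = (i, j), dist

    i, j = best_pair
    a, b = columns[i], columns[j]
    # Least squares on the detected pair via the 2x2 normal equations.
    g11, g12, g22 = dot(a, a), dot(a, b), dot(b, b)
    r1, r2 = dot(a, x), dot(b, x)
    det = g11 * g22 - g12 * g12
    sources = [0.0] * len(columns)
    sources[i] = (r1 * g22 - r2 * g12) / det
    sources[j] = (g11 * r2 - g12 * r1) / det
    return sources


# Toy system: 4 sources, 3 mixtures; sources 1 and 3 are active.
columns = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0], [1.0, 1.0, 1.0]]
x = [3.0, 5.0, 3.0]  # equals 2 * columns[1] + 3 * columns[3]
estimate = recover_sample(columns, x)
```

The exhaustive search over column pairs is only practical for small systems; it is used here because it makes the subspace assumption explicit.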

Crossref

Nottingham Trent Institutional Repository (IRep)

Ghent University Academic Bibliography

A new source separation approach for instantaneous mixtures based on time-frequency analysis

Author: Abrard F.
Deville Y.
White P.R.
Publication date: 01/01/2001

Southampton (e-Prints Soton)

Blind partial separation of underdetermined convolutive mixtures of complex sources based on differential normalized kurtosis

Author: Abrard F.
Deville Y.
Thomas J.
Publication venue: 'Elsevier BV'
Publication date: 01/01/2008

HAL-INSU

From blind source separation to blind source cancellation in the underdetermined case: A new approach based on time-frequency analysis

Author: Abrard F.
Deville Y.
White P.R.
Publication date: 01/01/2001

Many source separation methods are restricted to non-Gaussian, stationary and independent sources. This causes problems in real applications, where the sources often do not satisfy these assumptions. Moreover, in some cases we are dealing with more sources than available observations, which is critical for most classical source separation approaches. In this paper, we propose a new simple source separation method which uses time-frequency information to cancel one source signal from two observations in linear instantaneous mixtures. This efficient method is directly designed for non-stationary sources and applies to various dependent or Gaussian signals which have different time-frequency representations.
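The idea of cancelling one source from two observations can be sketched as follows, under assumptions that go beyond the abstract: the observations are represented as real-valued time-frequency coefficients, some analysis window contains only one active source, and that window is found by looking for the most constant ratio between the two observations. The names `estimate_cancelling_ratio` and `cancel_source`, the window length, and the toy values are all hypothetical; this is not presented as the authors' exact procedure.

```python
from statistics import mean, pvariance


def estimate_cancelling_ratio(x1, x2, window=3, eps=1e-12):
    """Return the mean ratio x2/x1 over the window where that ratio varies least.

    In a window where a single source is active, x2/x1 is constant and equals
    the ratio of that source's two mixing coefficients.
    """
    best_ratio, best_var = None, float("inf")
    for start in range(0, len(x1) - window + 1, window):
        seg1 = x1[start:start + window]
        seg2 = x2[start:start + window]
        if any(abs(v) < eps for v in seg1):
            continue  # skip windows where the ratio is undefined
        ratios = [b / a for a, b in zip(seg1, seg2)]
        var = pvariance(ratios)
        if var < best_var:
            best_ratio, best_var = mean(ratios), var
    return best_ratio


def cancel_source(x1, x2, ratio):
    """Linear combination of the observations that removes the detected source."""
    return [b - ratio * a for a, b in zip(x1, x2)]


# Toy coefficients: source 1 is alone in the first window, both are active later.
s1 = [1.0, 2.0, 4.0, 1.0, 3.0, 2.0]
s2 = [0.0, 0.0, 0.0, 1.0, -2.0, 4.0]
x1 = [a + b for a, b in zip(s1, s2)]              # observation 1 = s1 + s2
x2 = [0.5 * a + 2.0 * b for a, b in zip(s1, s2)]  # observation 2 = 0.5*s1 + 2*s2

ratio = estimate_cancelling_ratio(x1, x2)
residual = cancel_source(x1, x2, ratio)  # contains only a scaled copy of s2
```

With these toy values the detected ratio is the mixing ratio of source 1, so the residual is proportional to source 2 alone. No independence or non-Gaussianity of the sources is used, only the fact that their time-frequency supports differ.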

CiteSeerX

Southampton (e-Prints Soton)

Image Source Separation Using Color Channel Dependencies

Author: A. Hyvarinen
A.M. Bronstein
F. Abrard
K. Zhang
L. Bedini
M. Kawanabe
W.R. Gilks
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2009

Crossref

SPEAKER LOCALIZATION AND SPEECH SEPARATION IN TWO ECHOIC MIXTURES / KALBĖTOJO APTIKIMAS IR ŠNEKOS IŠSKYRIMAS DVIEJŲ SIGNALŲ MIŠINIUOSE SU AIDU

Author: Abrard F
Aoki M
Arberet S
He Z
Makino S
Ouchi H
Rickard S
Yilmaz O
Publication venue: 'Vilnius Gediminas Technical University'

Crossref

Lucien Morellet's fossil algae from Iraq: a pioneer occasion in the petroleum palaeontology of the Middle East

Author: ABRARD R.
ELLIOTT G. F.
HEDBERG H. D.
JOHNSON J. H.
MASSIEUX M.
PFENDER J.
Publication venue: 'Edinburgh University Press'

Crossref
