Location of Repository

Sparse and shift-invariant representations of music

By Thomas Blumensath and Mike Davies


Redundancy reduction has been proposed as the main computational process in the primary sensory pathways in the mammalian brain. This idea has led to the development of sparse coding techniques, which are exploited in this article to extract salient structure from musical signals. In particular, we use a sparse coding formulation within a generative model that explicitly enforces shift-invariance. Previous work has applied these methods to relatively small problem sizes. In this paper, we present a subset selection step to reduce the computational complexity of these methods, which then enables us to use the sparse coding approach for many real world applications. We demonstrate the algorithm's potential on two tasks in music analysis: the extraction of individual notes from polyphonic piano music and single-channel blind source separation.<br/><br/

Year: 2006
OAI identifier: oai:eprints.soton.ac.uk:142533
Provided by: e-Prints Soton

Suggested articles



  1. (2002). A probabilistic approach to single channel sourceseparation,”AdvancesinNeuralInformationProcessingSystems,
  2. (1999). A probabilistic framework for the adaptation and comparison of image codes,” doi
  3. (2003). Adaptive sparseness for supervised learning,” doi
  4. (1999). An affine scaling methodology for best basis selection,” doi
  5. (1999). Coding time-varying signals using sparse, shift-invariant representations,”
  6. (2002). Efficient coding of natural sounds,” doi
  7. (2001). Emergence of movement-sensitive neurons’ properties by learning a sparse code of natural moving images,”
  8. (1996). Emergence of simple-cell receptive field properties by learning a sparse code for natural images,” doi
  9. (1997). Exploring the Neural Code. doi
  10. (1990). Forming sparse representations by local anti-Hebian learning,” doi
  11. (2000). Learning overcomplete representations,” doi
  12. (2000). Learning sparse codes with a mixture-of-Gaussians prior,”
  13. (2003). Learning sparse multiscale image representations,” doi
  14. (2004). Non-negative matrix factor deconvolution: Extraction of multiple sound sources from monophonic inputs,” in doi
  15. (1961). Possible principles underlying the transformations of sensory messages,” doi
  16. (2003). Proposals for Performance Measurement in Source Separation,” Institut de Recherche et Coordination Acoustique/Musique, Paris, France,
  17. (2004). Separation of sound sources by convolutive sparse coding,” in
  18. (2000). Sparse coding of time-varying natural images,” doi
  19. (2003). Sparse coding with invariance constraints,” in doi
  20. Sparse representationsofpolyphonicmusic,”EURASIPSignalProcessing J.,to be published. doi
  21. (2004). Underdetermined source separation with structured source priors,” in doi

To submit an update or takedown request for this paper, please submit an Update/Correction/Removal Request.