Learning and Evaluating Musical Features with Deep Autoencoders
In this work we describe and evaluate methods to learn musical embeddings.
Each embedding is a vector that represents four contiguous beats of music and
is derived from a symbolic representation. We consider autoencoding-based
methods, including denoising autoencoders and context reconstruction, and
evaluate the resulting embeddings on a forward-prediction task and a
classification task.
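
As an illustration of the autoencoding approach described above, here is a minimal sketch (not the authors' code) of a denoising autoencoder that maps a four-beat segment of symbolic music, represented as a flattened binary piano-roll, to a fixed-size embedding. The piano-roll resolution, layer sizes, corruption rate, and embedding dimension are all illustrative assumptions.

```python
# Minimal sketch of a denoising autoencoder for four-beat symbolic segments.
# All shapes and hyperparameters below are assumptions, not the paper's setup.
import torch
import torch.nn as nn

BEATS, STEPS_PER_BEAT, PITCHES = 4, 4, 88      # assumed symbolic resolution
INPUT_DIM = BEATS * STEPS_PER_BEAT * PITCHES   # flattened binary piano-roll
EMBED_DIM = 128                                # assumed embedding size

class DenoisingAutoencoder(nn.Module):
    def __init__(self):
        super().__init__()
        self.encoder = nn.Sequential(
            nn.Linear(INPUT_DIM, 512), nn.ReLU(),
            nn.Linear(512, EMBED_DIM),          # the musical embedding
        )
        self.decoder = nn.Sequential(
            nn.Linear(EMBED_DIM, 512), nn.ReLU(),
            nn.Linear(512, INPUT_DIM),          # logits over note on/off
        )

    def forward(self, x):
        return self.decoder(self.encoder(x))

def corrupt(x, drop_prob=0.3):
    """Randomly drop notes so the model must reconstruct the clean input."""
    return x * (torch.rand_like(x) > drop_prob).float()

model = DenoisingAutoencoder()
opt = torch.optim.Adam(model.parameters(), lr=1e-3)
loss_fn = nn.BCEWithLogitsLoss()

# One training step on a random stand-in batch of binary piano-rolls.
clean = (torch.rand(32, INPUT_DIM) > 0.9).float()
logits = model(corrupt(clean))
loss = loss_fn(logits, clean)
opt.zero_grad()
loss.backward()
opt.step()
```

After training, `model.encoder(x)` would yield the embedding used for the downstream prediction and classification evaluations; the context-reconstruction variant would instead decode neighboring segments from the same embedding.
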
Self-supervised audio representation learning for mobile devices
We explore self-supervised models that can potentially be deployed on mobile
devices to learn general purpose audio representations. Specifically, we
propose methods that exploit the temporal context in the spectrogram domain.
One method estimates the temporal gap between two short audio segments
extracted at random from the same audio clip. The other methods are inspired by
Word2Vec, a popular technique used to learn word embeddings, and aim at
reconstructing a temporal spectrogram slice from past and future slices or,
alternatively, at reconstructing the context of surrounding slices from the
current slice. We focus our evaluation on small encoder architectures, which
can potentially be run on mobile devices during both inference (re-using a
common learned representation across multiple downstream tasks) and training
(capturing the true data distribution without compromising users' privacy when
combined with federated learning). We evaluate the quality of the embeddings
produced by the self-supervised learning models and show that they can be
re-used for a variety of downstream tasks, for some tasks even approaching the
performance of fully supervised models of similar size.
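
The temporal-gap pretext task lends itself to a compact sketch. Below is a minimal, illustrative PyTorch version (not the paper's implementation): two short slices are cut at random from the same spectrogram, encoded by a small shared convolutional encoder, and a linear head regresses the normalized gap between them. The slice geometry, encoder width, and the choice of regression over classification are all assumptions.

```python
# Sketch of the temporal-gap pretext task under assumed spectrogram geometry.
import torch
import torch.nn as nn

MEL_BINS, SLICE_FRAMES, MAX_GAP = 64, 96, 500   # assumed spectrogram geometry

class SmallEncoder(nn.Module):
    """Lightweight convolutional encoder, sized with mobile use in mind."""
    def __init__(self, embed_dim=128):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(1, 16, 3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(16, 32, 3, stride=2, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1), nn.Flatten(),
            nn.Linear(32, embed_dim),
        )

    def forward(self, x):
        return self.net(x)

encoder = SmallEncoder()
gap_head = nn.Linear(2 * 128, 1)  # regress the gap from both embeddings

def sample_pair(spectrogram):
    """Cut two random slices from one clip and return them with their gap."""
    frames = spectrogram.shape[-1]
    starts = torch.randint(0, frames - SLICE_FRAMES, (2,))
    a = spectrogram[..., starts[0]:starts[0] + SLICE_FRAMES]
    b = spectrogram[..., starts[1]:starts[1] + SLICE_FRAMES]
    gap = (starts[1] - starts[0]).abs().float() / MAX_GAP  # normalized target
    return a, b, gap

# One training step on a random stand-in clip.
clip = torch.randn(1, MEL_BINS, MAX_GAP + SLICE_FRAMES)
a, b, gap = sample_pair(clip)
z = torch.cat([encoder(a.unsqueeze(0)), encoder(b.unsqueeze(0))], dim=-1)
loss = nn.functional.mse_loss(gap_head(z).squeeze(), gap)
loss.backward()
```
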
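For the Word2Vec-inspired variants, a CBoW-style sketch of the first one (reconstructing a spectrogram slice from past and future slices) might look as follows; the five-slice layout, mean-pooled context embedding, and linear decoder are illustrative assumptions, and the skip-gram-like mirror image would swap the inputs and targets.

```python
# Sketch of CBoW-style slice reconstruction; layout and decoder are assumed.
import torch
import torch.nn as nn

MEL_BINS, FRAMES, EMBED = 64, 96, 128

encoder = nn.Sequential(               # same small-encoder idea as above
    nn.Conv2d(1, 16, 3, stride=2, padding=1), nn.ReLU(),
    nn.Conv2d(16, 32, 3, stride=2, padding=1), nn.ReLU(),
    nn.AdaptiveAvgPool2d(1), nn.Flatten(),
    nn.Linear(32, EMBED),
)
decoder = nn.Linear(EMBED, MEL_BINS * FRAMES)  # reconstructs the target slice

# Stand-in batch: 5 consecutive slices per clip; the middle one is the target.
slices = torch.randn(8, 5, MEL_BINS, FRAMES)
context = torch.cat([slices[:, :2], slices[:, 3:]], dim=1)      # past + future
target = slices[:, 2]

# Encode each context slice, average the embeddings (CBoW-style), then decode.
z = encoder(context.reshape(-1, 1, MEL_BINS, FRAMES)).view(8, 4, EMBED).mean(1)
recon = decoder(z).view(8, MEL_BINS, FRAMES)
loss = nn.functional.mse_loss(recon, target)
loss.backward()
```
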