Search CORE

348 research outputs found

Recommended from our members

Large-Scale Cover Song Recognition Using the 2D Fourier Transform Magnitude

Author: Ellis Daniel P. W.
Thierry Bertin-Mahieux
Publication venue: 'Columbia University Libraries/Information Services'
Publication date: 01/01/2012
Field of study

Large-scale cover song recognition involves calculating item-to-item similarities that can accommodate differences in timing and tempo, rendering simple Euclidean measures unsuitable. Expensive solutions such as dynamic time warping do not scale to million of instances, making them inappropriate for commercial-scale applications. In this work, we transform a beat-synchronous chroma matrix with a 2D Fourier transform and show that the resulting representation has properties that fit the cover song recognition task. We can also apply PCA to efficiently scale comparisons. We report the best results to date on the largest available dataset of around 18,000 cover songs amid one million tracks, giving a mean average precision of 3.0%

Columbia University Academic Commons

Reassigned spectrum-based feature extraction for GMM-based automatic chord recognition

Author: Khadkevich Maksim
Omologo Maurizio
Publication venue
Publication date: 01/01/2013
Field of study

Springer - Publisher Connector

Archivio della ricerca - Fondazione Bruno Kessler

Open Access Repository

Sequential Complexity as a Descriptor for Musical Similarity

Author: Dixon S
Foster P
Mauch M
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2014
Field of study

We propose string compressibility as a descriptor of temporal structure in audio, for the purpose of determining musical similarity. Our descriptors are based on computing track-wise compression rates of quantised audio features, using multiple temporal resolutions and quantisation granularities. To verify that our descriptors capture musically relevant information, we incorporate our descriptors into similarity rating prediction and song year prediction tasks. We base our evaluation on a dataset of 15500 track excerpts of Western popular music, for which we obtain 7800 web-sourced pairwise similarity ratings. To assess the agreement among similarity ratings, we perform an evaluation under controlled conditions, obtaining a rank correlation of 0.33 between intersected sets of ratings. Combined with bag-of-features descriptors, we obtain performance gains of 31.1% and 10.9% for similarity rating prediction and song year prediction. For both tasks, analysis of selected descriptors reveals that representing features at multiple time scales benefits prediction accuracy.Comment: 13 pages, 9 figures, 8 tables. Accepted versio

arXiv.org e-Print Archive

CiteSeerX

Crossref

Queen Mary Research Online

Reassigned spectrum-based feature extraction for GMM-based automatic chord recognition

Author: A Sheh
C Harte
C Harte
C Joder
C Zieger
DA Reynolds
DA Reynolds
DJ Nelson
DPW Ellis
E Gómez
E Gómez
F Auger
G Peeters
H Papadopoulos
H Papadopoulos
H Papadopoulos
JP Bello
K Kodera
K Lee
K Lee
KR Fitz
L Oudre
L Rabiner
LR Rabiner
M Goto
M Khadkevich
M Khadkevich
M Khadkevich
M Khadkevich
M Mauch
M Mauch
M Mauch
M Müller
M Stein
M Varewyck
Maksim Khadkevich
Maurizio Omologo
PJ Loughlin
SA Fulop
SW Hainsworth
SW Hainsworth
T Abe
T Cho
T Fujishima
T Rocher
Y Ueda
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

content based music access an approach and its applications

Author: Riccardo Miotto
Publication venue
Publication date: 01/09/2009
Field of study

At current time, the availability of largemusic repositories poses challenging research problems. Among all, content-based identification is gaining an increasing interest because it can provide new tools for easy access and retrieval. In this paper we describe an ongoing methodology for the content-based identification of unknown music recordings through a collection of music documents. Moreover, as future prospective scenario, identification is viewed in a more general similarity context, where also the perception of the users is considered

Crossref

Open Access Repository

Identifying 'Cover Songs' with Chroma Features and Dynamic Programming Beat Tracking

Author: Ellis Daniel P. W.
Poliner Graham E.
Publication venue: 'Columbia University Libraries/Information Services'
Publication date: 01/01/2007
Field of study

Large music collections, ranging from thousands to millions of tracks, are unsuited to manual searching, motivating the development of automatic search methods. When different musicians perform the same underlying song or piece, these are known as 'cover' versions. We describe a system that attempts to identify such a relationship between music audio recordings. To overcome variability in tempo, we use beat tracking to describe each piece with one feature vector per beat. To deal with variation in instrumentation, we use 12-dimensional 'chroma' feature vectors that collect spectral energy supporting each semitone of the octave. To compare two recordings, we simply cross-correlate the entire beat-by-chroma representation for two tracks and look for sharp peaks indicating good local alignment between the pieces. Evaluation on several databases indicate good performance, including best performance on an independent international evaluation, where the system achieved a mean reciprocal ranking of 0.49 for true cover versions among top-10 returns

Crossref

Columbia University Academic Commons