Search CORE

4,287 research outputs found

Music classification by low-rank semantic mappings

Author: Kotropoulos Constantine
Panagakis Yannis
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2013
Field of study

A challenging open question in music classification is which music representation (i.e., audio features) and which machine learning algorithm is appropriate for a specific music classification task. To address this challenge, given a number of audio feature vectors for each training music recording that capture the different aspects of music (i.e., timbre, harmony, etc.), the goal is to find a set of linear mappings from several feature spaces to the semantic space spanned by the class indicator vectors. These mappings should reveal the common latent variables, which characterize a given set of classes and simultaneously define a multi-class linear classifier that classifies the extracted latent common features. Such a set of mappings is obtained, building on the notion of the maximum margin matrix factorization, by minimizing a weighted sum of nuclear norms. Since the nuclear norm imposes rank constraints to the learnt mappings, the proposed method is referred to as low-rank semantic mappings (LRSMs). The performance of the LRSMs in music genre, mood, and multi-label classification is assessed by conducting extensive experiments on seven manually annotated benchmark datasets. The reported experimental results demonstrate the superiority of the LRSMs over the classifiers that are compared to. Furthermore, the best reported classification results are comparable with or slightly superior to those obtained by the state-of-the-art task-specific music classification methods

Springer - Publisher Connector

Middlesex University Research Repository

Music classification by low-rank semantic mappings

Author: Kotropoulos C.
Kotropoulos C.
Panagakis Y.
Panagakis Y.
Publication venue: Springer
Publication date: 01/01/2013
Field of study

Middlesex University Research Repository

Deep Cross-Modal Correlation Learning for Audio and Lyrics in Music Retrieval

Author: Francisco Raposo
Lei Chen
Suhua Tang
Yi Yu
Publication venue: 'American College of Medical Physics (ACMP)'
Publication date: 01/02/2019
Field of study

Deep cross-modal learning has successfully demonstrated excellent performance in cross-modal multimedia retrieval, with the aim of learning joint representations between different data modalities. Unfortunately, little research focuses on cross-modal correlation learning where temporal structures of different data modalities such as audio and lyrics should be taken into account. Stemming from the characteristic of temporal structures of music in nature, we are motivated to learn the deep sequential correlation between audio and lyrics. In this work, we propose a deep cross-modal correlation learning architecture involving two-branch deep neural networks for audio modality and text modality (lyrics). Data in different modalities are converted to the same canonical space where inter modal canonical correlation analysis is utilized as an objective function to calculate the similarity of temporal structures. This is the first study that uses deep architectures for learning the temporal correlation between audio and lyrics. A pre-trained Doc2Vec model followed by fully-connected layers is used to represent lyrics. Two significant contributions are made in the audio branch, as follows: i) We propose an end-to-end network to learn cross-modal correlation between audio and lyrics, where feature extraction and correlation learning are simultaneously performed and joint representation is learned by considering temporal structures. ii) As for feature extraction, we further represent an audio signal by a short sequence of local summaries (VGG16 features) and apply a recurrent neural network to compute a compact feature that better learns temporal structures of music audio. Experimental results, using audio to retrieve lyrics or using lyrics to retrieve audio, verify the effectiveness of the proposed deep correlation learning architectures in cross-modal music retrieval

Creative Repository of Electro-Communications

Retrieval and Annotation of Music Using Latent Semantic Models

Author: Levy Mark
Publication venue: 'Queen Mary University of London'
Publication date: 01/01/2012
Field of study

PhDThis thesis investigates the use of latent semantic models for annotation and retrieval from collections of musical audio tracks. In particular latent semantic analysis (LSA) and aspect models (or probabilistic latent semantic analysis, pLSA) are used to index words in descriptions of music drawn from hundreds of thousands of social tags. A new discrete audio feature representation is introduced to encode musical characteristics of automatically-identified regions of interest within each track, using a vocabulary of audio muswords. Finally a joint aspect model is developed that can learn from both tagged and untagged tracks by indexing both conventional words and muswords. This model is used as the basis of a music search system that supports query by example and by keyword, and of a simple probabilistic machine annotation system. The models are evaluated by their performance in a variety of realistic retrieval and annotation tasks, motivated by applications including playlist generation, internet radio streaming, music recommendation and catalogue searchEngineering and Physical Sciences Research Counci

Queen Mary Research Online

Recommended from our members

Results of the ontology alignment evaluation initiative 2017

Author: Achichi M.
Cheatham M.
Dragisic Z.
Euzenat J.
Faria D.
Ferrara A.
Flouris G.
Fundulaki I.
Harrow I.
Ivanova V.
Jimenez-Ruiz E.
Kolthoff K.
Kuss E.
Lambrix P.
Leopold H.
Li H.
Meilicke C.
Mohammadi M.
Montanelli S.
Pesquita C.
Saveta T.
Shvaiko P.
Splendiani A.
Stuckenschmidt H.
Thiéblin E.
Todorov K.
Trojahn C.
Zamazal O.
Publication venue
Publication date: 01/01/2016
Field of study

Ontology matching consists of finding correspondences between semantically related entities of different ontologies. The Ontology Alignment Evaluation Initiative (OAEI) aims at comparing ontology matching systems on precisely defined test cases. These test cases can be based on ontologies of different levels of complexity (from simple thesauri to expressive OWL ontologies) and use different evaluation modalities (e.g., blind evaluation, open evaluation, or consensus). The OAEI 2017 campaign offered 9 tracks with 23 test cases, and was attended by 21 participants. This paper is an overall presentation of that campaign

City Research Online

Scientific Publications of the University of Toulouse II Le Mirail

Hal - Université Grenoble Alpes

AIR Universita degli studi di Milano

TU Delft Repository

INRIA a CCSD electronic archive server

Open Archive Toulouse Archive Ouverte

MAnnheim DOCument Server

Hal-Diderot