Search CORE

17,735 research outputs found

Unsupervised automatic music genre classification

Author: Barreira Luís Filipe Marques
Publication venue: Faculdade de Ciências e Tecnologia
Publication date: 01/01/2010
Field of study

Trabalho apresentado no âmbito do Mestrado em Engenharia Informática, como requisito parcial para obtenção do grau de Mestre em Engenharia InformáticaIn this study we explore automatic music genre recognition and classification of digital music. Music has always been a reflection of culture di erences and an influence in our society. Today’s digital content development triggered the massive use of digital music. Nowadays,digital music is manually labeled without following a universal taxonomy, thus, the labeling process to audio indexing is prone to errors. A human labeling will always be influenced by culture di erences, education, tastes, etc. Nonetheless, this indexing process is primordial to guarantee a correct organization of huge databases that contain thousands of music titles. In this study, our interest is about music genre organization. We propose a learning and classification methodology for automatic genre classification able to group several music samples based on their characteristics (this is achieved by the proposed learning process) as well as classify a new test music into the previously learned created groups(this is achieved by the proposed classification process). The learning method intends to group the music samples into di erent clusters only based on audio features and without any previous knowledge on the genre of the samples, and therefore it follows an unsupervised methodology. In addition a Model-Based approach is followed to generate clusters as we do not provide any information about the number of genres in the dataset. Features are related with rhythm analysis, timbre, melody, among others. In addition, Mahalanobis distance was used so that the classification method can deal with non-spherical clusters. The proposed learning method achieves a clustering accuracy of 55% when the dataset contains 11 di erent music genres: Blues, Classical, Country, Disco, Fado, Hiphop, Jazz, Metal,Pop, Reggae and Rock. The clustering accuracy improves significantly when the number of genres is reduced; with 4 genres (Classical, Fado, Metal and Reggae), we obtain an accuracy of 100%. As for the classification process, 82% of the submitted music samples were correctly classified

Repositório da Universidade Nova de Lisboa

FMA: A Dataset For Music Analysis

Author: Benzi Kirell
Bresson Xavier
Defferrard Michaël
Vandergheynst Pierre
Publication venue
Publication date: 27/04/2017
Field of study

We introduce the Free Music Archive (FMA), an open and easily accessible dataset suitable for evaluating several tasks in MIR, a field concerned with browsing, searching, and organizing large music collections. The community's growing interest in feature and end-to-end learning is however restrained by the limited availability of large audio datasets. The FMA aims to overcome this hurdle by providing 917 GiB and 343 days of Creative Commons-licensed audio from 106,574 tracks from 16,341 artists and 14,854 albums, arranged in a hierarchical taxonomy of 161 genres. It provides full-length and high-quality audio, pre-computed features, together with track- and user-level metadata, tags, and free-form text such as biographies. We here describe the dataset and how it was created, propose a train/validation/test split and three subsets, discuss some suitable MIR tasks, and evaluate some baselines for genre recognition. Code, data, and usage examples are available at https://github.com/mdeff/fmaComment: ISMIR 2017 camera-read

arXiv.org e-Print Archive

Infoscience - École polytechnique fédérale de Lausanne

Extended pipeline for content-based feature engineering in music genre recognition

Author: Bientinesi Paolo
Raissi Tina
Tibo Alessandro
Publication venue
Publication date: 01/01/2018
Field of study

We present a feature engineering pipeline for the construction of musical signal characteristics, to be used for the design of a supervised model for musical genre identification. The key idea is to extend the traditional two-step process of extraction and classification with additive stand-alone phases which are no longer organized in a waterfall scheme. The whole system is realized by traversing backtrack arrows and cycles between various stages. In order to give a compact and effective representation of the features, the standard early temporal integration is combined with other selection and extraction phases: on the one hand, the selection of the most meaningful characteristics based on information gain, and on the other hand, the inclusion of the nonlinear correlation between this subset of features, determined by an autoencoder. The results of the experiments conducted on GTZAN dataset reveal a noticeable contribution of this methodology towards the model's performance in classification task.Comment: ICASSP 201

arXiv.org e-Print Archive

Crossref

Publikationsserver der RWTH Aachen University

Collaborative tagging as a tripartite network

Author: Ausloos M.
Lambiotte R.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 29/12/2005
Field of study

We describe online collaborative communities by tripartite networks, the nodes being persons, items and tags. We introduce projection methods in order to uncover the structures of the networks, i.e. communities of users, genre families... To do so, we focus on the correlations between the nodes, depending on their profiles, and use percolation techniques that consist in removing less correlated links and observing the shaping of disconnected islands. The structuring of the network is visualised by using a tree representation. The notion of diversity in the system is also discussed

arXiv.org e-Print Archive

Crossref

Repository of the University of Namur

Recommended from our members

Non-Negative Tensor Factorization Applied to Music Genre Classification

Author: Benetos E.
Kotropoulos C.
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/11/2010
Field of study

Music genre classification techniques are typically applied to the data matrix whose columns are the feature vectors extracted from music recordings. In this paper, a feature vector is extracted using a texture window of one sec, which enables the representation of any 30 sec long music recording as a time sequence of feature vectors, thus yielding a feature matrix. Consequently, by stacking the feature matrices associated to any dataset recordings, a tensor is created, a fact which necessitates studying music genre classification using tensors. First, a novel algorithm for non-negative tensor factorization (NTF) is derived that extends the non-negative matrix factorization. Several variants of the NTF algorithm emerge by employing different cost functions from the class of Bregman divergences. Second, a novel supervised NTF classifier is proposed, which trains a basis for each class separately and employs basis orthogonalization. A variety of spectral, temporal, perceptual, energy, and pitch descriptors is extracted from 1000 recordings of the GTZAN dataset, which are distributed across 10 genre classes. The NTF classifier performance is compared against that of the multilayer perceptron and the support vector machines by applying a stratified 10-fold cross validation. A genre classification accuracy of 78.9% is reported for the NTF classifier demonstrating the superiority of the aforementioned multilinear classifier over several data matrix-based state-of-the-art classifiers

City Research Online

Multimodal music information processing and retrieval: survey and future challenges

Author: Avanzini Federico
Ntalampiras Stavros
Simonetta Federico
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 14/02/2019
Field of study

Towards improving the performance in various music information processing tasks, recent studies exploit different modalities able to capture diverse aspects of music. Such modalities include audio recordings, symbolic music scores, mid-level representations, motion, and gestural data, video recordings, editorial or cultural tags, lyrics and album cover arts. This paper critically reviews the various approaches adopted in Music Information Processing and Retrieval and highlights how multimodal algorithms can help Music Computing applications. First, we categorize the related literature based on the application they address. Subsequently, we analyze existing information fusion approaches, and we conclude with the set of challenges that Music Information Retrieval and Sound and Music Computing research communities should focus in the next years

arXiv.org e-Print Archive

Crossref

Automatic Genre Classification of Latin Music Using Ensemble of Classifiers

Author: Kaestner Celso A.A.
Koerich Alessandro L.
Silla Jr Carlos N.
Publication venue: Brazilian Computer Society
Publication date: 01/07/2006
Field of study

This paper presents a novel approach to the task of automatic music genre classification which is based on ensemble learning. Feature vectors are extracted from three 30-second music segments from the beginning, middle and end of each music piece. Individual classifiers are trained to account for each music segment. During classification, the output provided by each classifier is combined with the aim of improving music genre classification accuracy. Experiments carried out on a dataset containing 600 music samples from two Latin genres (Tango and Salsa) have shown that for the task of automatic music genre classification, the features extracted from the middle and end music segments provide better results than using the beginning music segment. Furthermore, the proposed ensemble method provides better accuracy than using single classifiers and any individual segment

Kent Academic Repository

Ohio impromptu, genre and Beckett on film

Author: Dowd Garin
Publication venue: Intellect Books
Publication date: 01/01/2005
Field of study

Samuel Beckett’s choice of the title Ohio Impromptu to name the play first performed to an audience of academics and scholars at Columbus Ohio in 1981 is one manifestation of its author’s interest in the question of literary genre; more generally, in Beckett’s dramatic works one encounters a meticulous attention to the activity of categorisation, even if the energy is often directed toward the creation of phantom genres for spectral exemplars. This essay concerns itself with Ohio Impromptu in particular because by means of elements specific to this play (including the context in which it was first performed) it comments upon its own very failure to occupy its designated genre co-ordinates (these include its identity both as a play and as an ‘impromptu’). This play, which is so apt to incorporate other genres, however, is presided over by a stage direction which locates it firmly in the theatrical context. It is in its deliberate failure to attend to this stage direction that the Beckett on Film version of the play goes beyond the mere treacherous fidelity that is inevitably a feature of any adaptation. In arguing this, the essay analyses the foregrounding in the play of questions that can be said to pertain to genre (in several senses). Its more specific intention is to suggest that, via a combination of casting and special effects, the adaptation succeeds not only in cancelling the critical reflection on the ‘genre gesture’ that is lodged in Ohio Impromptu, but also in eradicating the very disjunction between Reader and Listener upon which the play depends

UWL Repository