1,856 research outputs found
An investigation of feature models for music genre classification using the support vector classifier
In music genre classification the decision time is typically of the order of several seconds, however, most automatic music genre classification systems focus on short time features derived from 10?50ms. This work investigates two models, the multivariate Gaussian model and the multivariate autoregressive model for modelling short time features. Furthermore, it was investigated how these models can be integrated over a segment of short time features into a kernel such that a support vector machine can be applied. Two kernels with this property were considered, the convolution kernel and product probability kernel. In order to examine the different methods an 11 genre music setup was utilized. In this setup the Mel Frequency Cepstral Coefficients were used as short time features. The accuracy of the best performing model on this data set was 44% compared to a human performance of 52% on the same data set
Towards efficient music genre classification using FastMap
Automatic genre classification aims to correctly categorize an unknown recording with a music genre. Recent studies use the Kullback-Leibler (KL) divergence to estimate music similarity then perform classification using k-nearest neighbours (k-NN). However, this approach is not practical for large databases. We propose an efficient genre classifier that addresses the scalability problem. It uses a combination of modified FastMap algorithm and KL divergence to return the nearest neighbours then use 1- NN for classification. Our experiments showed that high accuracies are obtained while performing classification in less than 1/20 second per track
Variation of word frequencies across genre classification tasks
This paper examines automated genre classification of text documents and its role in enabling the effective management of digital documents by digital libraries and other repositories. Genre classification, which narrows down the possible structure of a document, is a valuable step in
realising the general automatic extraction of semantic metadata essential to the efficient management and use of digital objects. In the present report, we present an analysis of word frequencies in different genre classes in an effort to understand the distinction between independent classification tasks. In particular, we examine automated experiments on thirty-one genre classes to determine the relationship between the word frequency metrics and the degree of its significance in carrying out classification in varying environments
Large-Scale Music Genre Analysis and Classification Using Machine Learning with Apache Spark
The trend for listening to music online has greatly increased over the past decade due to the number of online musical tracks. The large music databases of music libraries that are provided by online music content distribution vendors make music streaming and downloading services more accessible to the end-user. It is essential to classify similar types of songs with an appropriate tag or index (genre) to present similar songs in a convenient way to the end-user. As the trend of online music listening continues to increase, developing multiple machine learning models to classify music genres has become a main area of research. In this research paper, a popular music dataset GTZAN which contains ten music genres is analysed to study various types of music features and audio signals. Multiple scalable machine learning algorithms supported by Apache Spark, including naĆÆve Bayes, decision tree, logistic regression, and random forest, are investigated for the classification of music genres. The performance of these classifiers is compared, and the random forest performs as the best classifier for the classification of music genres. Apache Spark is used in this paper to reduce the computation time for machine learning predictions with no computational cost, as it focuses on parallel computation. The present work also demonstrates that the perfect combination of Apache Spark and machine learning algorithms reduces the scalability problem of the computation of machine learning predictions. Moreover, different hyperparameters of the random forest classifier are optimized to increase the performance efficiency of the classifier in the domain of music genre classification. The experimental outcome shows that the developed random forest classifier can establish a high level of performance accuracy, especially for the mislabelled, distorted GTZAN dataset. This classifier has outperformed other machine learning classifiers supported by Apache Spark in the present work. The random forest classifier manages to achieve 90% accuracy for music genre classification compared to other work in the same domain
Leadership capability of team leaders in construction industry
This research was conducted to identify the important leadership capabilities for
Malaysia construction industry team leaders. This research used exploratory sequential
mix-method research design which is qualitative followed by quantitative research
method. In the qualitative phase, semi-structured in-depth interview was selected
and purposive sampling was employed in selecting 15 research participants involving
team leaders and Human Resource Managers. Qualitative data was analysed using
content and thematic analyses. Quantitative data was collected using survey
questionnaire involving 171 randomly selected team leaders as respondents. The data
was analyzed using descriptive and inferential statistics consisting of t-test, One-way
Analysis of Variance (ANOVA), Pearson Correlation, Multiple Regression and
Structured Equation Modeling (SEM). This study found that personal integrity, working
within industry, customer focus and quality, communication and interpersonal skill,
developing and empowering people and working as a team were needed leadership
capabilities among construction industry team leaders. The research was also able to
prove that leadership skill is a key element to develop leadership capability. A
framework was developed based on the results of this study, which can be used as a
guide by employers and relevant agencies in enhancing leadership capability of
Malaysia construction industry team leade
Information-theoretic measures of music listening behaviour
We present an information-theoretic approach to the mea-
surement of usersā music listening behaviour and selection of music features. Existing
ethnographic studies of mu- sic use have guided the design of music retrieval systems however are
typically qualitative and exploratory in nature. We introduce the SPUD dataset, comprising 10, 000
hand- made playlists, with user and audio stream metadata. With this, we illustrate the use of
entropy for analysing music listening behaviour, e.g. identifying when a user changed music
retrieval system. We then develop an approach to identifying music features that reflect usersā
criteria for playlist curation, rejecting features that are independent of user behaviour. The
dataset and the code used to produce it are made available. The techniques described support a
quantitative yet user-centred approach to the evaluation of music features and retrieval systems,
without assuming objective ground truth labels
Information-theoretic measures of music listening behaviour
We present an information-theoretic approach to the mea-
surement of usersā music listening behaviour and selection of music features. Existing
ethnographic studies of mu- sic use have guided the design of music retrieval systems however are
typically qualitative and exploratory in nature. We introduce the SPUD dataset, comprising 10, 000
hand- made playlists, with user and audio stream metadata. With this, we illustrate the use of
entropy for analysing music listening behaviour, e.g. identifying when a user changed music
retrieval system. We then develop an approach to identifying music features that reflect usersā
criteria for playlist curation, rejecting features that are independent of user behaviour. The
dataset and the code used to produce it are made available. The techniques described support a
quantitative yet user-centred approach to the evaluation of music features and retrieval systems,
without assuming objective ground truth labels
- ā¦