1,417 research outputs found

Verifying tag annotation and performing genre classification in music data via association analysis

Author: Arjannikov Tom
Publication venue: 'University of Central Missouri, Department of Mathematics and Computer Science'
Publication date: 01/01/2014

Music Information Retrieval aims to automate access to large volumes of music data, including browsing, retrieval, and storage. The work presented in this thesis tackles two non-trivial problems in the field. The first problem concerns music tags, which provide rich descriptive information about a music piece, including its genre, artist, emotion, and instrument. At present, tag annotation is largely a manual process, which often results in tags that are subjective, ambiguous, and error-prone. We propose a novel approach to verify the quality of tag annotation in a music dataset through association analysis. Second, we employ association analysis to predict music genres based on features extracted directly from music. We build an association-based classifier, which finds inherent associations between music features and genres. We demonstrate the effectiveness of our approaches through a series of simulations and experiments using various benchmark music datasets.

OPUS: Open Uleth Scholarship - University of Lethbridge Research Repository
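
The abstract above does not describe how association analysis is applied, so the following is only a minimal sketch of the general idea, not the thesis's method. It assumes that each track is a set of tags, that rules are restricted to single-tag pairs, and that a track is worth a second look when it carries the antecedent of a strong rule but lacks the consequent. The function names, thresholds, and toy data are all hypothetical.

```python
from itertools import permutations


def rule_stats(transactions, antecedent, consequent):
    """Return (support, confidence) for the rule antecedent -> consequent."""
    n = len(transactions)
    has_a = [t for t in transactions if antecedent in t]
    has_both = [t for t in has_a if consequent in t]
    support = len(has_both) / n if n else 0.0
    confidence = len(has_both) / len(has_a) if has_a else 0.0
    return support, confidence


def mine_rules(transactions, min_support=0.3, min_confidence=0.7):
    """Brute-force search over single-tag rules (illustrative, not efficient)."""
    items = sorted(set().union(*transactions)) if transactions else []
    rules = {}
    for a, b in permutations(items, 2):
        support, confidence = rule_stats(transactions, a, b)
        if support >= min_support and confidence >= min_confidence:
            rules[(a, b)] = (support, confidence)
    return rules


def flag_suspect_tracks(tracks, rules):
    """List (track, antecedent, missing consequent) for every violated rule."""
    suspects = []
    for name, tags in tracks.items():
        for a, b in rules:
            if a in tags and b not in tags:
                suspects.append((name, a, b))
    return suspects


# Toy annotation data (hypothetical).
tracks = {
    "track1": {"rock", "guitar"},
    "track2": {"rock", "guitar", "loud"},
    "track3": {"rock", "guitar"},
    "track4": {"rock"},
    "track5": {"jazz", "saxophone"},
}

rules = mine_rules(list(tracks.values()), min_support=0.5, min_confidence=0.7)
suspects = flag_suspect_tracks(tracks, rules)
print(rules)
print(suspects)
```

A flagged track is not necessarily mislabelled; in this sketch it only marks an annotation that departs from a frequent pattern and may merit manual review.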

Vocal Detection: An evaluation between general versus focused models

Author: Tsai Yi-Na
Publication venue: 'University of Waikato'
Publication date: 26/04/2011

This thesis presents a technique for improving current vocal detection methods. One of the most popular methods takes a statistical approach: a model is first trained on both vocal and non-vocal example data, and is then used to classify audio signals as vocal or non-vocal. One problem with this method is that the trained model is typically very general and does its best to classify many different types of data. Since the audio signals containing vocals that we care about are songs, we propose to improve vocal detection accuracy by creating focused models targeted at predicting vocal segments according to song artist and artist gender. Useful information such as the artist name is often overlooked, which restricts opportunities to process songs in a way specific to their type and hinders potential success. Experimental results with several models built according to artist and artist gender reveal improvements of up to 17% compared with the general approach. With such improvements, applications such as real-time synchronization of lyrics to vocal segments may become achievable with greater accuracy.

Research Commons@Waikato

Sentiment Classification Using Negation as a Proxy for Negative Sentiment

Author: Delany Sarah Jane
Ohana Bruno
Tierney Brendan
Publication venue: Dublin Institute of Technology
Publication date: 01/01/2016

We explore the relationship between negated text and negative sentiment in the task of sentiment classification. We propose a novel adjustment factor based on negation occurrences as a proxy for negative sentiment that can be applied to lexicon-based classifiers equipped with a negation detection pre-processing step. We performed an experiment on a multi-domain customer reviews dataset obtaining accuracy improvements over a baseline, and we further improved our results using out-of-domain data to calibrate the adjustment factor. We see future work possibilities in exploring negation detection refinements, and expanding the experiment to a broader spectrum of opinionated discourse, beyond that of customer reviews.

Arrow@TUDublin
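
The abstract above does not give the form of the adjustment factor, so the sketch below is only one plausible reading, not the authors' formula. It assumes a toy polarity lexicon, a small set of negation cues, and a linear penalty `alpha` subtracted once per detected negation. `LEXICON`, `NEGATION_CUES`, `alpha`, and the function names are all hypothetical.

```python
import re

# Toy polarity lexicon and negation cues (hypothetical values).
LEXICON = {"good": 1.0, "great": 2.0, "bad": -1.0, "terrible": -2.0, "recommend": 1.0}
NEGATION_CUES = {"not", "no", "never"}


def tokenize(text):
    """Lowercase the text and keep runs of letters and apostrophes."""
    return re.findall(r"[a-z']+", text.lower())


def count_negations(tokens):
    """Count negation cues, including contractions ending in n't."""
    return sum(1 for t in tokens if t in NEGATION_CUES or t.endswith("n't"))


def lexicon_score(tokens):
    """Sum the lexicon polarity of every token; unknown words score 0."""
    return sum(LEXICON.get(t, 0.0) for t in tokens)


def classify(text, alpha=0.0):
    """Label text by lexicon score minus alpha per negation occurrence."""
    tokens = tokenize(text)
    score = lexicon_score(tokens) - alpha * count_negations(tokens)
    return "positive" if score >= 0 else "negative"


review = "I would not recommend this, it is not good"
print(classify(review, alpha=0.0))  # plain lexicon score, no adjustment
print(classify(review, alpha=1.5))  # with the negation-based adjustment
```

In this sketch `alpha` is a free parameter; the abstract reports calibrating the adjustment factor on out-of-domain data, which would correspond here to choosing `alpha` on a separate labelled set.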

The Construction of a 500-Million-Word Reference Corpus of Contemporary Written Dutch

Author: A Bosch Van den
A Braasch
C Rijsbergen Van
G Aston
J Leveling
J Trapman
JC Carletta
M Recasens
M Reynaert
Martin W. C. Reynaert
W Daelemans
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2012

Crossref

Springer - Publisher Connector

Tilburg University Repository

On machine learning and knowledge organisation in Multimedia Information Retrieval

Author: Frankowska-Takhari S.
MacFarlane A.
Missaoui S.
Publication venue: 'Nomos Verlag'
Publication date: 01/01/2020

Recent technological developments have increased the use of machine learning to solve many problems, including many in information retrieval (IR). Deployment of machine-learning techniques is widespread in text search, notably in web search engines (Dai et al., 2011). Multimedia information retrieval, however, still represents a significant challenge to machine learning as a technological solution, although some problems in IR can still be addressed by using appropriate AI techniques. In this paper we review the technological developments, and provide a perspective on the use of machine-learning techniques in conjunction with knowledge organisation techniques to address multimedia IR needs. We take the perspective from the MacFarlane (2016) position paper, that there are some problems in multimedia IR that AI and machine learning cannot currently solve. The semantic gap in multimedia IR (Enser, 2008) remains a significant problem in the field, and a solution to it is many years off. However, there are occasions where the new technological developments allow the use of knowledge organisation and machine learning in multimedia search systems and services. Specifically, we argue that the improvement of detection of some classes of low-level features in images (Karpathy and Li, 2015), music (Byrd and Crawford, 2002) and video (Hu et al., 2011) can be used in conjunction with knowledge organisation to tag or label multimedia content for better retrieval performance. We advocate the use of supervised learning techniques. We provide an overview of the use of knowledge organisation schemes in machine learning, and make recommendations to information professionals on the use of this technology with knowledge organisation techniques to solve multimedia IR problems.

City Research Online

The analysis of canonical and non-canonical questions in an English language podcast

Author: Roomäe Kärt
Publication venue: Tartu Ülikool
Publication date: 01/01/2019

This bachelor’s thesis studies direct questions in “Grammar Day,” an episode of the linguistics podcast Talk the Talk. The aim is to analyze the formulation and function of canonical and non-canonical direct questions in natural oral discourse. The approach used is similar to that of conversation analysis and, hence, the analysis does not proceed from any specific hypotheses. Instead, the thesis takes a data-driven approach. The thesis consists of five sections: the introduction, the section comprising the literature review, an analysis of direct questions in the annotated transcript of the podcast episode, the conclusion, and the list of references. The introduction highlights the importance of examining canonical and non-canonical questions as well as the reasons for choosing a podcast as a source for compiling the corpus of this study. https://www.ester.ee/record=b5239088*es

DSpace at Tartu University Library

Compression-based Parts-of-Speech Tagger for the Arabic Language

Author: Alkhazi Ibrahim
Publication date: 18/12/2019

Bangor University Research Portal