Search CORE

3,694 research outputs found

Assessing the Prosody of Non-Native Speakers of English: Measures and Feature Sets

Author: Batliner A
Coutinho E
Hantke S
Hönig F
Nöth E
Schuller B
Zhang Y
Publication venue: European Language Resources Association
Publication date: 01/02/2016
Field of study

In this paper, we describe a new database with audio recordings of non-native (L2) speakers of English, and the perceptual evaluation experiment conducted with native English speakers for assessing the prosody of each recording. These annotations are then used to compute the gold standard using different methods, and a series of regression experiments is conducted to evaluate their impact on the performance of a regression model predicting the degree of Abstract naturalness of L2 speech. Further, we compare the relevance of different feature groups modelling prosody in general (without speech tempo), speech rate and pauses modelling speech tempo (fluency), voice quality, and a variety of spectral features. We also discuss the impact of various fusion strategies on performance.Overall, our results demonstrate that the prosody of non-native speakers of English as L2 can be reliably assessed using supra- segmental audio features; prosodic features seem to be the most important ones

University of Liverpool Repository

Spiral - Imperial College Digital Repository

Audio Features Affected by Music Expressiveness

Author: Barthet M.
Bishop C. M.
Eerola T.
Kim Y. E.
Krimphoff J.
Pollard H. F.
Presti G.
Presti G.
Wu B.
Publication venue
Publication date: 17/05/2016
Field of study

Within a Music Information Retrieval perspective, the goal of the study presented here is to investigate the impact on sound features of the musician's affective intention, namely when trying to intentionally convey emotional contents via expressiveness. A preliminary experiment has been performed involving

10

tuba players. The recordings have been analysed by extracting a variety of features, which have been subsequently evaluated by combining both classic and machine learning statistical techniques. Results are reported and discussed.Comment: Submitted to ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2016), Pisa, Italy, July 17-21, 201

arXiv.org e-Print Archive

Crossref

An information theoretic characterisation of auditory encoding.

Author: Carlyon RP
Cusack R
Griffiths TD
Grube M
Kumar S
Overath T
von Kriegstein K
Warren JD
Publication venue
Publication date: 01/01/2007
Field of study

The entropy metric derived from information theory provides a means to quantify the amount of information transmitted in acoustic streams like speech or music. By systematically varying the entropy of pitch sequences, we sought brain areas where neural activity and energetic demands increase as a function of entropy. Such a relationship is predicted to occur in an efficient encoding mechanism that uses less computational resource when less information is present in the signal: we specifically tested the hypothesis that such a relationship is present in the planum temporale (PT). In two convergent functional MRI studies, we demonstrated this relationship in PT for encoding, while furthermore showing that a distributed fronto-parietal network for retrieval of acoustic information is independent of entropy. The results establish PT as an efficient neural engine that demands less computational resource to encode redundant signals than those with high information content

CiteSeerX

Directory of Open Access Journals

UCL Discovery

PubMed Central

MPG.PuRe

Beat histogram features for rhythm-based musical genre classification using multiple novelty functions

Author: Lerch Alexander
Lykartsis Athanasios
Publication venue
Publication date: 01/01/2015
Field of study

In this paper we present beat histogram features for multiple level rhythm description and evaluate them in a musical genre classification task. Audio features pertaining to various musical content categories and their related novelty functions are extracted as a basis for the creation of beat histograms. The proposed features capture not only amplitude, but also tonal and general spectral changes in the signal, aiming to represent as much rhythmic information as possible. The most and least informative features are identified through feature selection methods and are then tested using Support Vector Machines on five genre datasets concerning classification accuracy against a baseline feature set. Results show that the presented features provide comparable classification accuracy with respect to other genre classification approaches using periodicity histograms and display a performance close to that of much more elaborate up-to-date approaches for rhythm description. The use of bar boundary annotations for the texture frames has provided an improvement for the dance-oriented Ballroom dataset. The comparably small number of descriptors and the possibility of evaluating the influence of specific signal components to the general rhythmic content encourage the further use of the method in rhythm description tasks

DepositOnce

Broadcasting personalities: the relationship between occupation and music preferences in the BBC Radio programme Desert Island Discs

Author: Knox Don
MacDonald Raymond
Publication venue: 'SAGE Publications'
Publication date: 07/10/2016
Field of study

Edinburgh Research Explorer

ResearchOnline@GCU