Search CORE

34,180 research outputs found

Stacked Convolutional and Recurrent Neural Networks for Music Emotion Recognition

Author: Adavanne Sharath
Drossos Konstantinos
Jarina Roman
Malik Miroslav
Ticha Dasa
Virtanen Tuomas
Publication venue
Publication date: 01/01/2017
Field of study

This paper studies the emotion recognition from musical tracks in the 2-dimensional valence-arousal (V-A) emotional space. We propose a method based on convolutional (CNN) and recurrent neural networks (RNN), having significantly fewer parameters compared with the state-of-the-art method for the same task. We utilize one CNN layer followed by two branches of RNNs trained separately for arousal and valence. The method was evaluated using the 'MediaEval2015 emotion in music' dataset. We achieved an RMSE of 0.202 for arousal and 0.268 for valence, which is the best result reported on this dataset.Comment: Accepted for Sound and Music Computing (SMC 2017

arXiv.org e-Print Archive

Trepo - Institutional Repository of Tampere University

Recommended from our members

Spring School on Language, Music, and Cognition: Organizing Events in Time

Author: Arbib M. A.
Arbib M. A.
Bernstein L.
Carnap R.
Chomsky N.
Chomsky N.
Chomsky N.
Chomsky N.
Chomsky N.
Cross I.
Cross I.
Dahlhaus C.
Fitch W. T.
Fitch W. T.
Gallistel C. R.
Hawkins S.
Hellbernd N.
Hughes D. W.
Jackendoff R.
Lerdahl F.
Levine J.
McQueen Tokita A.
Patel A. D.
Patel A. D.
Persici V.
Ravignani A.
Rebuschat P.
Rothacker E.
Steedman M.
Sundberg J.
Vogeley K.
Wallin N. L.
Wittgenstein L
Publication venue: 'SAGE Publications'
Publication date: 01/01/2018
Field of study

The interdisciplinary spring school “Language, music, and cognition: Organizing events in time” was held from February 26 to March 2, 2018 at the Institute of Musicology of the University of Cologne. Language, speech, and music as events in time were explored from different perspectives including evolutionary biology, social cognition, developmental psychology, cognitive neuroscience of speech, language, and communication, as well as computational and biological approaches to language and music. There were 10 lectures, 4 workshops, and 1 student poster session. Overall, the spring school investigated language and music as neurocognitive systems and focused on a mechanistic approach exploring the neural substrates underlying musical, linguistic, social, and emotional processes and behaviors. In particular, researchers approached questions concerning cognitive processes, computational procedures, and neural mechanisms underlying the temporal organization of language and music, mainly from two perspectives: one was concerned with syntax or structural representations of language and music as neurocognitive systems (i.e., an intrapersonal perspective), while the other emphasized social interaction and emotions in their communicative function (i.e., an interpersonal perspective). The spring school not only acted as a platform for knowledge transfer and exchange but also generated a number of important research questions as challenges for future investigations

City Research Online

Crossref

Kölner UniversitätsPublikationsServer

Directory of Open Access Journals

Publications at Bielefeld University

MPG.PuRe

Big data analytics:Computational intelligence techniques and application areas

Author: Doctor Faiyaz
Iqbal Rahat
Mahmud Shahid
More Brian
Yousuf Usman
Publication venue: 'Elsevier BV'
Publication date: 01/04/2020
Field of study

Big Data has significant impact in developing functional smart cities and supporting modern societies. In this paper, we investigate the importance of Big Data in modern life and economy, and discuss challenges arising from Big Data utilization. Different computational intelligence techniques have been considered as tools for Big Data analytics. We also explore the powerful combination of Big Data and Computational Intelligence (CI) and identify a number of areas, where novel applications in real world smart city problems can be developed by utilizing these powerful tools and techniques. We present a case study for intelligent transportation in the context of a smart city, and a novel data modelling methodology based on a biologically inspired universal generative modelling approach called Hierarchical Spatial-Temporal State Machine (HSTSM). We further discuss various implications of policy, protection, valuation and commercialization related to Big Data, its applications and deployment

University of Essex Research Repository

Crossref

Coventry University Pure Portal

Emotional quantification of soundscapes by learning between samples

Author: S. Ntalampiras
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2020
Field of study

Predicting the emotional responses of humans to soundscapes is a relatively recent field of research coming with a wide range of promising applications. This work presents the design of two convolutional neural networks, namely ArNet and ValNet, each one responsible for quantifying arousal and valence evoked by soundscapes. We build on the knowledge acquired from the application of traditional machine learning techniques on the specific domain, and design a suitable deep learning framework. Moreover, we propose the usage of artificially created mixed soundscapes, the distributions of which are located between the ones of the available samples, a process that increases the variance of the dataset leading to significantly better performance. The reported results outperform the state of the art on a soundscape dataset following Schafer\u2019s standardized categorization considering both sound\u2019s identity and the respective listening context

AIR Universita degli studi di Milano