
    Mandarin Singing Voice Synthesis Based on Harmonic Plus Noise Model and Singing Expression Analysis

    The purpose of this study is to investigate how humans interpret musical scores expressively, and then design machines that sing like humans. We consider six factors that have a strong influence on the expression of human singing. The factors are related to the acoustic, phonetic, and musical features of a real singing signal. Given real singing voices recorded following the MIDI scores and lyrics, our analysis module can extract the expression parameters from the real singing signals semi-automatically. The expression parameters are used to control the singing voice synthesis (SVS) system for Mandarin Chinese, which is based on the harmonic plus noise model (HNM). The results of perceptual experiments show that integrating the expression factors into the SVS system yields a notable improvement in perceptual naturalness, clearness, and expressiveness. By one-to-one mapping of the real singing signal and expression controls to the synthesizer, our SVS system can simulate the interpretation of a real singer with the timbre of a speaker. Comment: 8 pages, technical report.
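    The harmonic plus noise decomposition that this abstract builds on can be sketched in a few lines: the harmonic part is a sum of sinusoids at integer multiples of a (possibly vibrato-modulated) F0, and the noise part stands in for the filtered stochastic component. The sketch below is illustrative only; the function name, parameters, and the use of vibrato as a stand-in "expression" control are assumptions, not details from the paper.

    ```python
    import numpy as np

    def hnm_synthesize(f0, harmonic_amps, sr=22050, duration=1.0,
                       vibrato_rate=0.0, vibrato_depth=0.0, noise_gain=0.0):
        # Instantaneous F0 with optional vibrato (one possible expression control)
        t = np.arange(int(sr * duration)) / sr
        f0_t = f0 * (1.0 + vibrato_depth * np.sin(2 * np.pi * vibrato_rate * t))
        # Phase is the running integral of instantaneous frequency
        phase = 2 * np.pi * np.cumsum(f0_t) / sr
        # Harmonic part: sinusoids at integer multiples of the time-varying F0
        harmonic = sum(a * np.sin((k + 1) * phase)
                       for k, a in enumerate(harmonic_amps))
        # Noise part: white noise, a crude stand-in for the filtered
        # stochastic component of a full HNM
        noise = noise_gain * np.random.default_rng(0).standard_normal(len(t))
        return harmonic + noise

    # A 220 Hz tone with three decaying harmonics and 6 Hz vibrato at 1% depth
    y = hnm_synthesize(220.0, [1.0, 0.5, 0.25],
                       vibrato_rate=6.0, vibrato_depth=0.01)
    ```

    In a full HNM system, the harmonic amplitudes and noise spectrum would be re-estimated frame by frame from analysis of real recordings rather than fixed as above.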

    Sensitivity to musical emotion is influenced by tonal structure in congenital amusia

    Emotional communication in music depends on multiple attributes including psychoacoustic features and tonal system information, the latter of which is unique to music. The present study investigated whether congenital amusia, a lifelong disorder of musical processing, impacts sensitivity to musical emotion elicited by timbre and tonal system information. Twenty-six amusics and 26 matched controls made tension judgments on Western (familiar) and Indian (unfamiliar) melodies played on piano and sitar. Like controls, amusics used timbre cues to judge musical tension in Western and Indian melodies. While controls assigned significantly lower tension ratings to Western melodies than to Indian melodies, thus showing a tonal familiarity effect on tension ratings, amusics provided comparable tension ratings for Western and Indian melodies in both timbres. Furthermore, amusics rated Western melodies as more tense than controls did, as they relied less on tonality cues than controls in rating tension for Western melodies. The implications of these findings for emotional responses to music are discussed.

    The effects of timbre on neural responses to musical emotion

    Timbre is an important factor that affects the perception of emotion in music. To date, little is known about the effects of timbre on neural responses to musical emotion. To address this issue, we used ERPs to investigate whether there are different neural responses to musical emotion when the same melodies are presented in different timbres. With a cross-modal affective priming paradigm, target faces were primed by affectively congruent or incongruent melodies without lyrics, presented on violin, on flute, and in the human voice. Results showed a larger P3 and a larger left-anteriorly distributed LPC in response to affectively incongruent versus congruent trials in the voice version. For the flute version, however, only the LPC effect was found, which was distributed over centro-parietal electrodes. Unlike the voice and flute versions, an N400 effect was observed in the violin version. These findings reveal different patterns of neural responses to emotional processing of music when the same melodies are presented in different timbres, and provide evidence for the hypothesis that there are specialized neural responses to the human voice.
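    The congruent-versus-incongruent comparison behind effects like the P3 and LPC reduces, at its core, to averaging epochs per condition and comparing mean amplitude within a post-stimulus time window. A toy sketch on synthetic data, with all names, sampling rate, and window bounds chosen for illustration rather than taken from the study:

    ```python
    import numpy as np

    def mean_amplitude(epochs, sr, window):
        # epochs: (n_trials, n_samples) array of baseline-corrected ERP epochs
        lo, hi = (int(t * sr) for t in window)
        erp = epochs.mean(axis=0)      # average across trials -> the ERP
        return erp[lo:hi].mean()       # mean amplitude in the analysis window

    rng = np.random.default_rng(0)
    sr = 250  # Hz (illustrative sampling rate)
    congruent = rng.normal(0.0, 1.0, size=(40, sr))   # synthetic 1 s epochs
    incongruent = congruent + 2.0                      # simulated larger positivity
    # The P3 is often quantified in a window roughly 300-500 ms post-stimulus
    window = (0.3, 0.5)
    effect = (mean_amplitude(incongruent, sr, window)
              - mean_amplitude(congruent, sr, window))
    ```

    Real ERP pipelines add filtering, artifact rejection, and statistics across participants; the point here is only the condition-difference logic.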

    Towards Music Structural Segmentation across Genres: Features, Structural Hypotheses, and Annotation Principles

    This work is supported by the China Scholarship Council (CSC) and the EPSRC project Fusing Semantic and Audio Technologies for Intelligent Music Production and Consumption (FAST-IMPACt, EP/L019981/1). Sandler acknowledges the support of the Royal Society as a recipient of a Wolfson Research Merit Award.

    Timbre's function in the perception of affective intentions: Contextual information and effects of learning

    Timbre has been identified by music perception scholars as a component in the communication of affect in music. While its function as a carrier of perceptually useful information about sound-source mechanics has been established, less is understood about whether and how it carries information for communicating affect in music. To investigate these issues, listeners trained in Chinese and Western musical traditions were presented with phrases, measures, and individual notes of recorded excerpts interpreted with a variety of affective intentions by performers on instruments from the two cultures. These excerpts were analyzed to determine acoustic features that are correlated with timbre characteristics. Analysis revealed consistent use of temporal, spectral, and spectrotemporal attributes in judging affective intent in music, suggesting purposeful use of these properties within the sounds by listeners. Comparison between listeners' perceptions across notes and longer segments also revealed greater accuracy in perception with increased musical context. Timbre also appears to be implicated differently in musical communication across musical traditions. Moreover, the role timbre plays appears to vary with position within a musical phrase, suggesting that patterns of change over time are crucial in emotional communication.

    Current Challenges and Visions in Music Recommender Systems Research

    Music recommender systems (MRS) have experienced a boom in recent years, thanks to the emergence and success of online streaming services, which nowadays make almost all of the world's music available at the user's fingertips. While today's MRS considerably help users find interesting music in these huge catalogs, MRS research still faces substantial challenges. In particular, when it comes to building, incorporating, and evaluating recommendation strategies that integrate information beyond simple user-item interactions or content-based descriptors and dig deep into the very essence of listener needs, preferences, and intentions, MRS research becomes a major endeavor, and related publications are quite sparse. The purpose of this trends-and-survey article is twofold. We first identify and shed light on what we believe are the most pressing challenges MRS research is facing, from both academic and industry perspectives. We review the state of the art towards solving these challenges and discuss its limitations. Second, we detail possible future directions and visions we contemplate for the further evolution of the field. The article should therefore serve two purposes: giving the interested reader an overview of current challenges in MRS research, and providing guidance for young researchers by identifying interesting, yet under-researched, directions in the field.

    Perception of Nigerian Dùndún talking drum performances as speech-like vs. music-like: The role of familiarity and acoustic cues

    It seems trivial to identify sound sequences as music or speech, particularly when the sequences come from different sound sources, such as an orchestra and a human voice. Can we also easily distinguish these categories when the sequence comes from the same sound source? And on the basis of which acoustic features? We investigated these questions by examining listeners' classification of sound sequences performed by an instrument intertwining both speech and music: the dùndún talking drum. The dùndún is commonly used in south-west Nigeria as a musical instrument but is also well suited to linguistic usage in what has been described as speech surrogates in Africa. One hundred seven participants from diverse geographical locations (15 different mother tongues represented) took part in an online experiment. Fifty-one participants reported being familiar with the dùndún talking drum, 55% of those being speakers of Yorùbá. During the experiment, participants listened to 30 dùndún samples of about 7 s each, performed either as music or as Yorùbá speech surrogate (n = 15 each) by a professional musician, and were asked to classify each sample as music or speech-like. The classification task revealed the ability of the listeners to identify the samples as intended by the performer, particularly when they were familiar with the dùndún, though even unfamiliar participants performed above chance. A logistic regression predicting participants' classification of the samples from several acoustic features confirmed the perceptual relevance of intensity, pitch, timbre, and timing measures and their interaction with listener familiarity. In all, this study provides empirical evidence supporting the discriminating role of acoustic features and the modulatory role of familiarity in teasing apart speech and music.
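    The analysis approach described here, predicting a binary speech-like/music-like label from acoustic features with logistic regression, can be sketched on toy data. Everything below is illustrative: the feature names, the synthetic values, and the plain gradient-descent fit are assumptions, not the study's actual features or fitting procedure.

    ```python
    import numpy as np

    def sigmoid(z):
        return 1.0 / (1.0 + np.exp(-z))

    def fit_logistic(X, y, lr=0.1, epochs=2000):
        # Plain gradient-descent logistic regression (no regularization)
        w = np.zeros(X.shape[1])
        b = 0.0
        for _ in range(epochs):
            p = sigmoid(X @ w + b)
            w -= lr * (X.T @ (p - y)) / len(y)
            b -= lr * np.mean(p - y)
        return w, b

    # Toy features per sample: [intensity variability, pitch range, timing regularity]
    # (hypothetical values, well separated on purpose)
    rng = np.random.default_rng(1)
    speech = rng.normal([0.8, 0.9, 0.3], 0.1, size=(15, 3))
    music = rng.normal([0.4, 0.5, 0.8], 0.1, size=(15, 3))
    X = np.vstack([speech, music])
    y = np.array([1] * 15 + [0] * 15)  # 1 = speech-like, 0 = music-like

    w, b = fit_logistic(X, y)
    acc = float(np.mean((sigmoid(X @ w + b) > 0.5) == y))
    ```

    The fitted weights play the role of the study's coefficients: their signs and magnitudes indicate which acoustic dimensions push a sample toward the speech-like label.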

    Music Listening, Music Therapy, Phenomenology and Neuroscience
