Search CORE

589 research outputs found

Singing voice correction using canonical time warping

Author: Chen Ming-Tso
Chi Tai-Shih
Luo Yin-Jyun
Su Li
Publication venue
Publication date: 23/11/2017
Field of study

Expressive singing voice correction is an appealing but challenging problem. A robust time-warping algorithm which synchronizes two singing recordings can provide a promising solution. We thereby propose to address the problem by canonical time warping (CTW) which aligns amateur singing recordings to professional ones. A new pitch contour is generated given the alignment information, and a pitch-corrected singing is synthesized back through the vocoder. The objective evaluation shows that CTW is robust against pitch-shifting and time-stretching effects, and the subjective test demonstrates that CTW prevails the other methods including DTW and the commercial auto-tuning software. Finally, we demonstrate the applicability of the proposed method in a practical, real-world scenario

arXiv.org e-Print Archive

Crossref

Vibrato in Singing Voice: The Link between Source-Filter and Sinusoidal Models

Author
Publication venue: Springer
Publication date
Field of study

Springer - Publisher Connector

Mandarin Singing Voice Synthesis Based on Harmonic Plus Noise Model and Singing Expression Analysis

Author: Gu Hung-Yan
Wang Hsin-Min
Wang Ju-Chiang
Publication venue
Publication date: 15/02/2015
Field of study

The purpose of this study is to investigate how humans interpret musical scores expressively, and then design machines that sing like humans. We consider six factors that have a strong influence on the expression of human singing. The factors are related to the acoustic, phonetic, and musical features of a real singing signal. Given real singing voices recorded following the MIDI scores and lyrics, our analysis module can extract the expression parameters from the real singing signals semi-automatically. The expression parameters are used to control the singing voice synthesis (SVS) system for Mandarin Chinese, which is based on the harmonic plus noise model (HNM). The results of perceptual experiments show that integrating the expression factors into the SVS system yields a notable improvement in perceptual naturalness, clearness, and expressiveness. By one-to-one mapping of the real singing signal and expression controls to the synthesizer, our SVS system can simulate the interpretation of a real singer with the timbre of a speaker.Comment: 8 pages, technical repor

arXiv.org e-Print Archive

CiteSeerX

Taking Jazz Singers Seriously: Gender, Race, and Vocal Improvisation

Author: Ceres Lauren Anne
Publication venue: Bard Digital Commons
Publication date: 01/01/2022
Field of study

Senior Project submitted to The Division of Arts of Bard College

Bard College

Effect of the glottal source and the vocal tract on the partials amplitude of vibrato in male voices

Author: Arroabarren Alemán Ixone
Carlosena García Alfonso
Publication venue: 'Acoustical Society of America (ASA)'
Publication date: 01/01/2006
Field of study

In this paper the production of vocal vibrato is investigated. The most relevant features of the acoustical vibrato signal, frequency and amplitude variations of the partials, will be related to the voice production features, glottal source GS and vocal tract response VTR . Unlike previous related works, in this approach, the effect on the amplitude variations of the partials of each one of the above-mentioned voice production features will be identified in recordings of natural singing voice. Moreover, we will take special care of the reliability of the measurements, and, to this aim, a noninteractive vibrato production model will be also proposed in order to describe the vibrato production process and, more importantly, validate the measurements carried out in natural vibrato. Based on this study, it will be shown that during a few vibrato cycles, the glottal pulse characteristics, as well as the VTR, do not significantly change, and only the fundamental frequency of the GS varies. As a result, the pitch variations can be attributed to the GS, and these variations, along with the vocal tract filtering effect, will result in frequency and amplitude variations of the acoustic signal partials.This work was supported in part by the Ministerio de Educación y Ciencia under Grant FPU, AP2000-4674. The Gobierno de Navarra and the Universidad Pública de Navarra are gratefully acknowledged for financial support

Crossref

Academica-e

Vibrato Performance Style: A Case Study Comparing Erhu and Violin

Author: Chew E
Rajab SAYID-KHALID
the 10th International Conference on Computer Music Multidisciplinary Research
YANG L
Publication venue
Publication date: 23/09/2016
Field of study

Queen Mary Research Online