Search CORE

15,563 research outputs found

Singing Voice Recognition for Music Information Retrieval

Author: Mesaros Annamaria
Publication venue: Tampere University of Technology
Publication date: 01/01/2012
Field of study

This thesis proposes signal processing methods for analysis of singing voice audio signals, with the objectives of obtaining information about the identity and lyrics content of the singing. Two main topics are presented, singer identification in monophonic and polyphonic music, and lyrics transcription and alignment. The information automatically extracted from the singing voice is meant to be used for applications such as music classification, sorting and organizing music databases, music information retrieval, etc. For singer identification, the thesis introduces methods from general audio classification and specific methods for dealing with the presence of accompaniment. The emphasis is on singer identification in polyphonic audio, where the singing voice is present along with musical accompaniment. The presence of instruments is detrimental to voice identification performance, and eliminating the effect of instrumental accompaniment is an important aspect of the problem. The study of singer identification is centered around the degradation of classification performance in presence of instruments, and separation of the vocal line for improving performance. For the study, monophonic singing was mixed with instrumental accompaniment at different signal-to-noise (singing-to-accompaniment) ratios and the classification process was performed on the polyphonic mixture and on the vocal line separated from the polyphonic mixture. The method for classification including the step for separating the vocals is improving significantly the performance compared to classification of the polyphonic mixtures, but not close to the performance in classifying the monophonic singing itself. Nevertheless, the results show that classification of singing voices can be done robustly in polyphonic music when using source separation. In the problem of lyrics transcription, the thesis introduces the general speech recognition framework and various adjustments that can be done before applying the methods on singing voice. The variability of phonation in singing poses a significant challenge to the speech recognition approach. The thesis proposes using phoneme models trained on speech data and adapted to singing voice characteristics for the recognition of phonemes and words from a singing voice signal. Language models and adaptation techniques are an important aspect of the recognition process. There are two different ways of recognizing the phonemes in the audio: one is alignment, when the true transcription is known and the phonemes have to be located, other one is recognition, when both transcription and location of phonemes have to be found. The alignment is, obviously, a simplified form of the recognition task. Alignment of textual lyrics to music audio is performed by aligning the phonetic transcription of the lyrics with the vocal line separated from the polyphonic mixture, using a collection of commercial songs. The word recognition is tested for transcription of lyrics from monophonic singing. The performance of the proposed system for automatic alignment of lyrics and audio is sufficient for facilitating applications such as automatic karaoke annotation or song browsing. The word recognition accuracy of the lyrics transcription from singing is quite low, but it is shown to be useful in a query-by-singing application, for performing a textual search based on the words recognized from the query. When some key words in the query are recognized, the song can be reliably identified

Statistical Approaches for Signal Processing with Application to Automatic Singer Identification

Author: Yang Shiteng
Publication venue: RIT Scholar Works
Publication date: 01/06/2016
Field of study

In the music world, the oldest instrument is known as the singing voice that plays an important role in musical recordings. The singer\u27s identity serves as a primary aid for people to organize, browse, and retrieve music recordings. In this thesis, we focus on the problem of singer identification based on the acoustic features of singing voice. An automatic singer identification system is constructed and has achieved a very high identification accuracy. This system consists of three crucial parts: singing voice detection, background music removal and pattern recognition. These parts are introduced and explored in great details in this thesis. To be specific, in terms of the singing voice detection, we firstly study a traditional method, double GMM. Then an improved method, namely single GMM, is proposed. The experimental result shows that the detection accuracy of single GMM can be achieved as high as 96.42%. In terms of the background music removal, Non-negative Matrix Factorization (NMF) and Robust Principal Component Analysis (RPCA) are demonstrated. The evaluation result shows that RPCA outperforms NMF. In terms of pattern recognition, we explore the algorithms of Support Vector Machine (SVM) and Gaussian Mixture Model (GMM). Based on the experimental results, it turns out that the prediction accuracy of GMM classifier is about 16% higher than SVM

RIT Scholar Works

Vocal piano accompaniment: A constant research towards emancipation (2)

Author: Peñalver Vilar José María
Vallés Grau Luis
Publication venue: 'Science Publishing Group'
Publication date: 01/03/2020
Field of study

This is the second and last article in the series dedicated to the investigation of the evolution of vocal piano accompaniment through history and the role-played by the piano in its relationship with the voice. If the previous chapter focused from the beginnings of piano accompaniment to Franz Schubert, this one will take up again the analysis of the piano part from the last lieder of Franz Schubert to Arnold Schoenberg. The research method used continues to be based on the musical analysis of the piano part, addressing both issues of the piano itself and its link with the text and the vocal part. By means of musical examples of different composers, the article investigates all those aspects that provide evidence of the change in the role of the piano in the correspondence of the piano with the voice and its repercussion on the final result of the work. In this way, we will look at the history of piano accompaniment for voice and see how the influence of Schubert's conception of vocal piano accompaniment materializes in contemporary and later composers, as well as the crucial change in the history of the piano as an accompanying instrument introduced by Schubert. The results and conclusions drawn from the evolution presented in these two articles are presented at the end of the article, based on the different aspects involved in the musical event, such as rhythm, harmony or texture

Repositori Institucional de la Universitat Jaume I

Reggae on the Silk Road: the Globalisation of Uyghur Pop

Author: Harris Rachel
Publication venue: 'Cambridge University Press (CUP)'
Publication date: 01/09/2005
Field of study

SOAS Research Online

The songwriting coalface: where multiple intelligences collide

Author: Harrison Clive
Publication venue: University of Bedfordshire
Publication date: 01/11/2017
Field of study

This paper investigates pedagogy around songwriting professional practice. Particular focus is given to the multiple intelligence theory of Howard Gardner as a lens through which to view songwriting practice, referenced to recent songwriting‐specific research (e.g. McIntyre, Bennett). Songwriting education provides some unique challenges; firstly, due to the qualitative nature of assessment and the complex and multi‐faceted nature of skills necessary (lyric writing, composing, recording, and performing), and secondly, in some less‐tangible capacities beneficial to the songwriter (creative skills, and nuanced choice‐making). From the perspective of songwriting education, Gardner’s MI theory provides a ‘useful fiction’ (his term) for knowledge transfer in the domain, especially (and for this researcher, surprisingly) in naturalistic intelligence

In the footsteps of the "Great Jewish Composer" : Don Harrán, Salamone Rossi: Jewish musician in late Renaissance Mantua. Oxford University Press, 1999, 310 pp. [Rezension]

Author: Seroussi Edwin
Publication venue
Publication date: 15/01/2009
Field of study

The Art of Practice – Understanding the process of musical maturation through reflection.

Author: Esslin-Peard MS
Shorrocks T
Welch GF
Publication venue: Waxmann Verlag. ISBN: 978–3–8309–3313–7
Publication date: 01/01/2015
Field of study

Much has been written in the last 30 years about musical practice and performance, but there is little consensus over what practice really means, or how musicians progress by practising. Researchers tend to focus on specific elements in practice rather than maintaining a more holistic perspective. Whilst academics historically focused on (primarily Western) classical musicians, more recent research has encompassed popular, jazz and folk musicians. The current research project at the University of Liverpool focuses on the practice and performance of both popular and classical musicians as described in students’ reflective essays. We posit a model for musical maturation that incorporates key elements from psychology, epistemology and sociocultural theory. (DIPF/Orig.

University of Liverpool Repository