15 research outputs found
Real-Time Audio-to-Score Alignment of Singing Voice Based on Melody and Lyric Information
The singing voice is unique in music: a vocal performance conveys both musical (melody/pitch) and lyric (text/phoneme) content. This paper aims to exploit the advantages of melody and lyric information for real-time audio-to-score alignment of the singing voice. First, lyrics are added as a separate observation stream to a template-based hidden semi-Markov model (HSMM), whose observation model is based on the construction of vowel templates. Second, early and late fusion of melody and lyric information are performed during real-time audio-to-score alignment. An experiment conducted with two professional singers (male/female) shows that the performance of a lyrics-based system is comparable to that of melody-based score-following systems. Furthermore, late fusion of melody and lyric information substantially improves alignment performance. Finally, maximum a posteriori (MAP) adaptation of the vowel templates from one singer to the other suggests that lyric information can be used efficiently for any singer.
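The late-fusion idea described above can be sketched as a log-linear combination of the per-state likelihoods produced by the melody and lyric observation models. This is a minimal illustration, not the paper's implementation; the `weight` parameter and uniform frame-level combination are assumptions.

```python
import numpy as np

def late_fusion(melody_logprobs, lyric_logprobs, weight=0.5):
    """Log-linear late fusion of two observation streams.

    melody_logprobs, lyric_logprobs: arrays of shape (n_states,)
    holding per-score-position log-likelihoods for the current
    audio frame, one from each observation model.
    weight: interpolation weight for the melody stream (a
    hypothetical parameter; a real system would tune it on
    held-out data).
    """
    melody_logprobs = np.asarray(melody_logprobs, dtype=float)
    lyric_logprobs = np.asarray(lyric_logprobs, dtype=float)
    fused = weight * melody_logprobs + (1.0 - weight) * lyric_logprobs
    # Normalize so the result is a proper log-posterior over positions.
    fused -= np.logaddexp.reduce(fused)
    return fused
```

The decoder would then use the fused posterior in place of a single-stream observation probability when tracking the score position.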
Music Information Retrieval: An Inspirational Guide to Transfer from Related Disciplines
The emerging field of Music Information Retrieval (MIR) has been influenced by neighboring domains in signal processing and machine learning, including automatic speech recognition, image processing, and text information retrieval. In this contribution, we start with concrete examples of methodology transfer between speech and music processing, structured around the building blocks of pattern recognition: preprocessing, feature extraction, and classification/decoding. We then take a higher-level viewpoint in describing sources of mutual inspiration derived from text and image information retrieval. We conclude that dealing with the peculiarities of music in MIR research has contributed to advancing the state of the art in other fields, and that many future challenges in MIR are strikingly similar to those that other research areas have been facing.
Leveraging repetition for improved automatic lyric transcription in popular music
Transcribing lyrics from musical audio is a challenging research problem which has not benefited from many of the advances made in the related field of automatic speech recognition, owing to the prevalent musical accompaniment and the differences between the spoken and sung voice. However, one aspect of this problem which has yet to be exploited by researchers is that significant portions of the lyrics are repeated throughout a song. In this paper we investigate how this information can be leveraged to form a consensus transcription with improved consistency and accuracy. Our results show that improvements can be gained using a variety of techniques, and that relative gains are largest under the most challenging and realistic experimental conditions.
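The consensus idea can be sketched as a per-word majority vote across repeated occurrences of the same lyric line. This minimal sketch assumes the hypotheses are already word-aligned and of equal length; a real system (and the paper's method) would first align them, e.g. ROVER-style.

```python
from collections import Counter

def consensus(transcriptions):
    """Form a consensus hypothesis by per-position majority vote.

    transcriptions: list of word lists, one hypothesis per repeated
    occurrence of the same lyric line. Assumes equal-length,
    word-aligned hypotheses (a simplifying assumption for this sketch).
    """
    result = []
    for words in zip(*transcriptions):
        # Keep the word most hypotheses agree on at this position.
        word, _count = Counter(words).most_common(1)[0]
        result.append(word)
    return " ".join(result)
```

For example, three noisy transcriptions of the same repeated line can outvote a single recognition error, which is why repetition improves consistency.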
PoLyScriber: Integrated Training of Extractor and Lyrics Transcriber for Polyphonic Music
Lyrics transcription of polyphonic music is challenging because the background music affects lyrics intelligibility. Typically, lyrics transcription is performed by a two-step pipeline, i.e. a singing-vocal extraction frontend followed by a lyrics transcriber backend, where the frontend and backend are trained separately. Such a two-step pipeline suffers from both imperfect vocal extraction and a mismatch between frontend and backend. In this work, we propose a novel end-to-end integrated training framework, which we call PoLyScriber, to globally optimize the vocal extraction frontend and lyrics transcriber backend for lyrics transcription in polyphonic music. The experimental results show that our proposed integrated training model achieves substantial improvements over existing approaches on publicly available test datasets.