38 research outputs found

    Advances in the neurocognition of music and language

    Singing voice phoneme segmentation by hierarchically inferring syllable and phoneme onset positions

    Paper presented at: Interspeech 2018, held 2 to 6 September 2018 in Hyderabad, India. In this paper, we tackle the singing voice phoneme segmentation problem in the singing training scenario by using language-independent information: onset and prior coarse duration. We propose a two-step method. In the first step, we jointly calculate the syllable and phoneme onset detection functions (ODFs) using a convolutional neural network (CNN). In the second step, the syllable and phoneme boundaries and labels are inferred hierarchically by using a duration-informed hidden Markov model (HMM). To achieve the inference, we incorporate the a priori duration model as the transition probabilities and the ODFs as the emission probabilities into the HMM. The proposed method is designed in a language-independent way such that no phoneme class labels are used. For the model training and algorithm evaluation, we collect a new jingju (also known as Beijing or Peking opera) solo singing voice dataset and manually annotate the boundaries and labels at phrase, syllable and phoneme levels. The dataset is publicly available. The proposed method is compared with a baseline method based on hidden semi-Markov model (HSMM) forced alignment. The evaluation results show that the proposed method outperforms the baseline by a large margin on both the segmentation and onset detection tasks. This work is supported by the CompMusic project (ERC grant agreement 267583).

    The evolution of language: Proceedings of the Joint Conference on Language Evolution (JCoLE)

    Melody as Prosody: Toward a Usage-Based Theory of Music

    Thomas M. Pooley, Gary A. Tomlinson. Rationalist modes of inquiry have dominated the cognitive science of music over the past several decades. This dissertation contests many rationalist assumptions, including the core tenets of nativism, modularity, and computationism, by drawing on a wide range of evidence from psychology, neuroscience, linguistics, and cognitive music theory, as well as original data from a case study of Zulu song prosody. An alternative biocultural approach to the study of music and mind is outlined that takes account of musical diversity by attending to shared cognitive mechanisms. Grammar emerges through use, and cognitive categories are learned and constructed in particular social contexts. This usage-based theory of music shows how domain-general cognitive mechanisms for pattern-finding and intention-reading are crucial to acquisition, and how Gestalt principles are invoked in perception. Unlike generative and other rationalist approaches that focus on a series of idealizations, and the cognitive 'competences' codified in texts and musical scores, the usage-based approach investigates actual performances in everyday contexts by using instrumental measures of process. The study focuses on song melody because it is a property of all known musics. Melody is used for communicative purposes in both song and speech. Vocalized pitch patterning conveys a wide range of affective, propositional, and syntactic information through prosodic features that are shared by the two domains. The study of melody as prosody shows how gradient pitch features are crucial to the design and communicative functions of song melodies. The prosodic features shared by song and speech include speech tone, intonation, and pitch-accent.
    A case study of ten Zulu memulo songs shows that pitch is not used in the discrete or contrastive fashion proposed by many cognitive music theorists and most (generative) phonologists. Instead, there is a range of pitch categories that includes pitch targets, glides, and contours. These analyses also show that song melody has a multi-dimensional pitch structure, and that it is a dynamic adaptive system that is irreducible in its complexity.

    The Austronesian languages

    This is a revised edition of The Austronesian languages (2009), which was published as a paperback in the then Pacific Linguistics series (ISBN 9780858836020). This revision includes typographical corrections, an improved index, and various minor content changes. The release of the open access edition serves to meet the strong ongoing demand for this important handbook, of which only 200 copies of the first edition were printed. This is the first single-authored book that attempts to describe the Austronesian language family in its entirety. Topics covered include: the physical and cultural background, official and national languages, largest and smallest languages in all major geographical regions, language contact, sound systems, linguistic palaeontology, morphology, syntax, the history of scholarship on Austronesian languages, and a critical assessment of the reconstruction of Proto Austronesian phonology. Australian National University, College of Asia and the Pacifi

    The Prague School and Theories of Structure

    This series investigates the commonalities and differences between the natural sciences and the humanities. The concept of "influence", or "mutual influence", is to be questioned in favour of a more dynamic concept of "interfacing" (connection/networking). A fundamental starting point is the recognition that the two spheres of knowledge, the humanities and the natural sciences, often develop new models of inquiry at the same time, thereby responding to complex scientific and cultural phenomena. The concept of "interfacing" implies an integrated view of new fields of knowledge in new contexts. No longer bound to the traditional notion of "cause and effect", isomorphism implies simultaneity rather than consequentiality. One sphere does not always influence the other; isomorphism implies shared discoveries through which both fields develop new investigative models and systems of representation at the same time. Dialogue and mutual understanding between the so-called "two cultures" are thus stimulated. Important areas of research are interfacing models and paradigms in the natural sciences and the humanities, culturally conditioned representations of science and technology, scientific discoveries and narrative discourses, memoirs of scientists, the crossing of boundaries between the natural sciences and the humanities through learning, and the enrichment of the humanities through applied natural sciences, including information technologies. The series comprises both monographs and essay collections in English, German, French and Italian. The coexistence of different languages attests to the intention of the editors and the scientific advisory board to develop an integrated body of knowledge from a European perspective.

    Proceedings of the VIIth GSCP International Conference

    The 7th International Conference of the Gruppo di Studi sulla Comunicazione Parlata, dedicated to the memory of Claire Blanche-Benveniste, chose Speech and Corpora as its main theme. The broad international background of the 235 authors, from 21 countries and 95 institutions, led to papers on many different languages. The 89 papers in this volume reflect the themes of the conference: the compilation and annotation of spoken corpora, together with the related technological fields; the relation between prosody and pragmatics; speech pathologies; and various papers on phonetics, speech and linguistic analysis, pragmatics and sociolinguistics. Many papers are also dedicated to speech and second language studies. The online publication with FUP allows direct access to sound and video linked to papers (when downloaded).