21 research outputs found
An Empirical Method for Comparing Pitch Patterns in Spoken and Musical Melodies: A Comment on J.G.S. Pearl's "Eavesdropping with a Master: Leos Janáček and the Music of Speech."
Music and speech both feature structured melodic patterns, yet these patterns are rarely compared using empirical methods. One reason for this has been a lack of tools which allow quantitative comparisons of spoken and musical pitch sequences. Recently, a new model of speech intonation perception has been proposed based on principles of pitch perception in speech. The “prosogram” model converts a sentence's fundamental frequency contour into a sequence of discrete tones and glides. This sequence is meant to represent a listener's perception of pitch in connected speech. This article briefly describes the prosogram and suggests a few ways in which it can be used to compare the structure of spoken and musical melodies
A HuComTech-korpusz Ă©s -adatbázis számĂtĂłgĂ©pes feldolgozási lehetĹ‘sĂ©gei : automatikus prozĂłdiai annotáciĂł
A kĂĽlönböz kommunikáciĂłs esemĂ©nyek számĂtĂłgĂ©pes elemzĂ©se során nĂ©lkĂĽlözhetetlen támpontot jelent, hogy gĂ©pileg feldolgozhatĂł formában elĂ©rhetk legyenek az azokat kĂsĂ©r Ă©s általánosságban jellemz fizikai jegyek, mint amilyen a gyorsulĂł beszĂ©dtempĂł vagy az eltĂ©r hanghordozás. A jelen tanulmányban bemutatásra kerĂĽl, a HuComTech-korpusz Ă©s -adatbázis bvĂtĂ©sekĂ©nt tervezett automatikus prozĂłdiai annotáciĂł ezeknek az informáciĂłknak a feltĂ©rkĂ©pezĂ©sĂ©t szolgálja abbĂłl a cĂ©lbĂłl, hogy a lehetvĂ© tegye a korpusz annotáciĂłiban rögzĂtĂ©sre kerĂĽlt kommunikáciĂłs jelensĂ©gek akusztikai jellemzĂ©sĂ©t. A tanulmány a korpusz általános bemutatása után ennek cĂ©ljait, mĂłdszereit Ă©s lehetsĂ©geit kĂvánja rĂ©szletezni
Stylisation of Thai tones using Prosogram
The aim of this study is to establish whether stylisation of F0 contours based on d'Alessandro and Mertens's model of tonal perception can be successfully applied to lexical tones of Central Thai. The percentage of correct responses to the manipulated stimuli was found to be significantly lower than the results for natural tones reported in literature on the subject.The aim of this study is to establish whether stylisation of F0 contours based on d'Alessandro and Mertens's model of tonal perception can be successfully applied to lexical tones of Central Thai. The percentage of correct responses to the manipulated stimuli was found to be significantly lower than the results for natural tones reported in literature on the subject
Outils d'aide Ă l'annotation prosodique de corpus
National audienceL'étude que nous proposons ici fait suite à de nombreux travaux que nous avons entrepris depuis la fin des années soixante-dix, portant sur les aspects sémantiques, pragmatiques et la subjectivité. Dans ces travaux, nous avons été amenée en particulier à tester différents modèles syntaxiques, sémantiques, pragmatiques, empruntés à la littérature ou originaux, ayant tous la propriété de quantifier en prédiction, des valeurs mélodiques telles que l'amplitude ou les valeurs maximales (voir à ce sujet, Caelen-Haumont, 1991 ; Caelen-Haumont, à paraître l'Harmattan).Dans l'étude présente, nous utilisons une grille d'analyse particulière en 9 niveaux, conçue pour analyser avec précision les modulations et l'amplitude de F0. Notre objectif est de contribuer à la description du parler du français régional de Bordeaux, par le domaine de la prosodie. Nous focalisons notre étude sur des éléments essentiels de la mélodie, que sont ses proéminences. Pour ce faire, nous nous sommes intéressée aux productions d'une famille présentant des critères de très grande stabilité géographique, ayant toujours vécu depuis des générations dans le même village
Outils d'aide Ă l'annotation prosodique de corpus
National audienceL'étude que nous proposons ici fait suite à de nombreux travaux que nous avons entrepris depuis la fin des années soixante-dix, portant sur les aspects sémantiques, pragmatiques et la subjectivité. Dans ces travaux, nous avons été amenée en particulier à tester différents modèles syntaxiques, sémantiques, pragmatiques, empruntés à la littérature ou originaux, ayant tous la propriété de quantifier en prédiction, des valeurs mélodiques telles que l'amplitude ou les valeurs maximales (voir à ce sujet, Caelen-Haumont, 1991 ; Caelen-Haumont, à paraître l'Harmattan).Dans l'étude présente, nous utilisons une grille d'analyse particulière en 9 niveaux, conçue pour analyser avec précision les modulations et l'amplitude de F0. Notre objectif est de contribuer à la description du parler du français régional de Bordeaux, par le domaine de la prosodie. Nous focalisons notre étude sur des éléments essentiels de la mélodie, que sont ses proéminences. Pour ce faire, nous nous sommes intéressée aux productions d'une famille présentant des critères de très grande stabilité géographique, ayant toujours vécu depuis des générations dans le même village
Plurality of languages, data and methods for a general model of melodic variation in Romance dialects
Les faits prosodiques qui caractérisent un énoncé dépendent de plusieurs facteurs et présentent une importante variation dans les langues du monde. L’interaction entre intonation de phrase, contraintes accentuelles, organisation de l’information et effets stylistiques produit des phénomènes locaux qui participent à un ordre de construction étendu qui peut être étudié en fonction de la phonologie des diverses langues. La description de celle-ci dépend toutefois de plusieurs traditions d’analyse et d’une variété de méthodes de représentation et d’interprétation qui s’imposent à différents niveaux. Si, d’un côté, nous commençons à disposer de moyens pour une étude générale qui nous permette de laisser apparaître des universaux prosodiques, une typologie trop radicale risque d’annuler des traits de différenciation pourtant essentiels en termes dialectologiques. Sur la base d’une série d’exemples concernant la variation de l’intonation dans l’espace roman, dans cet article j’essaie de montrer l’utilité de garder une approche écologique qui garantisse l’observation de la diversité prosodique des langues.Speech prosody depends on multiple factors and presents a significant variation among world languages. Interaction between sentence intonation, stress, information patterning and stylistic effects produce local phenomena contributing to complex outcomes that should be studied in accordance with the specific phonology of each language. However, the description of this interaction relies on traditional approaches and various representation methods: different interpretation models are then applied at each level. If, on the one hand, we begin to manage some tools for a general study which seem to bring to light prosodic universals, on the other hand, a too radical approach to prosodic typology may cause the risk of deleting features which are essential from a dialectological point of view. Based on a series of examples concerning the intonational variation in the Romance area, I will try to show in this paper the need to keep using an ecological approach towards languages prosodic diversity
Acoustic foundations of the speech-to-song illusion
In the “speech-to-song illusion”, certain spoken phrases are heard as highly song-like when isolated from context and repeated. This phenomenon occurs to a greater degree for some stimuli than for others, suggesting that particular cues prompt listeners to perceive a spoken phrase as song. Here we investigated the nature of these cues across four experiments. In Experiment 1, participants were asked to rate how song-like spoken phrases were after each of eight repetitions. Initial ratings were correlated with the consistency of an underlying beat and within-syllable pitch slope, while rating change was linked to beat consistency, within-syllable pitch slope, and melodic structure. In Experiment 2, the within-syllable pitch slope of the stimuli was manipulated, and this manipulation changed the extent to which participants heard certain stimuli as more musical than others. In Experiment 3, the extent to which the pitch sequences of a phrase fit a computational model of melodic structure was altered, but this manipulation did not have a significant effect on musicality ratings. In Experiment 4, the consistency of inter-syllable timing was manipulated, but this manipulation did not have an effect on the change in perceived musicality after repetition. Our methods provide a new way of studying the causal role of specific acoustic features in the speech-to-song illusion via subtle acoustic manipulations of speech, and show that listeners can rapidly (and implicitly) assess the degree to which non-musical stimuli contain musical structure
A percepção de valores pragmáticos na entoação de sentenças imperativas no Português Brasileiro: um estudo experimental
Es te artigo apresenta os resultados de um estudo perceptivo sobre a entoação de sentenças imperativas no portuguĂŞs brasileiro produzidas com os seguintes valores pragmáticos: ordem, desafio, pedido e sugestĂŁo. A sentença imperativa Conversa com ele foi gravada, com essas quatro intenções ilocutĂłrias, por quatro sujeitos, dois homens e duas mulheres. Os contornos entonacionais produzidos foram analisados segundo a abordagem do modelo IPO, um modeloperceptivo de descrição da entoação que propõe dois mĂ©todos de estilização da curva de frequĂŞncia fundamental: a estilização close copy e a estandardização dos movimentos melĂłdicos pautada na equivalĂŞncia perceptiva. Por meio do uso do software Praat, contornos melĂłdicos ressintetizados foram criados para, posteriormente, um grupo de 20 juĂzes julgarem a sua aceitabilidade em um teste de percepção. Os resultados do teste de percepção indicaram a relevânciaperceptiva de caracterĂsticas fonĂ©ticas dos movimentos melĂłdicos que atuam na identificação do valor funcional dos contornos entonacionais manipulados. A variável direção do movimento melĂłdico foi relevante para o reconhecimento da ordem, e, especialmente, para o do desafio e do pedido; a extensĂŁo do movimento revelou-se essencial para a caracterização do contorno dedesafio; a variável alinhamento do pico de F0, para os contornos do pedido e da sugestĂŁo; por fim, a variável campo tonal, para o contorno da ordem
Acoustic correlates of lexical stress in Moroccan Arabic
Presently there is no consensus regarding the interpretation and analysis of the stress system of Moroccan Arabic. This paper tests whether the acoustic realisation of syllables support one widely adopted interpretation of lexical stress, according to which stress is either penultimate or final depending on syllable weight. The experiment reports on word-initial syllables that differ in presumed stress status. Target words were embedded in a carrier sentence within a scripted mock dialogue to ensure that the measurements reflect lexical stress rather than phrase-level prominence. Results from all four acoustic parameters tested (f0, duration, Centre of Gravity and vowel quality) showed that there were no differences as a function of presumed stress status, thus failing to support an interpretation according to which stressed syllables are acoustically differentiated. We consider the results in relation to previous claims and observations, and conclude that the absence of acoustic correlates of presumed stress is compatible with the view that Moroccan Arabic lacks lexical stress