5,365 research outputs found

    Pauses and the temporal structure of speech

    Get PDF
    Natural-sounding speech synthesis requires close control over the temporal structure of the speech flow. This includes a full predictive scheme for the durational structure and in particuliar the prolongation of final syllables of lexemes as well as for the pausal structure in the utterance. In this chapter, a description of the temporal structure and the summary of the numerous factors that modify it are presented. In the second part, predictive schemes for the temporal structure of speech ("performance structures") are introduced, and their potential for characterising the overall prosodic structure of speech is demonstrated

    Speech and music discrimination: Human detection of differences between music and speech based on rhythm

    Get PDF
    Rhythm in speech and singing forms one of its basic acoustic components. Therefore, it is interesting to investigate the capability of subjects to distinguish between speech and singing when only the rhythm remains as an acoustic cue. For this study we developed a method to eliminate all linguistic components but rhythm from the speech and singing signals. The study was conducted online and participants could listen to the stimuli via loudspeakers or headphones. The analysis of the survey shows that people are able to significantly discriminate between speech and singing after they have been altered. Furthermore, our results reveal specific features, which supported participants in their decision, such as differences in regularity and tempo between singing and speech samples. The hypothesis that music trained people perform more successfully on the task was not proved. The results of the study are important for the understanding of the structure of and differences between speech and singing, for the use in further studies and for future application in the field of speech recognition

    Prosody and Pragmatics in Parenthetical Insertions in Catalan

    Get PDF
    This paper analyses the role of prosody in parenthetical insertions, a type of structure that is extremely common in both speech and writing. The materials under study come from a corpus of spontaneous speech acts in Central Catalan (with varying degrees of spontaneity) from which a corpus of oral parenthetical insertions has been compiled. The prototypical prosodic features of a parenthetical insertion in Catalan are: prosodic autonomy, limited extension, production in between pauses or final pause, tendency towards acceleration, fall in intensity, lower pitch range and, finally, falling or rising melodic pattern. While the final fall is the most frequent pattern in spontaneous conversations with a high degree of confidence between interlocutors, a final rising structure is found in interviews in which the degree of confidence between participants is smaller, their roles are unequal, and the interviewed constructs a narrative discourse. We thus suggest that the pitch contour of parenthetical insertions is related to formality and discourse typology (in this case, narrative vs. dialogue). Bearing in mind the discursive functions performed by these insertions, we propose a typology which classifies them with regards to two main functions: completion of information, and modalisation.
    corecore