5,947 research outputs found
Speech rhythm: a metaphor?
Is speech rhythmic? In the absence of evidence for a traditional view that languages strive to coordinate either syllables or stress-feet with regular time intervals, we consider the alternative that languages exhibit contrastive rhythm subsisting merely in the alternation of stronger and weaker elements. This is initially plausible, particularly for languages with a steep ‘prominence gradient’, i.e. a large disparity between stronger and weaker elements; but we point out that alternation is poorly achieved even by a ‘stress-timed’ language such as English, and, historically, languages have conspicuously failed to adopt simple phonological remedies that would ensure alternation. Languages seem more concerned to allow ‘syntagmatic contrast’ between successive units and to use durational effects to support linguistic functions than to facilitate rhythm. Furthermore, some languages (e.g. Tamil, Korean) lack the lexical prominence which would most straightforwardly underpin prominence alternation. We conclude that speech is not incontestibly rhythmic, and may even be antirhythmic. However, its linguistic structure and patterning allow the metaphorical extension of rhythm in varying degrees and in different ways depending on the language, and that it is this analogical process which allows speech to be matched to external rhythms
Non-native contrasts in Tongan loans
We present three case studies of marginal contrasts in Tongan loans from English, working with data from three speakers. Although Tongan lacks contrasts in stress or in CC vs. CVC sequences, secondary stress in loans is contrastive, and is sensitive to whether a vowel has a correspondent in the English source word; vowel deletion is also sensitive to whether a vowel is epenthetic as compared to the English source; and final vowel length is sensitive to whether the penultimate vowel is epenthetic, and if not, whether it corresponds to a stressed or unstressed vowel in the English source. We provide an analysis in the multilevel model of Boersma (1998) and Boersma & Hamann (2009), and show that the loan patterns can be captured using only constraints that plausibly are needed for native-word phonology, including constraints that reflect perceptual strategies
Speech and music discrimination: Human detection of differences between music and speech based on rhythm
Rhythm in speech and singing forms one of its basic acoustic components. Therefore, it is interesting to investigate the capability of subjects to distinguish between speech and singing when only the rhythm remains as an acoustic cue. For this study we developed a method to eliminate all linguistic components but rhythm from the speech and singing signals. The study was conducted online and participants could listen to the stimuli via loudspeakers or headphones. The analysis of the survey shows that people are able to significantly discriminate between speech and singing after they have been altered. Furthermore, our results reveal specific features, which supported participants in their decision, such as differences in regularity and tempo between singing and speech samples. The hypothesis that music trained people perform more successfully on the task was not proved. The results of the study are important for the understanding of the structure of and differences between speech and singing, for the use in further studies and for future application in the field of speech recognition
Recommended from our members
The effect of pitch span on intonational plateaux
Previous research has indicated that the H (high) of a nuclear accent may be realized as a flat stretch of contour rather than as a single turning point. Both the duration of this plateau and its alignment within the accented syllable are affected by the segmental and prosodic structure of the utterance. The present work investigates whether a non-structural variable, namely pitch span, also affects the realization of the plateau. Speakers replicated all-sonorant utterances in different pitch spans. Results show that both the duration and alignment of the plateau vary with pitch span but in ways different from the way they vary with prosodic structure. Importantly, results also indicate that, when using a proportional measure of alignment, the end of the plateau is anchored within the syllable for each speaker and may be a marker of linguistic structure
It´s all about the rhythm - A neurocognitive approach towards the Rhythm Rule in German and English
The aim of the present doctoral thesis is to gain deeper insight into the cognitive processing of rhythmically irregular structures in form of stress clashes and stress lapses in comparison to structures that follow the Rhythm Rule. Although stress clashes and stress lapses are allowed and hence present in speech, they are nonetheless marked as rhythmically ill-formed. Hence, since rhythmically induced stress shifts appear often in languages like German, and especially English, it was decided to investigate how the brain reacts to structures that do not meet with rhythmic expectations but are allowed in the investigated language. In this respect, this rhythmic phenomenon differs from the rhythmic deviation types that have been investigated to date. Four studies comprising five experiments using the ERP technique were conducted within the scope of the present thesis. In order to support and complement the findings of the ERP studies, an additional production and perception study and two reaction time studies were designed and undertaken on German rhythmic irregularities.
Three ERP studies were conducted on the cognitive processing of rhythmic irregularities in German phrases and compounds. Due to the given task settings in the ERP studies, measured reaction times were not meaningful. Therefore, independent reaction time studies with the identical set of stimuli were performed and are reported with the corresponding ERP studies. Based on the findings of the first ERP experiment on German phrases, a follow-up study was conducted in which the sensitivity towards attentional and contextual influences was further tested by using modified task settings and adjusted stimuli presentation modalities. The study on German compounds consists of two experiments which tried to shed further light on the task-sensitivity of the ERP components found in the studies on German phrases.
A further ERP study was set up in order to compare the influence of the RR on processing in German and English by using similar deviations in English. Therefore, English compounds were tested either obeying or deviating from this rule. Moreover, due to the aforementioned syntactic differences between stress shift targets in German and English, this study allowed for a combined yet disentangled investigation of rhythmical and lexical influences on speech processing.
In previous research, the application of the RR in speech production was mainly investigated on English data and exclusively in compound structures in German. Therefore, an additional production and perception study was used as a pre-test for the planned ERP studies on German. Investigating the application and perception of the RR should deliver further insights into its importance in German not only on the word level (in compounds) but also on the phrasal level and therefore complement and extend the findings of previous studies
Recommended from our members
Speech rhythm: the language-specific integration of pitch and duration
Experimental phonetic research on speech rhythm seems to have reached an impasse. Recently, this research field has tended to investigate produced (rather than perceived) rhythm, focussing on timing, i.e. duration as an acoustic cue, and has not considered that rhythm perception might be influenced by native language. Yet evidence from other areas of phonetics, and other disciplines, suggests that an investigation of rhythm is needed which (i) focuses on listeners’ perception, (ii) acknowledges the role of several acoustic cues, and (iii) explores whether the relative significance of these cues differs between languages. This thesis, the originality of which derives from its adoption of these three perspectives combined, indicates new directions for progress. A series of perceptual experiments investigated the interaction of duration and f0 as perceptual cues to prosody in languages with different prosodic structures – Swiss German, Swiss French, and French (i.e. from France). The first experiment demonstrated that a dynamic f0 increases perceived syllable duration in contextually isolated pairs of monosyllables, for all three language groups. The second experiment found that dynamic f0 and increased duration interact as cues to rhythmic groups in series of monosyllabic digits and letters; the two cues were significantly more effective than one when heard simultaneously, but significantly less effective than one when heard in conflicting positions around the rhythmic-group boundary location, and native language influenced whether f0 or duration was the more effective cue.
These two experiments laid the basis for the third, which directly addressed rhythm. Listeners were asked to judge the rhythmicality of sentences with systematic duration and f0 manipulations; the results provide evidence that duration and f0 are interdependent cues in rhythm perception, and that the weighting of each cue varies in different languages. A fourth experiment applied the perceptual results to production data, to develop a rhythm metric which captures the multi-dimensional and language-specific nature of perceived rhythm in speech production. These findings have the important implication that if future phonetic research on rhythm follows these new perspectives, it may circumvent the impasse and advance our knowledge and model of speech rhythm.This work was funded by an AHRC doctoral award to the author
Recommended from our members
Rhythm in the speech of a person with right hemisphere damage: Applying the pairwise variability index
Although several aspects of prosody have been studied in speakers with right hemisphere damage (RHD), rhythm remains largely uninvestigated. This study compares the rhythm of an Australian English speaker with right hemisphere damage (due to a stroke, but with no concomitant dysarthria) to that of a neurologically unimpaired individual. The speakers' rhythm is compared using the pairwise variability index (PVI) which allows for an acoustic characterization of rhythm by comparing the duration of successive vocalic and intervocalic intervals. A sample of speech from a structured interview between a speech and language therapist and each participant was analysed. Previous research has shown that speakers with RHD may have difficulties with intonation production, and therefore it was hypothesized that there may also be rhythmic disturbance. Results show that the neurologically normal control uses a similar rhythm to that reported for British English (there are no previous studies available for Australian English), whilst the speaker with RHD produces speech with a less strongly stress-timed rhythm. This finding was statistically significant for the intervocalic intervals measured (t(8) = 4.7, p < .01), and suggests that some aspects of prosody may be right lateralized for this speaker. The findings are discussed in relation to previous findings of dysprosody in RHD populations, and in relation to syllable-timed speech of people with other neurological conditions
The phonetics of second language learning and bilingualism
This chapter provides an overview of major theories and findings in the field of second language (L2) phonetics and phonology. Four main conceptual frameworks are discussed and compared: the Perceptual Assimilation Model-L2, the Native Language Magnet Theory, the Automatic Selection Perception Model, and the Speech Learning Model. These frameworks differ in terms of their empirical focus, including the type of learner (e.g., beginner vs. advanced) and target modality (e.g., perception vs. production), and in terms of their theoretical assumptions, such as the basic unit or window of analysis that is relevant (e.g., articulatory gestures, position-specific allophones). Despite the divergences among these theories, three recurring themes emerge from the literature reviewed. First, the learning of a target L2 structure (segment, prosodic pattern, etc.) is influenced by phonetic and/or phonological similarity to structures in the native language (L1). In particular, L1-L2 similarity exists at multiple levels and does not necessarily benefit L2 outcomes. Second, the role played by certain factors, such as acoustic phonetic similarity between close L1 and L2 sounds, changes over the course of learning, such that advanced learners may differ from novice learners with respect to the effect of a specific variable on observed L2 behavior. Third, the connection between L2 perception and production (insofar as the two are hypothesized to be linked) differs significantly from the perception-production links observed in L1 acquisition. In service of elucidating the predictive differences among these theories, this contribution discusses studies that have investigated L2 perception and/or production primarily at a segmental level. In addition to summarizing the areas in which there is broad consensus, the chapter points out a number of questions which remain a source of debate in the field today.https://drive.google.com/open?id=1uHX9K99Bl31vMZNRWL-YmU7O2p1tG2wHhttps://drive.google.com/open?id=1uHX9K99Bl31vMZNRWL-YmU7O2p1tG2wHhttps://drive.google.com/open?id=1uHX9K99Bl31vMZNRWL-YmU7O2p1tG2wHAccepted manuscriptAccepted manuscrip
- …