353 research outputs found

    Immediate and Distracted Imitation in Second-Language Speech: Unreleased Plosives in English

    Get PDF
    The paper investigates immediate and distracted imitation in second-language speech using unreleased plosives. Unreleased plosives are fairly frequently found in English sequences of two stops. Polish, on the other hand, is characterised by a significant rate of releases in such sequences. This cross-linguistic difference served as material to look into how and to what extent non-native properties of sounds can be produced in immediate and distracted imitation. Thirteen native speakers of Polish first read and then imitated sequences of words with two stops straddling the word boundary. Stimuli for imitation had no release of the first stop. The results revealed that (1) a non-native feature such as the lack of the release burst can be imitated; (2) distracting imitation impedes imitative performance; (3) the type of a sequence interacts with the magnitude of an imitative effec

    Converging toward a common speech code: imitative and perceptuo-motor recalibration processes in speech production

    Get PDF
    International audienceAuditory and somatosensory systems play a key role in speech motor control. In the act of speaking, segmental speech movements are programmed to reach phonemic sensory goals, which in turn are used to estimate actual sensory feedback in order to further control production. The adult's tendency to automatically imitate a number of acoustic-phonetic characteristics in another speaker's speech however suggests that speech production not only relies on the intended phonemic sensory goals and actual sensory feedback but also on the processing of external speech inputs. These online adaptive changes in speech production, or phonetic convergence effects, are thought to facilitate conversational exchange by contributing to setting a common perceptuo-motor ground between the speaker and the listener. In line with previous studies on phonetic convergence, we here demonstrate, in a non-interactive situation of communication, online unintentional and voluntary imitative changes in relevant acoustic features of acoustic vowel targets (fundamental and first formant frequencies) during speech production and imitation. In addition, perceptuo-motor recalibration processes, or after-effects, occurred not only after vowel production and imitation but also after auditory categorization of the acoustic vowel targets. Altogether, these findings demonstrate adaptive plasticity of phonemic sensory-motor goals and suggest that, apart from sensory-motor knowledge, speech production continuously draws on perceptual learning from the external speech environment

    Assessing objective characterizations of phonetic convergence

    No full text
    International audienceThis paper focuses on the study of the convergence between characteristics of speech segments- i.e. spectral characteristics of speech sounds - during live interactions between speaking dyads. The interaction data has been collected using an original verbal game called 'verbal dominoes' that provides a dense sampling of the acoustic spaces of the interlocutors. Two methods for characterizing phonetic convergence are here compared. The first one is based on a fine-grained analysis of the spectra of central frames of vowels (LDA) while the second one uses a more global speaker recognition technique (LLR). We show that convergence rates calculated by the two techniques correlate as the number of dominoes increases and that the LDA method well resists to the decrease of training and test material. We finally comment the impact of several factors on the computed convergence rates, i.e. interlocutors' familiarity and sex pairs

    Phonetic convergence in temporal organization during shadowed speech

    Get PDF
    The goal of this study was to examine phonetic convergence (when one imitates the phonetic characteristics of another talker) in various measures of temporal organization during shadowed speech across different American English dialects. Participants from the Northern and Midland American English dialect regions, plus several "mobile" talkers, were asked to read 72 sentences to establish a baseline for temporal organization, and then to repeat the same 72 sentences after Northern, Midland, and Southern model talkers. Measures of temporal organization (i.e., %V, ΔC, ΔV, rPVI-C, and nPVI-V) were calculated for the read sentences, shadowed sentences, and model talker sentences. Statistical analysis of the differences in distance between the model talker sentences and the shadowers' read and shadowed sentences, respectively, revealed significant convergence by all three shadowing groups toward the model dialects for ΔV, and significant divergence by Mobile talkers away from the model talkers for nPVI-V. Though the result of divergence by Mobile talkers was unexpected, both results provide evidence that support previous studies, which claim that social perception is a large contributing factor in convergence and divergence. These results are also consistent with previous findings demonstrating variation across dialects in temporal organization and, in addition, provide evidence for variation across dialects in convergence in temporal organization.The Ohio State University College of Arts and Sciences Undergraduate Research ScholarshipNo embargoAcademic Major: Linguistic

    The Influence of Socioindexical Information on the Speech Perception-Production Link: Evidence from a Shadowing Task

    Get PDF
    The body of work on speech perception demonstrates the ability of listeners to utilize both visual and acoustic information in their processing of a given speech signal. More recent studies have established that listeners are sensitive to cues in both these modalities which inform their perception of a speaker\u27s identity in parallel with the linguistic message, but the relationship between social information in perception and production together is unclear. This study reports the results of an experiment designed to test the hypothesis that expectations about a speakers identity is able to influence a listener\u27s perception and production of speech in tandem. The shadowing task addresses the degree to which listeners faithfully reproduce L2 accented English when presented with four ethnically distinct faces in congruent and incongruent auditory-visual pairs in a within-subject design. Analyses of the degree of acoustic similarity to model talkers in speakers\u27 imitations revealed a slight average trend toward convergence on vowel spectra, vowel duration, and average fundamental frequency. Significant predictors of the degree of change in a speaker\u27s production were shown to be the vowel quality measured and the voice presented, but these predictors were agnostic with respect to whether these changes represented phonetic convergence or divergence. The variance in degree of similarity suggests that speakers\u27 convergence is subject to linguistic selectivity, but it is less clear the role social selectivity plays when presented with unfamiliar varieties. Overall these findings are consistent with exemplar models which consider the inherent coupling of individuals\u27 speech perception and production, but that the visual stimuli had no significant effect on these analyses may be reflective of listeners\u27 adaptive processes during perception of L2-accented speech

    Phonetic convergence in the speech of Polish learners of English

    Get PDF
    This dissertation examines variability in the phonetic performance of L2 users of English and concentrates on speech convergence as a result of exposure to native and non-native pronunciation. The term speech convergence refers to a process during which speakers adapt their linguistic behaviour according to who they are talking or listening to. Previous studies show that the phenomenon may take place both in a speaker’s L1 (e.g. Giles, 1973; Coupland, 1984; Gregory and Webster, 1996; Pardo, 2006; Babel; 2010) and L2 (e.g. Beebe, 1977; Berkowitz, 1986; Lewandowski, 2012; Rojczyk, 2013; Trofimovich and Kennedy, 2014). Speech convergence can be subdivided into three types of linguistic behaviour: convergence (the process of making one’s speech more similar to that of another person), divergence (the process of moving away from the speech of another person) and maintenance (the process of maintaining one’s default linguistic behaviour in spite of exposure to the speech of another person). The dissertation consists of four chapters; the first two provide theoretical background, the next two describe the study and its findings. Chapter One is concerned with previous research on speech convergence. The chapter reviews the methodology and approaches used in previous work and discusses the range of factors that may affect convergence strategies. Chapter Two provides an overview of relevant studies in the field of L2 phonetics. It describes the structure and formation of the L2 sound system and the numerous socialpsychological, linguistic and psycholinguistic variables that may influence L2 phonetic performance. Chapter Three describes the study on speech convergence in the pronunciation of Polish learners of English, i.e. the aims, hypotheses, methodology and results. In Chapter Four, the results of the study on phonetic convergence in the speech of Polish learners of English are analysed and discussed. The phenomenon of speech convergence has been explored under different names and with the use of various frameworks and methodological procedures. Some researchers refer to the process as accommodation and investigate it by analysing spontaneous conversational data (e.g. Giles, 1973; Bourhis and Giles, 1977; Coupland, 1984; Gregory and Webster, 1996). Other researches use the term imitation and examine the phenomenon in socially minimal, laboratory-based settings (e.g. Goldinger, 1998; Schokley et al., 2004; Delvaux and Soquet, 2007; Nielsen, 2011). Irrespective of terminological and methodological differences, the results of previous studies on phonetic convergence indicate that the process is conditioned by 171 a variety of linguistic (e.g. Mitterer and Ernestus, 2008; Babel, 2009; Brouwer et al., 2010; Nielsen, 2011) and social-psychological factors (Giles, 1973; Bilous i Krauss, 1988; Gregory and Webster, 1996; Pardo, 2006; Babel, 2009, Yu et al., 2013) Research on L2 acquisition and non-native pronunciation shows that the development of the L2 sound system is a complex and dynamic process. It has been argued that the productions of L2 users are generated by interlanguage (IL), an independent linguistic system that encompasses elements of the learner’s L1 and L2 but does not correspond exactly to either the NL or the TL (e.g. Selinker, 1972; 1992). Importantly, previous findings indicate that the phonetic performance of non-native speakers is influenced not only by their L1 and L2 sound systems but also by a range of various psycholinguistic (e.g. Flege, 1987; Flege et al., 2003) and social-psychological factors (e.g. Taylor et al., 1971; Zuengler, 1982; Gatbonton et al., 2011). The process of adapting one’s pronunciation as a result of exposure to another person’s speech has been detected in the productions of L2 users (e.g. Beebe, 1977; Berkowitz, 1986; Lewandowski, 2012; Rojczyk, 2013; Trofimovich and Kennedy, 2014). Similarly as in the case of L1 speech convergence, previous studies show that the magnitude of L2 speech convergence may depend upon a variety of social-psychological and linguistic variables. An interesting aspect of L2 phonetic convergence that has not yet been thoroughly explored is the comparison of pronunciation shifts upon exposure to the speech of native speakers of the TL as compared with pronunciation shifts upon exposure to the speech of other learners. The aim of the study was to address this issue by investigating and comparing L2 convergence strategies upon exposure to native and non-native pronunciation. The study concentrated on the phonetic performance of advanced Polish learners of English, who were exposed to two pronunciation varieties: Polish-accented English and native English. The participants were 38 native speakers of Polish, majoring in English Studies and recruited from the University of Lodz. The subjects listened to pre-recorded productions provided by two model talkers/interlocutors: a native speaker of Standard Southern British English and a native speaker of Polish (a qualified phonetician imitating a heavy Polish accent in English). The phonetic variables under investigation were the following: aspiration in word-initial /p t k/, pre-voicing in word-initial /b d g/, vowel duration as a cue for consonant voicing in English /æ e ɪ iː/. The experimental procedure consisted of several phases. First, the informants were instructed to identify the target words in an auditory naming task (baseline condition). Next, they were asked to listen to pre-recorded English words provided by the two 172 model talkers/interlocutors and to identify the words by saying them out loud (imitation condition). Finally, the subjects were required to read the target words for the two model talkers/interlocutors to listen to at a later time (accommodation condition). Following the production stage of the experiment, the participants completed a questionnaire whose purpose was to gauge attitudes towards native and foreign-accented English. Three hypotheses were formulated to be tested in the course of the study. Hypothesis 1 predicted that convergence strategies following exposure to native and non-native English will vary as a function of model talker/interlocutor. Hypothesis 2 predicted that convergence strategies following exposure to native and non-native English will be affected by the subjects’ attitudes towards native and Polish-accented English. Hypothesis 3 predicted that convergence strategies following exposure to native and non-native English will differ as a function of phonetic context (place of articulation and vowel category). Acoustic and statistical analysis of the data revealed that the subjects modified their linguistic behaviour following exposure to the speech of the model talkers/interlocutors, which corroborates the claim that L2 speech convergence phenomena are present in nonnative pronunciation. Hypothesis 1 was partially supported by the results of the study. It was found that speech behaviour following exposure to native and non-native English varied as a function of model talker/interlocutor in all but two instances (accommodation on pre-voicing and imitation of vowel duration). The results suggests that when using a second language, speakers may use different convergence strategies depending on the native/non-native status of the model talker or interlocutor. Hypothesis 2 was partially supported by the data. The results indicate that a strong preference for target-like pronunciation may prompt learners to converge towards native speech and diverge from foreign-accented speech. However, the factor does not seem to operate if a learner has not succeeded in mastering a given TL pronunciation feature, i.e. the impact of attitudinal factors on the magnitude of convergence in non-native pronunciation appears to be conditioned by the stage of acquisition of a given TL phonetic feature. Hypothesis 3 was not borne out the results obtained in the study. It was found that convergence strategies following exposure to native and non-native English did not vary depending on phonetic context. Overall, the findings of the study provide support for the claim that the process of speech convergence operates in L2 pronunciation and imply that certain social-psychological and psycholinguistic factors may have an impact on learners’ convergence strategies

    Acomodación fonética durante las interacciones conversacionales: una visión general

    Get PDF
    During conversational interactions such as tutoring, instruction-giving tasks, verbal negotiations, or just talking with friends, interlocutors’ behaviors experience a series of changes due to the characteristics of their counterpart and to the interaction itself. These changes are pervasively present in every social interaction, and most of them occur in the sounds and rhythms of our speech, which is known as acoustic-prosodic accommodation, or simply phonetic accommodation. The consequences, linguistic and social constraints, and underlying cognitive mechanisms of phonetic accommodation have been studied for at least 50 years, due to the importance of the phenomenon to several disciplines such as linguistics, psychology, and sociology. Based on the analysis and synthesis of the existing empirical research literature, in this paper we present a structured and comprehensive review of the qualities, functions, onto- and phylogenetic development, and modalities of phonetic accommodation.Durante las interacciones conversacionales como dar una tutoría, dar instrucciones, las negociaciones verbales, o simplemente hablar con amigos, los comportamientos de las personas experimentan una serie de cambios debido a las características de su interlocutor y a la interacción en sí. Estos cambios están presentes en cada interacción social, y la mayoría de ellos ocurre en los sonidos y ritmos del habla, lo cual se conoce como acomodación acústico-prosódica, o simplemente acomodación fonética. Las consecuencias, las limitaciones lingüísticas y sociales, y los mecanismos cognitivos subyacentes a la acomodación fonética se han estudiado durante al menos 50 años, debido a la importancia del fenómeno para varias disciplinas como la lingüística, la psicología, y la sociología. A partir del análisis y síntesis de la literatura de investigación empírica existente, en este artículo presentamos una revisión estructurada y exhaustiva de las cualidades, funciones, desarrollo onto- y filogenético, y modalidades de la acomodación fonética

    Investigating inter-speaker convergence through phonetic microvariation in paired data

    Get PDF
    Talk given at the Glasgow University Laboratory of Phonetic

    The listening talker: A review of human and algorithmic context-induced modifications of speech

    Get PDF
    International audienceSpeech output technology is finding widespread application, including in scenarios where intelligibility might be compromised - at least for some listeners - by adverse conditions. Unlike most current algorithms, talkers continually adapt their speech patterns as a response to the immediate context of spoken communication, where the type of interlocutor and the environment are the dominant situational factors influencing speech production. Observations of talker behaviour can motivate the design of more robust speech output algorithms. Starting with a listener-oriented categorisation of possible goals for speech modification, this review article summarises the extensive set of behavioural findings related to human speech modification, identifies which factors appear to be beneficial, and goes on to examine previous computational attempts to improve intelligibility in noise. The review concludes by tabulating 46 speech modifications, many of which have yet to be perceptually or algorithmically evaluated. Consequently, the review provides a roadmap for future work in improving the robustness of speech output
    corecore