935 research outputs found

    Fundamental frequency height as a resource for the management of overlap in talk-in-interaction.

    Overlapping talk is common in talk-in-interaction. Much of the previous research on this topic agrees that speaker overlaps can be either turn competitive or noncompetitive. An investigation of the differences in prosodic design between these two classes of overlaps can offer insight into how speakers use and orient to prosody as a resource for turn competition. In this paper, we investigate the role of fundamental frequency (F0) as a resource for turn competition in overlapping speech. Our methodological approach combines detailed conversation analysis of overlap instances with acoustic measurements of F0 in the overlapping sequence and in its local context. The analyses are based on a collection of overlap instances drawn from the ICSI Meeting corpus. We found that overlappers mark an overlapping incoming as competitive by raising F0 above their norm for turn beginnings, and by retaining this higher F0 until the point of overlap resolution. Overlappees may respond to these competitive incomings by returning competition, in which case they raise their F0 too. Our results thus provide instrumental support for earlier claims made on impressionistic evidence, namely that participants in talk-in-interaction systematically manipulate F0 height when competing for the turn.
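    The study's acoustic side rests on measuring F0 over short stretches of speech and comparing overlap F0 against a speaker's norm. As a loose illustration only (not the authors' pipeline; the frame length, search range, and synthetic input are all assumptions), a minimal autocorrelation-based F0 estimator in Python might look like:

    ```python
    import numpy as np

    def estimate_f0(frame, sr, fmin=75.0, fmax=400.0):
        """Estimate F0 (Hz) of one voiced frame via the autocorrelation peak."""
        frame = frame - frame.mean()
        # Autocorrelation for non-negative lags only.
        ac = np.correlate(frame, frame, mode="full")[len(frame) - 1:]
        # Restrict the peak search to lags inside the plausible pitch range.
        lo, hi = int(sr / fmax), int(sr / fmin)
        lag = lo + np.argmax(ac[lo:hi])
        return sr / lag

    # Synthetic 220 Hz tone standing in for a voiced 40 ms frame.
    sr = 16000
    t = np.arange(int(0.04 * sr)) / sr
    f0 = estimate_f0(np.sin(2 * np.pi * 220 * t), sr)
    ```

    A comparison like the one described in the abstract would then contrast such per-frame estimates in the overlap region with the same speaker's typical turn-beginning F0.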

    Catching the news: Processing strategies in listening to dialogs as measured by ERPs

    Background: The online segmentation of spoken single sentences has repeatedly been associated with a particular event-related brain potential. The brain response could be attributed to the perception of major prosodic boundaries and was termed the Closure Positive Shift (CPS). However, verbal exchange between humans is mostly realized in the form of cooperative dialogs rather than loose strings of single sentences. The present study investigated whether listeners use prosodic cues for structuring larger, contextually embedded utterances (i.e., dialogs) as they do in single-sentence processing. Methods: ERPs were recorded from listeners (n = 22) presented with question-answer dialogs in German. The prosody of the answer (target sentence) either matched or did not match the context provided by the question. Results: CPS responses to the target sentences were elicited, first, when listeners encountered information comprising 'novelties', i.e., information not mentioned in the preceding question but corrected between context and target; in this case it was irrelevant whether the actual prosody of the target sentence was congruent with its informational status. Second, when target sentences conveyed no novelties but only previously 'given', already known information, the structuring of the speech input was again driven by prosody: the CPS was then elicited when listeners perceived major prosodic boundaries, similarly to the processing of context-free single sentences. Conclusion: The study establishes a link between the on-line structuring of context-free utterances (single sentences) and context-embedded utterances (dialogs) as measured by ERPs. Moreover, the impact of prosodic phrasing and accentuation on the perception of spoken utterances at and beyond the sentence level is discussed.

    Disfluency in dialogue:an intentional signal from the speaker?

    Disfluency is a characteristic feature of spontaneous human speech, commonly seen as a consequence of problems with production. However, the question remains open as to why speakers are disfluent: Is it a mechanical by-product of planning difficulty, or do speakers use disfluency in dialogue to manage listeners' expectations? To address this question, we present two experiments investigating the production of disfluency in monologue and dialogue situations. Dialogue affected the linguistic choices made by participants, who aligned on referring expressions by choosing less frequent names for ambiguous images where those names had previously been mentioned. However, participants were no more disfluent in dialogue than in monologue situations, and the distribution of types of disfluency used remained constant. Our evidence rules out at least a straightforward interpretation of the view that disfluencies are an intentional signal in dialogue. © 2012 Psychonomic Society, Inc.

    Quantitative analysis of backchannels uttered by an interviewer during neuropsychological tests

    This paper examines in detail the backchannels uttered by a French professional interviewer during a neuropsychological test of verbal memories. These backchannels are short utterances such as oui, d'accord, uhm, etc. They are mainly produced here to encourage subjects to retrieve a set of words after their controlled encoding. We show that the choice of lexical items, their production rates, and their associated prosodic contours are influenced by the subjects' performance and conditioned by the protocol.

    A data-driven approach to spoken dialog segmentation

    In this paper, we present a statistical model for spoken dialog segmentation that decides the current phase of the dialog by means of an automatic classification process. We have applied our proposal to three practical conversational systems acting in different domains. The results of the evaluation show that it is possible to attain high accuracy rates in dialog segmentation when using different sources of information to represent the user input. Our results indicate how the proposed module can also improve dialog management by selecting better system answers. The statistical model, developed with human-machine dialog corpora, has been applied in one of our experiments to human-human conversations, and provides a good baseline as well as insights into the model's limitations.
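    The abstract does not specify the classifier, so as a purely illustrative sketch (the phase labels, training utterances, and scoring scheme below are all hypothetical), classifying the current dialog phase from the words of a user turn could be done with a tiny naive-Bayes-style model:

    ```python
    import math
    from collections import Counter, defaultdict

    def train(examples):
        """examples: list of (utterance, phase). Returns per-phase word counts."""
        counts, totals = defaultdict(Counter), Counter()
        for text, phase in examples:
            words = text.lower().split()
            counts[phase].update(words)
            totals[phase] += len(words)
        return counts, totals

    def classify(text, counts, totals):
        """Pick the phase maximizing a smoothed log-likelihood of the words."""
        vocab = {w for c in counts.values() for w in c}
        best, best_score = None, -math.inf
        for phase in counts:
            score = 0.0
            for w in text.lower().split():
                # Add-one smoothing so unseen words do not zero out a phase.
                p = (counts[phase][w] + 1) / (totals[phase] + len(vocab))
                score += math.log(p)
            if score > best_score:
                best, best_score = phase, score
        return best

    examples = [
        ("hello i would like some help", "opening"),
        ("good morning can you help me", "opening"),
        ("i want a ticket to boston tomorrow", "task"),
        ("book a flight to denver on friday", "task"),
        ("thanks goodbye", "closing"),
        ("that is all thank you bye", "closing"),
    ]
    counts, totals = train(examples)
    phase = classify("can you book a ticket to boston", counts, totals)
    ```

    A dialog manager could then condition its answer selection on the predicted phase, in the spirit of the module the abstract describes.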

    Unimodal and multimodal co‐activation in first encounters – a case study

    Proceedings of the 3rd Nordic Symposium on Multimodal Communication. Editors: Patrizia Paggio, Elisabeth Ahlsén, Jens Allwood, Kristiina Jokinen, Costanza Navarretta. NEALT Proceedings Series, Vol. 15 (2011), 1–9. © 2011 The editors and contributors. Published by Northern European Association for Language Technology (NEALT) http://omilia.uio.no/nealt . Electronically published at Tartu University Library (Estonia) http://hdl.handle.net/10062/22532

    Do (and say) as I say: Linguistic adaptation in human-computer dialogs

    © Theodora Koulouri, Stanislao Lauria, and Robert D. Macredie. This article has been made available through the Brunel Open Access Publishing Fund. There is strong research evidence showing that people naturally align to each other's vocabulary, sentence structure, and acoustic features in dialog, yet little is known about how the alignment mechanism operates in the interaction between users and computer systems, let alone how it may be exploited to improve the efficiency of the interaction. This article provides an account of lexical alignment in human–computer dialogs, based on empirical data collected in a simulated human–computer interaction scenario. The results indicate that alignment is present, resulting in the gradual reduction and stabilization of the vocabulary-in-use, and that it is also reciprocal. Further, the results suggest that when system and user errors occur, the development of alignment is temporarily disrupted and users tend to introduce novel words to the dialog. The results also indicate that alignment in human–computer interaction may have a strong strategic component and is used as a resource to compensate for less optimal (visually impoverished) interaction conditions. Moreover, lower alignment is associated with less successful interaction, as measured by user perceptions. The article distills the results of the study into design recommendations for human–computer dialog systems and uses them to outline a model of dialog management that supports and exploits alignment through mechanisms for in-use adaptation of the system's grammar and lexicon.
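    The article tracks the gradual convergence of the vocabulary-in-use; as a rough illustration only (not the authors' metric, and the example turns are invented), one crude way to quantify shared vocabulary between two interlocutors is the fraction of one side's words already used by the other:

    ```python
    def lexical_alignment(turns_a, turns_b):
        """Fraction of B's word types that also occur in A's turns.

        A crude overlap proxy: 1.0 means B reuses only words A has used.
        """
        vocab_a = {w for t in turns_a for w in t.lower().split()}
        vocab_b = {w for t in turns_b for w in t.lower().split()}
        return len(vocab_a & vocab_b) / len(vocab_b) if vocab_b else 0.0

    # Hypothetical instruction/confirmation pair from a simulated interaction.
    score = lexical_alignment(
        ["move the red box please"],
        ["ok picking up the red box"],
    )
    ```

    Computing such a score over successive windows of a dialog would show whether overlap grows as the vocabulary-in-use stabilizes, in the spirit of the reciprocal alignment the article reports.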