Search CORE

21,673 research outputs found

The BURCHAK corpus: a Challenge Data Set for Interactive Learning of Visually Grounded Word Meanings

Author: Eshghi Arash
Lemon Oliver Joseph
Mills Gregory
Yu Yanchao
Publication venue: 'Association for Computational Linguistics (ACL)'
Publication date: 01/01/2017
Field of study

We motivate and describe a new freely available human-human dialogue dataset for interactive learning of visually grounded word meanings through ostensive definition by a tutor to a learner. The data has been collected using a novel, character-by-character variant of the DiET chat tool (Healey et al., 2003; Mills and Healey, submitted) with a novel task, where a Learner needs to learn invented visual attribute words (such as " burchak " for square) from a tutor. As such, the text-based interactions closely resemble face-to-face conversation and thus contain many of the linguistic phenomena encountered in natural, spontaneous dialogue. These include self-and other-correction, mid-sentence continuations, interruptions, overlaps, fillers, and hedges. We also present a generic n-gram framework for building user (i.e. tutor) simulations from this type of incremental data, which is freely available to researchers. We show that the simulations produce outputs that are similar to the original data (e.g. 78% turn match similarity). Finally, we train and evaluate a Reinforcement Learning dialogue control agent for learning visually grounded word meanings, trained from the BURCHAK corpus. The learned policy shows comparable performance to a rule-based system built previously.Comment: 10 pages, THE 6TH WORKSHOP ON VISION AND LANGUAGE (VL'17

arXiv.org e-Print Archive

Heriot Watt Pure

Crossref

Recommended from our members

Spring School on Language, Music, and Cognition: Organizing Events in Time

Author: Arbib M. A.
Arbib M. A.
Bernstein L.
Carnap R.
Chomsky N.
Chomsky N.
Chomsky N.
Chomsky N.
Chomsky N.
Cross I.
Cross I.
Dahlhaus C.
Fitch W. T.
Fitch W. T.
Gallistel C. R.
Hawkins S.
Hellbernd N.
Hughes D. W.
Jackendoff R.
Lerdahl F.
Levine J.
McQueen Tokita A.
Patel A. D.
Patel A. D.
Persici V.
Ravignani A.
Rebuschat P.
Rothacker E.
Steedman M.
Sundberg J.
Vogeley K.
Wallin N. L.
Wittgenstein L
Publication venue: 'SAGE Publications'
Publication date: 01/01/2018
Field of study

The interdisciplinary spring school “Language, music, and cognition: Organizing events in time” was held from February 26 to March 2, 2018 at the Institute of Musicology of the University of Cologne. Language, speech, and music as events in time were explored from different perspectives including evolutionary biology, social cognition, developmental psychology, cognitive neuroscience of speech, language, and communication, as well as computational and biological approaches to language and music. There were 10 lectures, 4 workshops, and 1 student poster session. Overall, the spring school investigated language and music as neurocognitive systems and focused on a mechanistic approach exploring the neural substrates underlying musical, linguistic, social, and emotional processes and behaviors. In particular, researchers approached questions concerning cognitive processes, computational procedures, and neural mechanisms underlying the temporal organization of language and music, mainly from two perspectives: one was concerned with syntax or structural representations of language and music as neurocognitive systems (i.e., an intrapersonal perspective), while the other emphasized social interaction and emotions in their communicative function (i.e., an interpersonal perspective). The spring school not only acted as a platform for knowledge transfer and exchange but also generated a number of important research questions as challenges for future investigations

City Research Online

Crossref

Kölner UniversitätsPublikationsServer

Directory of Open Access Journals

Publications at Bielefeld University

MPG.PuRe

Computational and Robotic Models of Early Language Development: A Review

Author: Kachergis George
Oudeyer Pierre-Yves
Schueller William
Publication venue
Publication date: 25/03/2019
Field of study

We review computational and robotics models of early language learning and development. We first explain why and how these models are used to understand better how children learn language. We argue that they provide concrete theories of language learning as a complex dynamic system, complementing traditional methods in psychology and linguistics. We review different modeling formalisms, grounded in techniques from machine learning and artificial intelligence such as Bayesian and neural network approaches. We then discuss their role in understanding several key mechanisms of language development: cross-situational statistical learning, embodiment, situated social interaction, intrinsically motivated learning, and cultural evolution. We conclude by discussing future challenges for research, including modeling of large-scale empirical data about language acquisition in real-world environments. Keywords: Early language learning, Computational and robotic models, machine learning, development, embodiment, social interaction, intrinsic motivation, self-organization, dynamical systems, complexity.Comment: to appear in International Handbook on Language Development, ed. J. Horst and J. von Koss Torkildsen, Routledg

arXiv.org e-Print Archive

INRIA a CCSD electronic archive server

Empathic Agent Technology (EAT)

Author: Broek Egon L. van den
Publication venue: Brooklyn College
Publication date: 01/01/2005
Field of study

A new view on empathic agents is introduced, named: Empathic Agent Technology (EAT). It incorporates a speech analysis, which provides an indication for the amount of tension present in people. It is founded on an indirect physiological measure for the amount of experienced stress, defined as the variability of the fundamental frequency of the human voice. A thorough review of literature is provided on which the EAT is founded. In addition, the complete processing line of this measure is introduced. Hence, the first generally applicable, completely automated technique is introduced that enables the development of truly empathic agents

CiteSeerX

VU Research Portal

University of Twente Research Information

Spontaneous eye movements during passive spoken language comprehension reflect grammatical processing

Author: Ardell Dr. David
Huette Ms. Stephanie
Matlock Dr. Teenie
Spivey Dr. Michael
Winter Mr. Bodo
Publication venue
Publication date: 30/01/2013
Field of study

Language is tightly connected to sensory and motor systems. Recent research using eye- tracking typically relies on constrained visual contexts, viewing a small array of objects on a computer screen. Some critiques of embodiment ask if people simply match their simulations to the pictures being presented. This study compared the comprehension of verbs with two different grammatical forms: the past progressive form (e.g., was walking), which emphasizes the ongoing nature of actions, and the simple past (e.g., walked), which emphasizes the end-state of an action. The results showed that the distribution and timing of eye movements mirrors the underlying conceptual structure of this linguistic difference in the absence of any visual stimuli. Thus, eye movement data suggest that visual inputs are unnecessary to solicit perceptual simulations

CogPrints Cognitive Sciences Eprint Archive