4 research outputs found

    PRESENCE: A human-inspired architecture for speech-based human-machine interaction

    Recent years have seen steady improvements in the quality and performance of speech-based human-machine interaction driven by a significant convergence in the methods and techniques employed. However, the quantity of training data required to improve state-of-the-art systems seems to be growing exponentially and performance appears to be asymptotic to a level that may be inadequate for many real-world applications. This suggests that there may be a fundamental flaw in the underlying architecture of contemporary systems, as well as a failure to capitalize on the combinatorial properties of human spoken language. This paper addresses these issues and presents a novel architecture for speech-based human-machine interaction inspired by recent findings in the neurobiology of living systems. Called PRESENCE ("PREdictive SENsorimotor Control and Emulation"), this new architecture blurs the distinction between the core components of a traditional spoken language dialogue system and instead focuses on a recursive hierarchical feedback control structure. Cooperative and communicative behavior emerges as a by-product of an architecture that is founded on a model of interaction in which the system has in mind the needs and intentions of a user and a user has in mind the needs and intentions of the system.

    Finding Rhythm in Speech: A Response to Cummins

    This paper attempts to address three critical questions left unanswered by Cummins’ review: are rhythm and entrainment physical, perceptual or social phenomena, what are the underlying mechanisms, and what is their role in behaviour such as speech and music? These issues are addressed from the perspective of an engineer/computer-scientist/roboticist for whom modelling such behaviours within a computational framework not only provides an empirical methodology for validating theoretical claims, but also facilitates the construction of artificial devices that are capable of exhibiting/exploiting those behaviours in the context of human-machine interaction. The paper draws on insights from a range of different perspectives, and attempts to weave them together within a coherent theoretical framework. It is concluded that (i) rhythm and entrainment are phenomena that emerge naturally from the structural coupling within and between even simple systems, (ii) living systems have evolved very effective mechanisms for managing such behaviours for intrinsic and extrinsic gains, and (iii) the fields of energetics and information theory provide the appropriate tools for analysing and characterising such behaviour within a general theoretical framework. It is hoped that these insights will inspire future cross-disciplinary research in these areas, and lead to a deeper understanding of these fundamental behaviours.

    Gestures, Vocalizations, and Memory in Language Origins

    This article discusses the possible homologies between the human language networks and comparable auditory projection systems in the macaque brain, in an attempt to reconcile two existing views on language evolution: one that emphasizes hand control and gestures, and the other that emphasizes auditory–vocal mechanisms. The capacity for language is based on relatively well defined neural substrates whose rudiments have been traced in the non-human primate brain. At its core, this circuit constitutes an auditory–vocal sensorimotor circuit with two main components, a “ventral pathway” connecting anterior auditory regions with anterior ventrolateral prefrontal areas, and a “dorsal pathway” connecting auditory areas with parietal areas and with posterior ventrolateral prefrontal areas via the arcuate fasciculus and the superior longitudinal fasciculus. In humans, the dorsal circuit is especially important for phonological processing and phonological working memory, capacities that are critical for language acquisition and for complex syntax processing. In the macaque, the homolog of the dorsal circuit overlaps with an inferior parietal–premotor network for hand and gesture selection that is under voluntary control, while vocalizations are largely fixed and involuntary. The recruitment of the dorsal component for vocalization behavior in the human lineage, together with a direct cortical control of the subcortical vocalizing system, is proposed to represent a fundamental innovation in human evolution, generating an inflection point that permitted the explosion of vocal language and human communication. In this context, vocal communication and gesturing have a common history in primate communication.

    Spoken language processing: piecing together the puzzle

    Attempting to understand the fundamental mechanisms underlying spoken language processing, whether it is viewed as behaviour exhibited by human beings or as a faculty simulated by machines, is one of the greatest scientific challenges of our age. Despite tremendous achievements over the past 50 or so years, there is still a long way to go before we reach a comprehensive explanation of human spoken language behaviour and can create a technology with performance approaching or exceeding that of a human being. It is argued that progress is hampered by the fragmentation of the field across many different disciplines, coupled with a failure to create an integrated view of the fundamental mechanisms that underpin one organism's ability to communicate with another. This paper weaves together accounts from a wide variety of different disciplines concerned with the behaviour of living systems, many of them outside the normal realms of spoken language, and compiles them into a new model: PRESENCE (PREdictive SENsorimotor Control and Emulation). It is hoped that the results of this research will provide a sufficient glimpse into the future to give breath to a new generation of research into spoken language processing by mind or machine. © 2007 Elsevier B.V. All rights reserved.