A mentalist framework for linguistic and extralinguistic communication
We outline some components of a mentalist theory of human communicative competence. Communication in our species is an intentional and overt type of social interaction, based on each agent's capability of entertaining shared mental states and of acting so as to make certain mental states shared with the other. Communicative meaning is a matter of ascription: it is not an intrinsic property of a communicative act, but is instead created here and now as the shared construction of the interlocutors. We then discuss how communicative actions are superficially realized by our species, focusing in particular on the difference between linguistic and extralinguistic (that is, gestural) means of expression. Linguistic communication is the communicative use of a symbol system, whereas extralinguistic communication is the communicative use of a set of symbols. The difference turns out to be a matter of processing rather than of intrinsic structure.
Agents for educational games and simulations
This book consists mainly of revised papers that were presented at the Agents for Educational Games and Simulation (AEGS) workshop held on May 2, 2011, as part of the Autonomous Agents and MultiAgent Systems (AAMAS) conference in Taipei, Taiwan. The 12 full papers presented were carefully reviewed and selected from various submissions. The papers are organized in topical sections on middleware applications, dialogues and learning, adaption and convergence, and agent applications.
Conjunctive Visual and Auditory Development via Real-Time Dialogue
Human developmental learning is capable of dealing with the dynamic visual world, speech-based dialogue, and their complex real-time association. However, the architecture that realizes this for robotic cognitive development has not been reported in the past. This paper takes up this challenge. The proposed architecture does not require a strict coupling between visual and auditory stimuli. Two major operations contribute to the “abstraction” process: multiscale temporal priming and high-dimensional numeric abstraction through internal responses with reduced variance. As a basic principle of developmental learning, the programmer does not know the nature of the world events at the time of programming and, thus, hand-designed task-specific representation is not possible. We successfully tested the architecture on the SAIL robot under an unprecedented challenging multimodal interaction mode: use real-time speech dialogue as a teaching source for simultaneous and incremental visual learning and language acquisition, while the robot is viewing a dynamic world that contains a rotating object to which the dialogue is referring.
Emotion Recognition from Acted and Spontaneous Speech
This doctoral thesis deals with emotion recognition from speech signals. The thesis is divided into two main parts. The first part describes the proposed approaches for emotion recognition using two different multilingual databases of acted emotional speech. The main contributions of this part are a detailed analysis of a large set of acoustic features, new classification schemes for vocal emotion recognition such as “emotion coupling”, and a new method for mapping discrete emotions into a two-dimensional space. The second part of the thesis is devoted to emotion recognition using multilingual databases of spontaneous emotional speech, which are based on telephone recordings obtained from real call centers. The knowledge gained from experiments with emotion recognition from acted speech was exploited to design a new approach for classifying seven emotional states. The core of the proposed approach is a complex classification architecture based on the fusion of different systems. The thesis also examines the influence of the speaker’s emotional state on gender recognition performance and proposes a system for the automatic identification of successful phone calls in call centers by means of dialogue features.