375 research outputs found
An Affect-Rich Neural Conversational Model with Biased Attention and Weighted Cross-Entropy Loss
Affect conveys important implicit information in human communication. Having
the capability to correctly express affect during human-machine conversations
is one of the major milestones in artificial intelligence. In recent years,
extensive research on open-domain neural conversational models has been
conducted. However, embedding affect into such models is still under explored.
In this paper, we propose an end-to-end affect-rich open-domain neural
conversational model that produces responses not only appropriate in syntax and
semantics, but also with rich affect. Our model extends the Seq2Seq model and
adopts VAD (Valence, Arousal and Dominance) affective notations to embed each
word with affects. In addition, our model considers the effect of negators and
intensifiers via a novel affective attention mechanism, which biases attention
towards affect-rich words in input sentences. Lastly, we train our model with
an affect-incorporated objective function to encourage the generation of
affect-rich words in the output responses. Evaluations based on both perplexity
and human evaluations show that our model outperforms the state-of-the-art
baseline model of comparable size in producing natural and affect-rich
responses.Comment: AAAI-1
EEG-Based Emotion Recognition Using Regularized Graph Neural Networks
Electroencephalography (EEG) measures the neuronal activities in different
brain regions via electrodes. Many existing studies on EEG-based emotion
recognition do not fully exploit the topology of EEG channels. In this paper,
we propose a regularized graph neural network (RGNN) for EEG-based emotion
recognition. RGNN considers the biological topology among different brain
regions to capture both local and global relations among different EEG
channels. Specifically, we model the inter-channel relations in EEG signals via
an adjacency matrix in a graph neural network where the connection and
sparseness of the adjacency matrix are inspired by neuroscience theories of
human brain organization. In addition, we propose two regularizers, namely
node-wise domain adversarial training (NodeDAT) and emotion-aware distribution
learning (EmotionDL), to better handle cross-subject EEG variations and noisy
labels, respectively. Extensive experiments on two public datasets, SEED and
SEED-IV, demonstrate the superior performance of our model than
state-of-the-art models in most experimental settings. Moreover, ablation
studies show that the proposed adjacency matrix and two regularizers contribute
consistent and significant gain to the performance of our RGNN model. Finally,
investigations on the neuronal activities reveal important brain regions and
inter-channel relations for EEG-based emotion recognition
TranSG: Transformer-Based Skeleton Graph Prototype Contrastive Learning with Structure-Trajectory Prompted Reconstruction for Person Re-Identification
Person re-identification (re-ID) via 3D skeleton data is an emerging topic
with prominent advantages. Existing methods usually design skeleton descriptors
with raw body joints or perform skeleton sequence representation learning.
However, they typically cannot concurrently model different body-component
relations, and rarely explore useful semantics from fine-grained
representations of body joints. In this paper, we propose a generic
Transformer-based Skeleton Graph prototype contrastive learning (TranSG)
approach with structure-trajectory prompted reconstruction to fully capture
skeletal relations and valuable spatial-temporal semantics from skeleton graphs
for person re-ID. Specifically, we first devise the Skeleton Graph Transformer
(SGT) to simultaneously learn body and motion relations within skeleton graphs,
so as to aggregate key correlative node features into graph representations.
Then, we propose the Graph Prototype Contrastive learning (GPC) to mine the
most typical graph features (graph prototypes) of each identity, and contrast
the inherent similarity between graph representations and different prototypes
from both skeleton and sequence levels to learn discriminative graph
representations. Last, a graph Structure-Trajectory Prompted Reconstruction
(STPR) mechanism is proposed to exploit the spatial and temporal contexts of
graph nodes to prompt skeleton graph reconstruction, which facilitates
capturing more valuable patterns and graph semantics for person re-ID.
Empirical evaluations demonstrate that TranSG significantly outperforms
existing state-of-the-art methods. We further show its generality under
different graph modeling, RGB-estimated skeletons, and unsupervised scenarios.Comment: Accepted by CVPR 2023. Codes are available at
https://github.com/Kali-Hac/TranSG. Supplemental material is included in the
conference proceeding
- …