18,498 research outputs found
Video Question Answering via Attribute-Augmented Attention Network Learning
Video Question Answering is a challenging problem in visual information
retrieval, which provides the answer to the referenced video content according
to the question. However, the existing visual question answering approaches
mainly tackle the problem of static image question, which may be ineffectively
for video question answering due to the insufficiency of modeling the temporal
dynamics of video contents. In this paper, we study the problem of video
question answering by modeling its temporal dynamics with frame-level attention
mechanism. We propose the attribute-augmented attention network learning
framework that enables the joint frame-level attribute detection and unified
video representation learning for video question answering. We then incorporate
the multi-step reasoning process for our proposed attention network to further
improve the performance. We construct a large-scale video question answering
dataset. We conduct the experiments on both multiple-choice and open-ended
video question answering tasks to show the effectiveness of the proposed
method.Comment: Accepted for SIGIR 201
Federated Recommendation with Additive Personalization
Building recommendation systems via federated learning (FL) is a new emerging
challenge for advancing next-generation Internet service and privacy
protection. Existing approaches train shared item embedding by FL while keeping
the user embedding private on client side. However, item embedding identical
for all clients cannot capture users' individual differences on perceiving the
same item and thus leads to poor personalization. Moreover, dense item
embedding in FL results in expensive communication cost and latency. To address
these challenges, we propose Federated Recommendation with Additive
Personalization (FedRAP), which learns a global view of items via FL and a
personalized view locally on each user. FedRAP enforces sparsity of the global
view to save FL's communication cost and encourages difference between the two
views through regularization. We propose an effective curriculum to learn the
local and global views progressively with increasing regularization weights. To
produce recommendations for an user, FedRAP adds the two views together to
obtain a personalized item embedding. FedRAP achieves the best performance in
FL setting on multiple benchmarks. It outperforms recent federated
recommendation methods and several ablation study baselines.Comment: 9 pages, conferenc
Exact Embeddings of JT Gravity in Strings and M-theory
We show that two-dimensional JT gravity, the holographic dual of the IR fixed
point of the SYK model, can be obtained from the consistent Kaluza-Klein
reduction of a class of EMD theories in general dimensions. For , ,
the EMD theories can be themselves embedded in supergravities. These exact
embeddings provide the holographic duals in the framework of strings and
M-theory. We find that a class of JT gravity solutions can be lifted to become
time-dependent charged extremal black holes. They can be further lifted, for
example, to describe the D1/D5-branes where the worldsheet is the Milne
universe, rather than the typical Minkowski spacetime.Comment: Latex, 25 pages, typos corrected, further discussions added, appeared
in Eur.Phys.J.
- …