18,498 research outputs found

    Video Question Answering via Attribute-Augmented Attention Network Learning

    Full text link
    Video Question Answering is a challenging problem in visual information retrieval, which provides the answer to the referenced video content according to the question. However, the existing visual question answering approaches mainly tackle the problem of static image question, which may be ineffectively for video question answering due to the insufficiency of modeling the temporal dynamics of video contents. In this paper, we study the problem of video question answering by modeling its temporal dynamics with frame-level attention mechanism. We propose the attribute-augmented attention network learning framework that enables the joint frame-level attribute detection and unified video representation learning for video question answering. We then incorporate the multi-step reasoning process for our proposed attention network to further improve the performance. We construct a large-scale video question answering dataset. We conduct the experiments on both multiple-choice and open-ended video question answering tasks to show the effectiveness of the proposed method.Comment: Accepted for SIGIR 201

    Federated Recommendation with Additive Personalization

    Full text link
    Building recommendation systems via federated learning (FL) is a new emerging challenge for advancing next-generation Internet service and privacy protection. Existing approaches train shared item embedding by FL while keeping the user embedding private on client side. However, item embedding identical for all clients cannot capture users' individual differences on perceiving the same item and thus leads to poor personalization. Moreover, dense item embedding in FL results in expensive communication cost and latency. To address these challenges, we propose Federated Recommendation with Additive Personalization (FedRAP), which learns a global view of items via FL and a personalized view locally on each user. FedRAP enforces sparsity of the global view to save FL's communication cost and encourages difference between the two views through regularization. We propose an effective curriculum to learn the local and global views progressively with increasing regularization weights. To produce recommendations for an user, FedRAP adds the two views together to obtain a personalized item embedding. FedRAP achieves the best performance in FL setting on multiple benchmarks. It outperforms recent federated recommendation methods and several ablation study baselines.Comment: 9 pages, conferenc

    Exact Embeddings of JT Gravity in Strings and M-theory

    Full text link
    We show that two-dimensional JT gravity, the holographic dual of the IR fixed point of the SYK model, can be obtained from the consistent Kaluza-Klein reduction of a class of EMD theories in general DD dimensions. For D=4D=4, 55, the EMD theories can be themselves embedded in supergravities. These exact embeddings provide the holographic duals in the framework of strings and M-theory. We find that a class of JT gravity solutions can be lifted to become time-dependent charged extremal black holes. They can be further lifted, for example, to describe the D1/D5-branes where the worldsheet is the Milne universe, rather than the typical Minkowski spacetime.Comment: Latex, 25 pages, typos corrected, further discussions added, appeared in Eur.Phys.J.
    • …
    corecore