Search CORE

7,553 research outputs found

Omnidirectional Information Gathering for Knowledge Transfer-based Audio-Visual Navigation

Author: Chen Jinyu
Li Hongsheng
Liu Si
Wang Wenguan
Yang Yi
Publication venue
Publication date: 20/08/2023
Field of study

Audio-visual navigation is an audio-targeted wayfinding task where a robot agent is entailed to travel a never-before-seen 3D environment towards the sounding source. In this article, we present ORAN, an omnidirectional audio-visual navigator based on cross-task navigation skill transfer. In particular, ORAN sharpens its two basic abilities for a such challenging task, namely wayfinding and audio-visual information gathering. First, ORAN is trained with a confidence-aware cross-task policy distillation (CCPD) strategy. CCPD transfers the fundamental, point-to-point wayfinding skill that is well trained on the large-scale PointGoal task to ORAN, so as to help ORAN to better master audio-visual navigation with far fewer training samples. To improve the efficiency of knowledge transfer and address the domain gap, CCPD is made to be adaptive to the decision confidence of the teacher policy. Second, ORAN is equipped with an omnidirectional information gathering (OIG) mechanism, i.e., gleaning visual-acoustic observations from different directions before decision-making. As a result, ORAN yields more robust navigation behaviour. Taking CCPD and OIG together, ORAN significantly outperforms previous competitors. After the model ensemble, we got 1st in Soundspaces Challenge 2022, improving SPL and SR by 53% and 35% relatively.Comment: ICCV 202

arXiv.org e-Print Archive

Differentiable Multi-Granularity Human Representation Learning for Instance-Aware Human Semantic Parsing

Author: Liu Si
Van Gool Luc
Wang Wenguan
Yang Yi
Zhou Tianfei
Publication venue
Publication date: 01/01/2021
Field of study

To address the challenging task of instance-aware human part parsing, a new bottom-up regime is proposed to learn category-level human semantic segmentation as well as multi-person pose estimation in a joint and end-to-end manner. It is a compact, efficient and powerful framework that exploits structural information over different human granularities and eases the difficulty of person partitioning. Specifically, a dense-to-sparse projection field, which allows explicitly associating dense human semantics with sparse keypoints, is learnt and progressively improved over the network feature pyramid for robustness. Then, the difficult pixel grouping problem is cast as an easier, multi-person joint assembling task. By formulating joint association as maximum-weight bipartite matching, a differentiable solution is developed to exploit projected gradient descent and Dykstra's cyclic projection algorithm. This makes our method end-to-end trainable and allows back-propagating the grouping error to directly supervise multi-granularity human representation learning. This is distinguished from current bottom-up human parsers or pose estimators which require sophisticated post-processing or heuristic greedy algorithms. Experiments on three instance-aware human parsing datasets show that our model outperforms other bottom-up alternatives with much more efficient inference.Comment: CVPR 2021 (Oral). Code: https://github.com/tfzhou/MG-HumanParsin

arXiv.org e-Print Archive

Repository for Publications and Research Data

OPUS - University of Technology Sydney

Optimizing Performance of Hadoop with Parameter Tuning

Author: Cheng Chen
Guang-Rui Li
Si-Yu Liu
Xiang Chen
Yi Liang
Publication venue: 'EDP Sciences'
Publication date: 01/01/2017
Field of study

Optimizing Hadoop with the parameter tuning is an effective way to greatly improve the performance, but it usually costs too much time to identify the optimal parameters configuration because there are many parameters. Users are always blindly adjust too many parameters and are sometimes confused about which one could be changed at a higher-priority. To make optimization easier, classifying the parameter based on different applications will be helpful. In this paper, we will introduce a method that can classify these parameters in order that users can optimize performance more quickly and effectively for different applications

Crossref

EDP Sciences OAI-PMH repository (1.2.0)

The linear and nonlinear Jaynes-Cummings model for the multiphoton transition

Author: Xiao-Jing Liu
Jing-Bin Lu
Si-Qi Zhang
Ji-Ping Liu
Hong Li
Yu Liang
Ji Ma
Yi-Jiao Weng
Qi-Rui Zhang
Han Liu
Xiao-Ru Zhang
Xiang-Yao Wu
Publication venue
Publication date: 01/01/2017
Field of study

With the Jaynes-Cummings model, we have studied the atom and light field quantum entanglement of multiphoton transition, and researched the effect of initial state superposition coefficient

C_{1}

, the transition photon number

N

, the quantum discord

\delta

and the nonlinear coefficient

\chi

on the quantum entanglement degrees. We have given the quantum entanglement degrees curves with time evolution, and obtained some results, which should have been used in quantum computing and quantum information.Comment: arXiv admin note: text overlap with arXiv:1404.0821, arXiv:1205.0979 by other author

arXiv.org e-Print Archive

Crossref

Biblioteca Digital de la Comunidad de Madrid