
66 research outputs found

Exploiting Point-Wise Attention in 6D Object Pose Estimation Based on Bidirectional Prediction

Authors: Wu Jun, Xiong Rong, Yang Yuhao, Zhang Guangjian
Publication date: 17/08/2023

Traditional estimation methods based on geometric registration exploit the CAD model only implicitly, which makes them dependent on observation quality and vulnerable to occlusion. To address this problem, the paper proposes a bidirectional correspondence prediction network with a point-wise attention-aware mechanism. This network not only requires the model points to predict the correspondence but also explicitly models the geometric similarities between observations and the model prior. Our key insight is that the correlations between each model point and scene point provide essential information for learning point-pair matches. To further tackle the correlation noise caused by feature distribution divergence, we design a simple but effective pseudo-siamese network to improve feature homogeneity. Experimental results on the public datasets LineMOD, YCB-Video, and Occ-LineMOD show that the proposed method achieves better performance than other state-of-the-art methods under the same evaluation criteria. Its robustness in estimating poses is greatly improved, especially in environments with severe occlusions.

arXiv.org e-Print Archive

Reliability Assurance for Deep Neural Network Architectures Against Numerical Defects

Authors: Li Linyi, Ren Luyao, Xie Tao, Xiong Yingfei, Zhang Yuhao
Publication date: 26/03/2023

With the widespread deployment of deep neural networks (DNNs), ensuring the reliability of DNN-based systems is of great importance. Serious reliability issues such as system failures can be caused by numerical defects, one of the most frequent defects in DNNs. To assure high reliability against numerical defects, we propose the RANUM approach, which includes novel techniques for three reliability assurance tasks: detection of potential numerical defects, confirmation of potential-defect feasibility, and suggestion of defect fixes. To the best of our knowledge, RANUM is the first approach that confirms potential-defect feasibility with failure-exhibiting tests and suggests fixes automatically. Extensive experiments on benchmarks of 63 real-world DNN architectures show that RANUM outperforms state-of-the-art approaches across the three reliability assurance tasks. In addition, when the RANUM-generated fixes are compared with developers' fixes on open-source projects, in 37 out of 40 cases the RANUM-generated fixes are equivalent to or even better than the human fixes.

Comment: To appear at the 45th International Conference on Software Engineering (ICSE 2023), camera-ready version.

arXiv.org e-Print Archive

Searching Collaborative Agents for Multi-plane Localization in 3D Ultrasound

Authors: Chen Chaoyu, Dou Haoran, Frangi Alejandro, Huang Xiaoqiong, Huang Yuhao, Li Rui, Luo Huanjia, Ni Dong, Qian Jikuan, Shi Wenlong, Xiong Yi, Yang Xin, Zhang Yuanji
Publication date: 01/01/2020

3D ultrasound (US) is widely used due to its rich diagnostic information, portability and low cost. Automated standard plane (SP) localization in US volumes not only improves efficiency and reduces user dependence, but also boosts 3D US interpretation. In this study, we propose a novel Multi-Agent Reinforcement Learning (MARL) framework to localize multiple uterine SPs in 3D US simultaneously. Our contribution is two-fold. First, we equip the MARL framework with a one-shot neural architecture search (NAS) module to obtain the optimal agent for each plane. Specifically, gradient-based search using a Differentiable Architecture Sampler (GDAS) is employed to accelerate and stabilize the training process. Second, we propose a novel collaborative strategy to strengthen the agents' communication. Our strategy uses a recurrent neural network (RNN) to learn the spatial relationship among SPs effectively. Extensively validated on a large dataset, our approach achieves accuracies of 7.05 degrees/2.21 mm, 8.62 degrees/2.36 mm and 5.93 degrees/0.89 mm for mid-sagittal, transverse and coronal plane localization, respectively. The proposed MARL framework can significantly increase the plane localization accuracy and reduce the computational cost and model size.

Comment: Early accepted by MICCAI 202

arXiv.org e-Print Archive

The University of Manchester - Institutional Repository

FetusMapV2: Enhanced Fetal Pose Estimation in 3D Ultrasound

Authors: Cao Yan, Chen Chaoyu, Hu Xindi, Huang Weijun, Huang Yuhao, Luo Mingyuan, Ni Dong, Shi Wenlong, Xiong Yi, Yang Xin, Yu Lequan, Yue Kejuan, Zhang Yuanji, Zhue Lei
Publication date: 30/10/2023

Fetal pose estimation in 3D ultrasound (US) involves identifying a set of associated fetal anatomical landmarks. Its primary objective is to provide comprehensive information about the fetus through landmark connections, thus benefiting various critical applications, such as biometric measurements, plane localization, and fetal movement monitoring. However, accurately estimating the 3D fetal pose in a US volume poses several challenges, including poor image quality, limited GPU memory for tackling high-dimensional data, symmetrical or ambiguous anatomical structures, and considerable variations in fetal poses. In this study, we propose a novel 3D fetal pose estimation framework (called FetusMapV2) to overcome the above challenges. Our contribution is three-fold. First, we propose a heuristic scheme that explores the complementary network structure-unconstrained and activation-unreserved GPU memory management approaches, which can enlarge the input image resolution for better results under limited GPU memory. Second, we design a novel Pair Loss to mitigate confusion caused by symmetrical and similar anatomical structures. It separates the hidden classification task from the landmark localization task and thus progressively eases model learning. Last, we propose a shape-prior-based self-supervised learning scheme that selects relatively stable landmarks to refine the pose online. Extensive experiments and diverse applications on a large-scale fetal US dataset including 1000 volumes with 22 landmarks per volume demonstrate that our method outperforms other strong competitors.

Comment: 16 pages, 11 figures, accepted by Medical Image Analysis (2023)

arXiv.org e-Print Archive

Secrets of RLHF in Large Language Models Part I: PPO

Authors: Chang Cheng, Chen Lu, Cheng Wensen, Dou Shihan, Gao Songyang, Gui Tao, Hua Yuan, Huang Haoran, Huang Xuanjing, Jin Senjie, Lai Wenbin, Liu Qin, Liu Yan, Qiu Xipeng, Shen Wei, Sun Tianxiang, Wang Binghai, Weng Rongxiang, Xi Zhiheng, Xiong Limao, Xu Nuo, Yan Hang, Yin Zhangyue, Zhang Qi, Zheng Rui, Zhou Yuhao, Zhu Minghao
Publication date: 10/07/2023

Large language models (LLMs) have formulated a blueprint for the advancement of artificial general intelligence. Their primary objective is to function as a human-centric (helpful, honest, and harmless) assistant. Alignment with humans assumes paramount significance, and reinforcement learning with human feedback (RLHF) emerges as the pivotal technological paradigm underpinning this pursuit. Current technical routes usually include reward models to measure human preferences, Proximal Policy Optimization (PPO) to optimize policy model outputs, and process supervision to improve step-by-step reasoning capabilities. However, due to the challenges of reward design, environment interaction, and agent training, coupled with the huge trial-and-error cost of large language models, there is a significant barrier for AI researchers to motivate the development of technical alignment and the safe landing of LLMs. The stable training of RLHF remains a puzzle. In this first report, we dissect the framework of RLHF, re-evaluate the inner workings of PPO, and explore how the parts comprising PPO algorithms impact policy agent training. We identify policy constraints as the key factor for the effective implementation of the PPO algorithm. Therefore, we explore PPO-max, an advanced version of the PPO algorithm, to efficiently improve the training stability of the policy model. Based on our main results, we perform a comprehensive analysis of RLHF abilities compared with SFT models and ChatGPT. The absence of open-source implementations has posed significant challenges to the investigation of LLM alignment. Therefore, we are eager to release technical reports, reward models and PPO code.

arXiv.org e-Print Archive

Shear lag of bolted and welded single angles with high strength steels

Author: Xiong Yuhao

PolyU Institutional Repository

EFFECT OF LOW PRESSURE ON THE PROPERTIES AND MICROSTRUCTURE OF ULTRA-HIGH PERFORMANCE CONCRETE

Authors: Luo Yaoling, Wen Yang, Xie Yuhao, Xiong Wu, Yan Xinyi
Publication venue: University of Chemistry and Technology, Prague
Publication date: 01/12/2021

The fresh properties, mechanical properties and microstructure of Ultra-High Performance Concrete (UHPC) with different admixtures were studied in Chengdu and Lhasa to explore the influence of low pressure on the performance of UHPC. The test results indicated that the low-pressure plateau environment can significantly reduce the dosage of the admixtures and improve the performance of fresh UHPC, but the compressive strength was significantly reduced at 28 d and 60 d. The porosity and average pore size of the UHPC under low pressure are relatively small, but pores larger than 10000 nm account for a larger proportion. In addition, the low-pressure environment on the plateau can also reduce the degree of hydration of the UHPC.

Directory of Open Access Journals