91 research outputs found

Neural Moving Horizon Estimation for Robust Flight Control

Author: Lai Shupeng
Ma Zhengtian
Wang Bingheng
Zhao Lin
Publication date: 18/09/2023

Estimating and reacting to disturbances is crucial for robust flight control of quadrotors. Existing estimators typically require significant tuning for a specific flight scenario or training with extensive ground-truth disturbance data to achieve satisfactory performance. In this paper, we propose a neural moving horizon estimator (NeuroMHE) that can automatically tune the key parameters modeled by a neural network and adapt to different flight scenarios. We achieve this by deriving the analytical gradients of the MHE estimates with respect to the weighting matrices, which enables a seamless embedding of the MHE as a learnable layer into neural networks for highly effective learning. Interestingly, we show that the gradients can be computed efficiently using a Kalman filter in a recursive form. Moreover, we develop a model-based policy gradient algorithm to train NeuroMHE directly from the quadrotor trajectory tracking error without needing the ground-truth disturbance data. The effectiveness of NeuroMHE is verified extensively via both simulations and physical experiments on quadrotors in various challenging flights. Notably, NeuroMHE outperforms a state-of-the-art neural network-based estimator, reducing force estimation errors by up to 76.7%, while using a portable neural network that has only 7.7% of the learnable parameters of the latter. The proposed method is general and can be applied to robust adaptive control of other robotic systems.

arXiv.org e-Print Archive

Advancing Transformer Architecture in Long-Context Large Language Models: A Comprehensive Survey

Author: Chen Hao
Chen Taolue
Huang Yunpeng
Jiang Zixu
Lai Junyu
Li Shupeng
Li Zenan
Ma Xiaoxing
Xu Jingwei
Yang Lijuan
Yao Yuan
Zhao Penghao
Publication date: 23/02/2024

Transformer-based Large Language Models (LLMs) have been applied in diverse areas such as knowledge bases, human interfaces, and dynamic agents, marking a stride towards achieving Artificial General Intelligence (AGI). However, current LLMs are predominantly pretrained on short text snippets, which compromises their effectiveness in processing the long-context prompts that are frequently encountered in practical scenarios. This article offers a comprehensive survey of recent advances in Transformer-based LLM architectures aimed at enhancing the long-context capabilities of LLMs throughout the entire model lifecycle, from pre-training through to inference. We first delineate and analyze the problems of handling long-context input and output with the current Transformer-based models. We then provide a taxonomy and the landscape of upgrades on Transformer architecture to solve these problems. Afterwards, we provide an investigation of widely used evaluation necessities tailored for long-context LLMs, including datasets, metrics, and baseline models, as well as optimization toolkits such as libraries, frameworks, and compilers to boost the efficacy of LLMs across different stages in runtime. Finally, we discuss the challenges and potential avenues for future research. A curated repository of relevant literature, continuously updated, is available at https://github.com/Strivin0311/long-llms-learning. Comment: 40 pages, 3 figures, 4 tables

arXiv.org e-Print Archive

Supply chains create global benefits from improved vaccine accessibility

Author: Bjørnstad Ottar N
Guan Dabo
Hallegatte Stéphane
Hao Qi
Huo Jingwen
Lei Tianyang
Li Ruiyun
Stenseth Nils C
Sun Yida
Wang Daoping
Zeng Zhao
Zhu Shupeng
Publication venue: Springer Science and Business Media LLC
Publication date: 21/03/2023

Ensuring a more equitable distribution of vaccines worldwide is an effective strategy to control global pandemics and support economic recovery. We analyze the socioeconomic effects - defined as health gains, lockdown-easing effect, and supply-chain rebuilding benefit - of a set of idealized COVID-19 vaccine distribution scenarios. We find that an equitable vaccine distribution across the world would increase global economic benefits by 11.7% ($950 billion per year), compared to a scenario focusing on vaccinating the entire population within vaccine-producing countries first and then distributing vaccines to non-vaccine-producing countries. With limited doses among low-income countries, prioritizing the elderly who are at high risk of dying, together with the key front-line workforce who are at high risk of exposure, is projected to be economically beneficial (e.g., 0.9%–3.4% of annual GDP in India). Our results reveal how equitable distributions would cascade more protection of vaccines to people and ways to improve vaccine equity and accessibility globally through international collaboration.

UCL Discovery

Global supply chains amplify economic costs of future extreme heat risk

Author: Cai Wenjia
Duan Jianping
Guan Dabo
Hu Yixin
Lu Hui
Sun Yida
Tan Chang
Tao Shu
Wang Daoping
Wang Yong
Yin Hao
Zhang Lingrui
Zhao Mengzhen
Zhu Shupeng
Publication date: 13/03/2024

Evidence shows a continuing increase in the frequency and severity of global heatwaves, raising concerns about the future impacts of climate change and the associated socioeconomic costs. Here we develop a disaster footprint analytical framework by integrating climate, epidemiological and hybrid input–output and computable general equilibrium global trade models to estimate the midcentury socioeconomic impacts of heat stress. We consider health costs related to heat exposure, the value of heat-induced labour productivity loss and indirect losses due to economic disruptions cascading through supply chains. Here we show that the global annual incremental gross domestic product loss increases exponentially from 0.03 ± 0.01 (SSP 245)–0.05 ± 0.03 (SSP 585) percentage points during 2030–2040 to 0.05 ± 0.01–0.15 ± 0.04 percentage points during 2050–2060. By 2060, the expected global economic losses reach a total of 0.6–4.6% with losses attributed to health loss (37–45%), labour productivity loss (18–37%) and indirect loss (12–43%) under different shared socioeconomic pathways. Small- and medium-sized developing countries suffer disproportionately from higher health loss in South-Central Africa (2.1–4.0 times above global average) and labour productivity loss in West Africa and Southeast Asia (2.0–3.3 times above global average). The supply-chain disruption effects are much more widespread, hitting manufacturing-heavy countries such as China and the USA hard and leading to soaring economic losses of 2.7 ± 0.7% and 1.8 ± 0.5%, respectively.

UCL Discovery

Occurrence of neonicotinoids in Chinese apiculture and a corresponding risk exposure assessment

Author: Chen Lanzhen
Goulson Dave
Jin Yue
Li Yi
Wang Xinran
Yang Shupeng
Zhang Jinzhen
Zhao Wen
Zhou Jinhui
Publication venue: American Chemical Society (ACS)
Publication date: 24/02/2020

Neonicotinoids are the most widely used insecticides worldwide, but there is mounting evidence demonstrating that they have adverse effects on nontarget organisms. However, little is known about the extent of environmental neonicotinoid contamination in China. In this study, a total of 693 honey samples from across China, from both Apis mellifera and Apis cerana, were analyzed to examine neonicotinoid concentrations, their geographical distribution, and their correlation with the primary plant species from which the honey was obtained. Furthermore, chronic and acute exposure risk and risk ranking for humans eating honey were investigated, and risks to bees were also considered. The results revealed that 40.8% of the samples contained at least one of the five neonicotinoids tested. Honeys from commercial crops were found to be more frequently contaminated with neonicotinoids than those from noncommercial crops. Honey samples from Apis mellifera were more frequently contaminated than those from Apis cerana. The concentrations of neonicotinoids found in honey overlapped with those that have been found to have significant adverse effects on honeybee health. The dietary risk assessments indicated that the levels of neonicotinoids detected in honey were likely to be safe for human consumption.

Sussex Research Online

Structural and Lipidomic Alterations of Striatal Myelin in 16p11.2 Deletion Mouse Model of Autism Spectrum Disorder

Author: Dilong Wang
Huanquan Liao
Huiliang Li
Jian Jiang
Jun Ju
Lei Zheng
Liyang Liang
Ningning Li
Sheng-Tao Hou
Shupeng Li
Xiaofeng Zhao
Xiaoyi Fang
Xiuyan Yang
Yihang Pan
Yumeng Zhang
Publication venue: Frontiers Media SA
Publication date: 01/08/2021

Myelin abnormalities have been observed in autism spectrum disorder (ASD). In this study, we seek to discover myelin-related changes in the striatum, a key brain region responsible for core ASD features, using the 16p11.2 deletion (16p11.2+/-) mouse model of ASD. We found downregulated expression of multiple myelin genes and decreased myelin thickness in the striatum of 16p11.2+/- mice versus wild type controls. Moreover, given that myelin is the main reservoir of brain lipids and that increasing evidence has linked dysregulation of lipid metabolism to ASD, we performed lipidomic analysis and discovered decreased levels of certain species of sphingomyelin, hexosyl ceramide and their common precursor, ceramide, in 16p11.2+/- striatum, all of which are major myelin components. We further identified lack of ceramide synthase 2 as the possible reason behind the decrease in these lipid species. Taken together, our data suggest a role for myelin and myelin lipids in ASD development.

Directory of Open Access Journals

UCL Discovery

Genetic algorithm optimization for storing arbitrary multimode transverse images in thermal atomic vapor

Author: Chen Yun
Gao Hong
Jiang Zibin
Li Fuli
Wang Chengyuan
Wang Jinwen
Wei Dong
Yu Ya
Zhao Shupeng
Publication venue: AIP Publishing
Publication date: 07/06/2021

Storing multimode transverse images in atomic media is crucial in constructing large-scale quantum networks. A major obstacle to storing transverse images in thermal atomic vapor is the distortion of the retrieved images caused by atomic diffusion. Here, we demonstrate the combination of a genetic algorithm with the phase-shift lithography method to construct the optimal phase for an arbitrary transverse image that can diminish the effect of diffusion. Theoretical simulations and experimental results show that the retrieved images' resolution can be substantially improved when carrying the optimal phases. Our scheme is efficient and straightforward and can be extensively applied in storing complex transverse multimodes in diffusion media.

Enlighten