Search CORE

25 research outputs found

Safe Model-Free Reinforcement Learning using Disturbance-Observer-Based Control Barrier Functions

Author: Cheng Yikun
Hovakimyan Naira
Zhao Pan
Publication venue
Publication date: 25/12/2022
Field of study

Safe reinforcement learning (RL) with assured satisfaction of hard state constraints during training has recently received a lot of attention. Safety filters, e.g., based on control barrier functions (CBFs), provide a promising way for safe RL via modifying the unsafe actions of an RL agent on the fly. Existing safety filter-based approaches typically involve learning of uncertain dynamics and quantifying the learned model error, which leads to conservative filters before a large amount of data is collected to learn a good model, thereby preventing efficient exploration. This paper presents a method for safe and efficient model-free RL using disturbance observers (DOBs) and control barrier functions (CBFs). Unlike most existing safe RL methods that deal with hard state constraints, our method does not involve model learning, and leverages DOBs to accurately estimate the pointwise value of the uncertainty, which is then incorporated into a robust CBF condition to generate safe actions. The DOB-based CBF can be used as a safety filter with any model-free RL algorithms by minimally modifying the actions of an RL agent whenever necessary to ensure safety throughout the learning process. Simulation results on a unicycle and a 2D quadrotor demonstrate that the proposed method outperforms a state-of-the-art safe RL algorithm using CBFs and Gaussian processes-based model learning, in terms of safety violation rate, and sample and computational efficiency

arXiv.org e-Print Archive

Guaranteed Nonlinear Tracking in the Presence of DNN-Learned Dynamics With Contraction Metrics and Disturbance Estimation

Author: Cheng Yikun
Gahlawat Aditya
Guo Ziyao
Hovakimyan Naira
Kang Hyungsoo
Zhao Pan
Publication venue
Publication date: 22/04/2022
Field of study

This paper presents an approach to trajectory-centric learning control based on contraction metrics and disturbance estimation for nonlinear systems subject to matched uncertainties. The proposed approach allows for the use of deep neural networks to learn uncertain dynamics while still providing guarantees of transient tracking performance throughout the learning phase. Within the proposed approach, a disturbance estimation law is adopted to estimate the pointwise value of the uncertainty, with pre-computable estimation error bounds (EEBs). The learned dynamics, the estimated disturbances, and the EEBs are then incorporated in a robust Riemannian energy condition to compute the control law that guarantees exponential convergence of actual trajectories to desired ones throughout the learning phase, even when the learned model is poor. On the other hand, with improved accuracy, the learned model can be incorporated into a high-level planner to plan better trajectories with improved performance, e.g., lower energy consumption and shorter travel time. The proposed framework is validated on a planar quadrotor navigation example.Comment: Shorter version submitted to CDC 202

arXiv.org e-Print Archive

Convex Synthesis of Control Barrier Functions Under Input Constraints

Author: Abdi Hossein
Cheng Yikun
Ghabcheloo Reza
Hovakimyan Naira
Zhao Pan
Publication venue
Publication date: 01/01/2023
Field of study

This letter presents a systematic method based on the sum of square (SOS) optimization to synthesize control barrier functions (CBFs) for nonlinear polynomial systems subject to input constraints. The approach consists of two design steps. In the first step, using a linear-like representation of the nonlinear dynamics, an SOS optimization problem is formulated to search for an initial CBF and controller jointly. In the second step, an iterative optimization procedure involving the solution of a series of SOS problems is proposed to alternatively update the CBF and the controller to increase the invariant set defined by the CBF. The efficacy of the proposed approach is validated using numerical examples.Peer reviewe

Trepo - Institutional Repository of Tampere University

Giant magneto-birefringence effect and tuneable colouration of 2D crystals' suspensions

Author: Cheng Hui-Ming
Ding Baofu
Geim A. K.
Grigorieva I. V.
Kuang Wenjun
Liu Bilu
Pan Yikun
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/07/2020
Field of study

One of the long sought-after goals in manipulation of light through light-matter interactions is the realization of magnetic-field-tuneable colouration, so-called magneto-chromatic effect, which holds great promise for optical, biochemical and medical applications due to its contactless and non-invasive nature. This goal can be achieved by magnetic-field controlled birefringence, where colours are produced by the interference between phase-retarded components of transmitted polarised light. Thus far birefringence-tuneable coloration has been demonstrated using electric field, material chirality and mechanical strain but magnetic field control remained elusive due to either weak magneto-optical response of transparent media or low transmittance to visible light of magnetically responsive media, such as ferrofluids. Here we demonstrate magnetically tuneable colouration of aqueous suspensions of two-dimensional cobalt-doped titanium oxide which exhibit an anomalously large magneto-birefringence effect. The colour of the suspensions can be tuned over more than two wavelength cycles in the visible range by moderate magnetic fields below 0.8 T. We show that such giant magneto-chromatic response is due to particularly large phase retardation (>3 pi) of the polarised light, which in its turn is a combined result of a large Cotton-Mouton coefficient (three orders of magnitude larger than for known liquid crystals), relatively high saturation birefringence (delta n = 2 x 10^-4) and high transparency of our suspensions to visible light. The work opens a new avenue to achieve tuneable colouration through engineered magnetic birefringence and can readily be extended to other magnetic 2D nanocrystals. The demonstrated effect can be used in a variety of magneto-optical applications, including magnetic field sensors, wavelength-tuneable optical filters and see-through printing.Comment: 10 pages, 4 figures. Nature Communications, 2020, Accepte

arXiv.org e-Print Archive

Directory of Open Access Journals

Probing the Electroweak Phase Transition with Exotic Higgs Decays

Author: Carena Marcela
Kozaczuk Jonathan
Liu Zhen
Ou Tong
Ramsey-Musolf Michael J.
Shelton Jessie
Wang Yikun
Xie Ke-Pan
Publication venue: DigitalCommons@University of Nebraska - Lincoln
Publication date: 15/03/2022
Field of study

An essential goal of the Higgs physics program at the LHC and beyond is to explore the nature of the Higgs potential and shed light on the mechanism of electroweak symmetry breaking. An important class of models alter the thermal history of electroweak symmetry breaking from the predictions of the Standard Model (SM). This paper reviews the existence of a region of parameter space where a strong first-order electroweak phase transition is compatible with exotic decays of the SM-like Higgs boson. A dedicated search for exotic Higgs decays can actively explore this framework at the Large Hadron Collider (LHC), while future exotic Higgs decay searches at the high-luminosity LHC and future Higgs factories will be vital to conclusively probe the scenario

arXiv.org e-Print Archive

DigitalCommons@University of Nebraska

DDC-PIM: Efficient Algorithm/Architecture Co-design for Doubling Data Capacity of SRAM-based Processing-In-Memory

Author: Duan Cenlin
He Xiaolin
He Ziyan
Jia Xiaotao
Pan Weitao
Qi Yingjie
Wang Xueyan
Wang Yikun
Wang Yiou
Yan Bonan
Yang Jianlei
Zhao Weisheng
Publication venue
Publication date: 31/10/2023
Field of study

Processing-in-memory (PIM), as a novel computing paradigm, provides significant performance benefits from the aspect of effective data movement reduction. SRAM-based PIM has been demonstrated as one of the most promising candidates due to its endurance and compatibility. However, the integration density of SRAM-based PIM is much lower than other non-volatile memory-based ones, due to its inherent 6T structure for storing a single bit. Within comparable area constraints, SRAM-based PIM exhibits notably lower capacity. Thus, aiming to unleash its capacity potential, we propose DDC-PIM, an efficient algorithm/architecture co-design methodology that effectively doubles the equivalent data capacity. At the algorithmic level, we propose a filter-wise complementary correlation (FCC) algorithm to obtain a bitwise complementary pair. At the architecture level, we exploit the intrinsic cross-coupled structure of 6T SRAM to store the bitwise complementary pair in their complementary states (

Q/\overline{Q}

), thereby maximizing the data capacity of each SRAM cell. The dual-broadcast input structure and reconfigurable unit support both depthwise and pointwise convolution, adhering to the requirements of various neural networks. Evaluation results show that DDC-PIM yields about

2.84\times

speedup on MobileNetV2 and

2.69\times

on EfficientNet-B0 with negligible accuracy loss compared with PIM baseline implementation. Compared with state-of-the-art SRAM-based PIM macros, DDC-PIM achieves up to

8.41\times

and

2.75\times

improvement in weight density and area efficiency, respectively.Comment: 14 pages, to be published in IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (TCAD

arXiv.org e-Print Archive

Giant magneto-birefringence effect and tuneable colouration of 2D crystal suspensions

Author: Cheng Hui-Ming
Ding Baofu
Geim A. K.
Grigorieva I. V.
Kuang Wenjun
Liu Bilu
Pan Yikun
Publication venue: Nature Publishing Group UK
Publication date: 03/07/2020
Field of study

From Springer Nature via Jisc Publications RouterHistory: received 2020-02-13, accepted 2020-07-03, registration 2020-07-10, pub-electronic 2020-07-24, online 2020-07-24, collection 2020-12Publication status: PublishedAbstract: One of the long-sought-after goals in light manipulation is tuning of transmitted interference colours. Previous approaches toward this goal include material chirality, strain and electric-field controls. Alternatively, colour control by magnetic field offers contactless, non-invasive and energy-free advantages but has remained elusive due to feeble magneto-birefringence in conventional transparent media. Here we demonstrate an anomalously large magneto-birefringence effect in transparent suspensions of magnetic two-dimensional crystals, which arises from a combination of a large Cotton-Mouton coefficient and relatively high magnetic saturation birefringence. The effect is orders of magnitude stronger than those previously demonstrated for transparent materials. The transmitted colours of the suspension can be continuously tuned over two-wavelength cycles by moderate magnetic fields below 0.8 T. The work opens a new avenue to tune transmitted colours, and can be further extended to other systems with artificially engineered magnetic birefringence

ChesterRep

Recognition of Abnormal-Laying Hens Based on Fast Continuous Wavelet and Deep Learning Using Hyperspectral Images

Author: Chenxiao Lai
Mingzhong Pan
Xing Qin
Yikun Wang
Yun Xiang
Zejun Pan
Publication venue: 'MDPI AG'
Publication date: 01/03/2023
Field of study

The egg production of laying hens is crucial to breeding enterprises in the laying hen breeding industry. However, there is currently no systematic or accurate method to identify low-egg-production-laying hens in commercial farms, and the majority of these hens are identified by breeders based on their experience. In order to address this issue, we propose a method that is widely applicable and highly precise. First, breeders themselves separate low-egg-production-laying hens and normal-laying hens. Then, under a halogen lamp, hyperspectral images of the two different types of hens are captured via hyperspectral imaging equipment. The vertex component analysis (VCA) algorithm is used to extract the cockscomb end member spectrum to obtain the cockscomb spectral feature curves of low-egg-production-laying hens and normal ones. Next, fast continuous wavelet transform (FCWT) is employed to analyze the data of the feature curves in order to obtain the two-dimensional spectral feature image dataset. Finally, referring to the two-dimensional spectral image dataset of the low-egg-production-laying hens and normal ones, we developed a deep learning model based on a convolutional neural network (CNN). When we tested the model’s accuracy by using the prepared dataset, we found that it was 0.975 percent accurate. This outcome demonstrates our identification method, which combines hyperspectral imaging technology, an FCWT data analysis method, and a CNN deep learning model, and is highly effective and precise in laying-hen breeding plants. Furthermore, the attempt to use FCWT for the analysis and processing of hyperspectral data will have a significant impact on the research and application of hyperspectral technology in other fields due to its high efficiency and resolution characteristics for data signal analysis and processing

Directory of Open Access Journals