Search CORE

12 research outputs found

Agent-Aware Training for Agent-Agnostic Action Advising in Deep Reinforcement Learning

Author: Chen Kaixuan
Liu Shunyu
Song Jie
Song Mingli
Wang Yong
Wei Yaoquan
Zheng Tongya
Publication venue
Publication date: 28/11/2023
Field of study

Action advising endeavors to leverage supplementary guidance from expert teachers to alleviate the issue of sampling inefficiency in Deep Reinforcement Learning (DRL). Previous agent-specific action advising methods are hindered by imperfections in the agent itself, while agent-agnostic approaches exhibit limited adaptability to the learning agent. In this study, we propose a novel framework called Agent-Aware trAining yet Agent-Agnostic Action Advising (A7) to strike a balance between the two. The underlying concept of A7 revolves around utilizing the similarity of state features as an indicator for soliciting advice. However, unlike prior methodologies, the measurement of state feature similarity is performed by neither the error-prone learning agent nor the agent-agnostic advisor. Instead, we employ a proxy model to extract state features that are both discriminative (adaptive to the agent) and generally applicable (robust to agent noise). Furthermore, we utilize behavior cloning to train a model for reusing advice and introduce an intrinsic reward for the advised samples to incentivize the utilization of expert guidance. Experiments are conducted on the GridWorld, LunarLander, and six prominent scenarios from Atari games. The results demonstrate that A7 significantly accelerates the learning process and surpasses existing methods (both agent-specific and agent-agnostic) by a substantial margin. Our code will be made publicly available

arXiv.org e-Print Archive

An Input-Series-Output-Parallel Cascaded Converter System Applied to DC Microgrids

Author: Chunxue Wen
Jianlin Li
Menghan Lv
Peng Wang
Pengyu Jia
Qingxuan Wei
Yaoquan Wei
Publication venue: 'MDPI AG'
Publication date: 01/05/2023
Field of study

Direct current transformer (DCT) is a key piece of equipment in direct current (DC) microgrids, and the mainstream topologies mainly include LLC resonant converter (LLC) and dual active bridge (DAB). In this paper, a novel bi-directional buck/boost + CLLLC cascade topology is proposed for the input-series-output-parallel cascade converter system of a DC microgrid. To solve the problem that frequency variation causes the converter to deviate from the optimal operating point, resulting in low efficiency, and the inability to achieve a soft switching function. The CLLLC converter operates near the resonant frequency point as a DCT, only providing electrical isolation and voltage matching, while the buck/boost converter controls the output voltage and the voltage and current sharing of each module. Compared to other cascaded converter systems, the cascaded converter proposed in this paper has high efficiency, simplifies the parameter design, and is suitable for wide input and wide output operating conditions. The system adopts a three-loop control strategy, establishes the small-signal modeling of the system, and its stability is verified by theoretical analysis and simulation. The simulation and experimental results verify the correctness of the proposed cascaded converter based on buck/boost + CLLLC and the effectiveness of the control strategy

Directory of Open Access Journals

Independent of EPR effect : a smart delivery nanosystem for tracking and treatment of nonvascularized intra-abdominal metastases

Author: Chen Hongzhong
Chen Rui
Li Fuyou
Li Junyao
Lim Wei Qi
Peng Juanjuan
Su Yaoquan
Tham Phoebe Huijun
Xiang Huijing
Xing Pengyao
Yang Liqiang
Yuan Wei
Zhao Lingzhi
Zhao Yanli
Publication venue: 'Wiley'
Publication date: 01/01/2018
Field of study

Nanoparticle-based delivery systems (NDS) have impacted the field of cancer therapy on account of the enhanced permeability and retention (EPR) effect that promotes passive accumulation in tumors through the tumor vasculature after intravenous (IV) administration. However, transplanted tumor xenografts on animal models used to justify the feasibility of EPR effect are quite different from clinical tumors in many aspects, a fact that becomes an impediment for NDS to succeed clinical trials. Particularly, early-stage tumor metastases are usually nonvascularized and incapable of conforming the EPR effect after IV injection. Therefore, it is necessary to develop smart NDS to deliver drugs in an EPR-independent route. Herein, an NDS-based treatment approach for intra-abdominal metastases from ovarian carcinoma is reported. Instead of IV injection, intraperitoneal (IP) injection was employed to directly apply the NDS to the metastatic lesions. The NDS was tailor-made with targeting groups to actively target the tumor nidus and redox-responsive drug release to reduce systematic toxicity. Comparing with IV administration, the IP injected NDS could be enriched in metastatic tumor more efficiently, leading to superior therapeutic outcome in vivo. This study provides a successful protocol of EPR independent NDS-based cancer treatment, which may facilitate the clinical translation of nanoparticle-based cancer therapeutics.NRF (Natl Research Foundation, S’pore)Accepted versio

DR-NTU (Digital Repository of NTU)

A General Strategy for Hollow Metal‐Phytate Coordination Complex Micropolyhedra Enabled by Cation Exchange

Author: Brown I. D.
Chenxi Peng
Juanjuan Peng
Meiling Chen
Qiang Sun
Wei Huang
Xiaowang Liu
Xue Chen
Yaoquan Su
You X.
Yu Wang
Yuezhou Zhang
Publication venue: 'Wiley'
Publication date
Field of study

Crossref

A General Strategy for Hollow Metal‐Phytate Coordination Complex Micropolyhedra Enabled by Cation Exchange

Author: Brown I. D.
Chenxi Peng
Juanjuan Peng
Meiling Chen
Qiang Sun
Wei Huang
Xiaowang Liu
Xue Chen
Yaoquan Su
You X.
Yu Wang
Yuezhou Zhang
Publication venue: 'Wiley'
Publication date
Field of study

Crossref

A quasar possibly ejected from NGC4579

Author: A. V. Filippenko
D. Maoz
E. Hummel
H. Arp
H. D. Radecke
Haotong Zhang
Jianyan Wei
Jingyao Hu
M. Burbidge
M. Burbidge
P. C. Hewett
P. Massey
R. M. G. Delgado
X. Zhu
Xingfen Zhu
Y. Chu
Yaoquan Chu
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Dissolved and particulate organic carbon in Yantai Sishili Bay aquiculture waters

Author: C. A. Carlson
D. L. Kirchman
E. K. Duursma
J. R. Toggweiler
Jiang-tao Wang
K. Y. Bϕrsheim
Kuang Guorui
Kuang Shihuan
L. D. Guo
L. D. Guo
Li Yan
Nian-zhi Jiao
R. N. Sambrotto
S. L. Strom
S. Myklestad
Wei-hong Zhao
Wu Yaoquan
Yang Heming
Yao Lanfan
Zeng-xia Zhao
Zhang Bo
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Discovery of Highly Potent Pinanamine-Based Inhibitors against Amantadine- and Oseltamivir-Resistant Influenza A Viruses

Influenza pandemic is a constant major threat to public health caused by influenza A viruses (IAVs). IAVs are subcategorized by the surface proteins hemagglutinin (HA) and neuraminidase (NA), in which they are both essential targets for drug discovery. While it is of great concern that NA inhibitor oseltamivir resistant strains are frequently identified from human or avian influenza virus, structural and functional characterization of influenza HA has raised hopes for new antiviral therapies. In this study, we explored a structure–activity relationship (SAR) of pinanamine-based antivirals and discovered a potent inhibitor M090 against amantadine-resistant viruses, including the 2009 H1N1 pandemic strains, and oseltamivir-resistant viruses. Mechanism of action studies, particularly hemolysis inhibition, indicated that M090 targets influenza HA and it occupied a highly conserved pocket of the HA2 domain and inhibited virus-mediated membrane fusion by “locking” the bending state of HA2 during the conformational rearrangement process. This work provides new binding sites within the HA protein and indicates that this pocket may be a promising target for broad-spectrum anti-influenza A drug design and development

The Francis Crick Institute