12 research outputs found

    Agent-Aware Training for Agent-Agnostic Action Advising in Deep Reinforcement Learning

    Full text link
    Action advising endeavors to leverage supplementary guidance from expert teachers to alleviate the issue of sampling inefficiency in Deep Reinforcement Learning (DRL). Previous agent-specific action advising methods are hindered by imperfections in the agent itself, while agent-agnostic approaches exhibit limited adaptability to the learning agent. In this study, we propose a novel framework called Agent-Aware trAining yet Agent-Agnostic Action Advising (A7) to strike a balance between the two. The underlying concept of A7 revolves around utilizing the similarity of state features as an indicator for soliciting advice. However, unlike prior methodologies, the measurement of state feature similarity is performed by neither the error-prone learning agent nor the agent-agnostic advisor. Instead, we employ a proxy model to extract state features that are both discriminative (adaptive to the agent) and generally applicable (robust to agent noise). Furthermore, we utilize behavior cloning to train a model for reusing advice and introduce an intrinsic reward for the advised samples to incentivize the utilization of expert guidance. Experiments are conducted on the GridWorld, LunarLander, and six prominent scenarios from Atari games. The results demonstrate that A7 significantly accelerates the learning process and surpasses existing methods (both agent-specific and agent-agnostic) by a substantial margin. Our code will be made publicly available

    An Input-Series-Output-Parallel Cascaded Converter System Applied to DC Microgrids

    No full text
    Direct current transformer (DCT) is a key piece of equipment in direct current (DC) microgrids, and the mainstream topologies mainly include LLC resonant converter (LLC) and dual active bridge (DAB). In this paper, a novel bi-directional buck/boost + CLLLC cascade topology is proposed for the input-series-output-parallel cascade converter system of a DC microgrid. To solve the problem that frequency variation causes the converter to deviate from the optimal operating point, resulting in low efficiency, and the inability to achieve a soft switching function. The CLLLC converter operates near the resonant frequency point as a DCT, only providing electrical isolation and voltage matching, while the buck/boost converter controls the output voltage and the voltage and current sharing of each module. Compared to other cascaded converter systems, the cascaded converter proposed in this paper has high efficiency, simplifies the parameter design, and is suitable for wide input and wide output operating conditions. The system adopts a three-loop control strategy, establishes the small-signal modeling of the system, and its stability is verified by theoretical analysis and simulation. The simulation and experimental results verify the correctness of the proposed cascaded converter based on buck/boost + CLLLC and the effectiveness of the control strategy

    Independent of EPR effect : a smart delivery nanosystem for tracking and treatment of nonvascularized intra-abdominal metastases

    No full text
    Nanoparticle-based delivery systems (NDS) have impacted the field of cancer therapy on account of the enhanced permeability and retention (EPR) effect that promotes passive accumulation in tumors through the tumor vasculature after intravenous (IV) administration. However, transplanted tumor xenografts on animal models used to justify the feasibility of EPR effect are quite different from clinical tumors in many aspects, a fact that becomes an impediment for NDS to succeed clinical trials. Particularly, early-stage tumor metastases are usually nonvascularized and incapable of conforming the EPR effect after IV injection. Therefore, it is necessary to develop smart NDS to deliver drugs in an EPR-independent route. Herein, an NDS-based treatment approach for intra-abdominal metastases from ovarian carcinoma is reported. Instead of IV injection, intraperitoneal (IP) injection was employed to directly apply the NDS to the metastatic lesions. The NDS was tailor-made with targeting groups to actively target the tumor nidus and redox-responsive drug release to reduce systematic toxicity. Comparing with IV administration, the IP injected NDS could be enriched in metastatic tumor more efficiently, leading to superior therapeutic outcome in vivo. This study provides a successful protocol of EPR independent NDS-based cancer treatment, which may facilitate the clinical translation of nanoparticle-based cancer therapeutics.NRF (Natl Research Foundation, S’pore)Accepted versio

    Discovery of Highly Potent Pinanamine-Based Inhibitors against Amantadine- and Oseltamivir-Resistant Influenza A Viruses

    No full text
    Influenza pandemic is a constant major threat to public health caused by influenza A viruses (IAVs). IAVs are subcategorized by the surface proteins hemagglutinin (HA) and neuraminidase (NA), in which they are both essential targets for drug discovery. While it is of great concern that NA inhibitor oseltamivir resistant strains are frequently identified from human or avian influenza virus, structural and functional characterization of influenza HA has raised hopes for new antiviral therapies. In this study, we explored a structure–activity relationship (SAR) of pinanamine-based antivirals and discovered a potent inhibitor <b>M090</b> against amantadine-resistant viruses, including the 2009 H1N1 pandemic strains, and oseltamivir-resistant viruses. Mechanism of action studies, particularly hemolysis inhibition, indicated that <b>M090</b> targets influenza HA and it occupied a highly conserved pocket of the HA<sub>2</sub> domain and inhibited virus-mediated membrane fusion by “locking” the bending state of HA<sub>2</sub> during the conformational rearrangement process. This work provides new binding sites within the HA protein and indicates that this pocket may be a promising target for broad-spectrum anti-influenza A drug design and development
    corecore