Search CORE

84 research outputs found

ED2: Environment Dynamics Decomposition World Models for Continuous Control

Author: Hao Jianye
Wang Cong
Wang Zhen
Yuan Yifu
Publication venue
Publication date: 15/02/2024
Field of study

Model-based reinforcement learning (MBRL) achieves significant sample efficiency in practice in comparison to model-free RL, but its performance is often limited by the existence of model prediction error. To reduce the model error, standard MBRL approaches train a single well-designed network to fit the entire environment dynamics, but this wastes rich information on multiple sub-dynamics which can be modeled separately, allowing us to construct the world model more accurately. In this paper, we propose the Environment Dynamics Decomposition (ED2), a novel world model construction framework that models the environment in a decomposing manner. ED2 contains two key components: sub-dynamics discovery (SD2) and dynamics decomposition prediction (D2P). SD2 discovers the sub-dynamics in an environment automatically and then D2P constructs the decomposed world model following the sub-dynamics. ED2 can be easily combined with existing MBRL algorithms and empirical results show that ED2 significantly reduces the model error, increases the sample efficiency, and achieves higher asymptotic performance when combined with the state-of-the-art MBRL algorithms on various continuous control tasks. Our code is open source and available at https://github.com/ED2-source-code/ED2.Comment: 10 pages, 13 figure

arXiv.org e-Print Archive

Disaster cassification net: A disaster classification algorithm on remote sensing imagery

Author: Fangyuan Liu
Jianye Yuan
Xin Cai
Xinwang Ding
Publication venue: 'Frontiers Media SA'
Publication date: 01/01/2023
Field of study

As we all know, natural disasters have a great impact on people’s lives and properties, and it is very necessary to deal with disaster categories in a timely and effective manner. In light of this, we propose using tandem stitching to create a new Disaster Cassification network D-Net (Disaster Cassification Net) using the D-Conv, D-Linear, D-model, and D-Layer modules. During the experiment, we compared the proposed method with “CNN” and “Transformer”, we found that disaster cassification net compared to CNN algorithm Params decreased by 26–608 times, FLOPs decreased by up to 21 times, Precision increased by 1.6%–43.5%; we found that disaster cassification net compared to Transformer algorithm Params decreased by 23–149 times, FLOPs decreased by 1.7–10 times, Precision increased by 3.9%–25.9%. Precision increased by 3.9%–25.9%. And found that disaster cassification net achieves the effect of SOTA(State-Of-The-Art) on the disaster dataset; After that, we compared the above-mentioned MobileNet_v2 with the best performance on the classification dataset and CCT network are compared with disaster cassification net on fashion_mnist and CIFAR_100 public datasets, respectively, and the results show that disaster cassification net can still achieve the state-of-the-art classification effect. Therefore, our proposed algorithm can be applied not only to disaster tasks, but also to other classification tasks

Directory of Open Access Journals

PreRoutGNN for Timing Prediction with Order Preserving Partition: Global Circuit Pre-training, Local Delay Learning and Attentional Cell Modeling

Author: Hao Jianye
Kai Shixiong
Tang Zhentao
Yan Junchi
Ye Junjie
Yuan Mingxuan
Zhong Ruizhe
Publication venue
Publication date: 12/03/2024
Field of study

Pre-routing timing prediction has been recently studied for evaluating the quality of a candidate cell placement in chip design. It involves directly estimating the timing metrics for both pin-level (slack, slew) and edge-level (net delay, cell delay), without time-consuming routing. However, it often suffers from signal decay and error accumulation due to the long timing paths in large-scale industrial circuits. To address these challenges, we propose a two-stage approach. First, we propose global circuit training to pre-train a graph auto-encoder that learns the global graph embedding from circuit netlist. Second, we use a novel node updating scheme for message passing on GCN, following the topological sorting sequence of the learned graph embedding and circuit graph. This scheme residually models the local time delay between two adjacent pins in the updating sequence, and extracts the lookup table information inside each cell via a new attention mechanism. To handle large-scale circuits efficiently, we introduce an order preserving partition scheme that reduces memory consumption while maintaining the topological dependencies. Experiments on 21 real world circuits achieve a new SOTA R2 of 0.93 for slack prediction, which is significantly surpasses 0.59 by previous SOTA method. Code will be available at: https://github.com/Thinklab-SJTU/EDA-AI.Comment: 13 pages, 5 figures, The 38th Annual AAAI Conference on Artificial Intelligence (AAAI 2024

arXiv.org e-Print Archive

The activation of p38MAPK and JNK pathways in bovine herpesvirus 1 infected MDBK cells

Author: Chen Yuan
Dong Zhang
Guoqiang Zhu
Jianye Wang
Liqian Zhu
Liyuan Huang
Xiuyan Ding
Publication venue: Springer Nature
Publication date
Field of study

Springer - Publisher Connector

Learning to Select Cuts for Efficient Mixed-Integer Programming

Author: Hao Jianye
Huang Zeren
Liu Furui
Wang Jun
Wang Kerong
Yu Yong
Yuan Mingxuan
Zhang Weinan
Zhen Hui-ling
Publication venue
Publication date: 08/10/2021
Field of study

Cutting plane methods play a significant role in modern solvers for tackling mixed-integer programming (MIP) problems. Proper selection of cuts would remove infeasible solutions in the early stage, thus largely reducing the computational burden without hurting the solution accuracy. However, the major cut selection approaches heavily rely on heuristics, which strongly depend on the specific problem at hand and thus limit their generalization capability. In this paper, we propose a data-driven and generalizable cut selection approach, named Cut Ranking, in the settings of multiple instance learning. To measure the quality of the candidate cuts, a scoring function, which takes the instance-specific cut features as inputs, is trained and applied in cut ranking and selection. In order to evaluate our method, we conduct extensive experiments on both synthetic datasets and real-world datasets. Compared with commonly used heuristics for cut selection, the learning-based policy has shown to be more effective, and is capable of generalizing over multiple problems with different properties. Cut Ranking has been deployed in an industrial solver for large-scale MIPs. In the online A/B testing of the product planning problems with more than

10^7

variables and constraints daily, Cut Ranking has achieved the average speedup ratio of 12.42% over the production solver without any accuracy loss of solution.Comment: Paper accepted at Pattern Recognition journa

arXiv.org e-Print Archive

UCL Discovery

AlignDiff: Aligning Diverse Human Preferences via Behavior-Customisable Diffusion Model

Author: Dong Zibin
Fan Changjie
Hao Jianye
Hu Yujing
Hu Zhipeng
Lv Tangjie
Mu Yao
Ni Fei
Yuan Yifu
Zheng Yan
Publication venue
Publication date: 04/02/2024
Field of study

Aligning agent behaviors with diverse human preferences remains a challenging problem in reinforcement learning (RL), owing to the inherent abstractness and mutability of human preferences. To address these issues, we propose AlignDiff, a novel framework that leverages RL from Human Feedback (RLHF) to quantify human preferences, covering abstractness, and utilizes them to guide diffusion planning for zero-shot behavior customizing, covering mutability. AlignDiff can accurately match user-customized behaviors and efficiently switch from one to another. To build the framework, we first establish the multi-perspective human feedback datasets, which contain comparisons for the attributes of diverse behaviors, and then train an attribute strength model to predict quantified relative strengths. After relabeling behavioral datasets with relative strengths, we proceed to train an attribute-conditioned diffusion model, which serves as a planner with the attribute strength model as a director for preference aligning at the inference phase. We evaluate AlignDiff on various locomotion tasks and demonstrate its superior performance on preference matching, switching, and covering compared to other baselines. Its capability of completing unseen downstream tasks under human instructions also showcases the promising potential for human-AI collaboration. More visualization videos are released on https://aligndiff.github.io/

arXiv.org e-Print Archive

Mechanism of Qingchang Suppository on repairing the intestinal mucosal barrier in ulcerative colitis

Author: Haoyue Liu
Jiang Lin
Jianye Yuan
Jingyi Shan
Jingyi Shan
Suxian Liu
Suxian Liu
Publication venue: Frontiers Media S.A.
Publication date: 01/08/2023
Field of study

Ulcerative colitis (UC) is a refractory inflammatory bowel disease, and the outcomes of conventional therapies of UC, including 5-aminosalicylic acid, glucocorticoids, immunosuppressants, and biological agents, are not satisfied with patients and physicians with regard to adverse reactions and financial burden. The abnormality of the intestinal mucosal barrier in the pathogenesis of UC was verified. Qingchang Suppository (QCS) is an herbal preparation and is effective in treating ulcerative proctitis. The mechanism of QCS and its active ingredients have not been concluded especially in mucosal healing. This review elucidated the potential mechanism of QCS from the intestinal mucosal barrier perspective to help exploring future QCS research directions

Directory of Open Access Journals

CLEC7A regulates M2 macrophages to suppress the immune microenvironment and implies poorer prognosis of glioma

Author: Jianning Zhang
Jianye Xu
Jinchao Wang
Kai Wang
Kaiji Li
Ruilong Peng
Shangchen Xu
Shu Zhang
Xiaoru Li
Xu Zhang
Yalong Gao
Yuan Zhou
Publication venue: Frontiers Media S.A.
Publication date: 01/05/2024
Field of study

BackgroundGliomas constitute a category of malignant tumors originating from brain tissue, representing the majority of intracranial malignancies. Previous research has demonstrated the pivotal role of CLEC7A in the progression of various cancers, yet its specific implications within gliomas remain elusive. The primary objective of this study was to investigate the prognostic significance and immune therapeutic potential of CLEC7A in gliomas through the integration of bioinformatics and clinical pathological analyses.MethodsThis investigation involved examining and validating the relationship between CLEC7A and glioma using samples from Hospital, along with data from TCGA, GEO, GTEx, and CGGA datasets. Subsequently, we explored its prognostic value, biological functions, expression location, and impact on immune cells within gliomas. Finally, we investigated its potential impact on the chemotaxis and polarization of macrophages.ResultsThe expression of CLEC7A is upregulated in gliomas, and its levels escalate with the malignancy of tumors, establishing it as an independent prognostic factor. Functional enrichment analysis revealed a significant correlation between CLEC7A and immune function. Subsequent examination of immune cell differential expression demonstrated a robust association between CLEC7A and M2 macrophages. This conclusion was further substantiated through single-cell analysis, immunofluorescence, and correlation studies. Finally, the knockout of CLEC7A in M2 macrophages resulted in a noteworthy reduction in macrophage chemotaxis and polarization factors.ConclusionCLEC7A expression is intricately linked to the pathology and molecular characteristics of gliomas, establishing its role as an independent prognostic factor for gliomas and influencing macrophage function. It could be a promising target for immunotherapy in gliomas

Directory of Open Access Journals

3D sweater garment style generation based on 3D anthropometric characteristic parameters

Author: Duan Li
Jianye Yuan
Xuan Luo
Publication venue: SAGE Publishing
Publication date: 01/01/2024
Field of study

This paper proposes a method for three-dimensional style modeling of loose sweaters. Through the correlation analysis of the three-dimensional human body and the classic sweater style, a style model was built on the three-dimensional human body model to realize efficient personalized sweater design and production. First of all, the design model was extracted from the human body model based on the characteristics of the ring-cutting algorithm. Secondly, the loose model of the sweater was established based on the chest, waist, and hip data of the human body. Subsequently, the feature line between the size information and style features was created, and curve interpolation values were combined with joint smoothing methods to generate a multi-faceted sweater style model. Finally, the mapping function was used to flatten the style model, the related styles were woven by operating the computer, and the suitability of the established sweater fabric was verified. The comparison results showed that the accuracy of the style construction of this model was improved. Through the analysis of experimental data, it can be proven that the method proposed in this paper can quickly and accurately establish a three-dimensional style model of a sweater, without the need for repeated measurements to make templates, thus saving development time

Directory of Open Access Journals

Research on Lightweight Disaster Classification Based on High-Resolution Remote Sensing Images

Author: Ge Han
Jianye Yuan
Song Li
Wei Gong
Xin Ma
Publication venue: MDPI AG
Publication date: 01/05/2022
Field of study

With the increasing frequency of natural disasters becoming, it is very important to classify and identify disasters. We propose a lightweight disaster classification model, which has lower computation and parameter quantities and a higher accuracy than other classification models. For this purpose, this paper specially proposes the SDS-Network algorithm, which is optimized on ResNet, to deal with the above problems of remote sensing images. First, it implements the spatial attention mechanism to improve the accuracy of the algorithm; then, the depth separable convolution is introduced to reduce the number of model calculations and parameters while ensuring the accuracy of the algorithm; finally, the effect of the model is increased by adjusting some hyperparameters. The experimental results show that, compared with the classic AlexNet, ResNet18, VGG16, VGG19, and Densenet121 classification models, the SDS-Network algorithm in this paper has a higher accuracy, and when compared with the lightweight models mobilenet series, shufflenet series, squeezenet series, and mnasnet series, it has lower model complexity and a higher accuracy rate. According to a comprehensive performance comparison of the charts made in this article, it is found that the SDS-Network algorithm is still better than the regnet series algorithm. Furthermore, after verification with a public data set, the SDS-Network algorithm in this paper is found to have a good generalization ability. Thus, we can conclude that the SDS-Network classification model of the algorithm in this paper has a good classification effect, and it is suitable for disaster classification tasks. Finally, it is verified on public data sets that the proposed SDS-Network has good generalization ability and portability

Directory of Open Access Journals