Search CORE

63 research outputs found

Vid2Act: Activate Offline Videos for Visual RL

Author: Minting Pan
Xiaokang Yang
Yitao Zheng
Yunbo Wang
Publication venue
Publication date: 05/06/2023
Field of study

Pretraining RL models on offline video datasets is a promising way to improve their training efficiency in online tasks, but challenging due to the inherent mismatch in tasks, dynamics, and behaviors across domains. A recent model, APV, sidesteps the accompanied action records in offline datasets and instead focuses on pretraining a task-irrelevant, action-free world model within the source domains. We present Vid2Act, a model-based RL method that learns to transfer valuable action-conditioned dynamics and potentially useful action demonstrations from offline to online settings. The main idea is to use the world models not only as simulators for behavior learning but also as tools to measure the domain relevance for both dynamics representation transfer and policy transfer. Specifically, we train the world models to generate a set of time-varying task similarities using a domain-selective knowledge distillation loss. These similarities serve two purposes: (i) adaptively transferring the most useful source knowledge to facilitate dynamics learning, and (ii) learning to replay the most relevant source actions to guide the target policy. We demonstrate the advantages of Vid2Act over the action-free visual RL pretraining method in both Meta-World and DeepMind Control Suite

arXiv.org e-Print Archive

Numerical investigation of multi-nozzle ejector device with inclined nozzles for marine gas turbine

Author: Cheng Mengmeng
Shi Hong
Wang Rui
Yuan Jie
Zhang Qianwei
Zheng Rentong
Zou Yitao
Publication venue: Faculty of Mechanical Engineering and Naval Architecture
Publication date: 01/01/2023
Field of study

The high-temperature exhaust gases and the hot surfaces of the ejector device in marine gas turbines generate significant levels of infrared radiation. An appropriate nozzle structure can effectively lower the exhaust gas temperature and reduce the high-temperature radiation surface area, thereby minimizing external infrared radiation. In this study, a numerical simulation of the nozzle structure in the ejector device was conducted using computational fluid dynamics (CFD) methods. By investigating the orthogonal combinations of nozzle inclination angles and the number of nozzles, the temperature distribution and flow characteristics under different operating conditions were analysed. The results showed that as the nozzle inclination angle increased, the entrainment coefficient (Ce) and the temperature ratio at the inlet and outlet (Rt) initially improved but then worsened. Simultaneously, the pressure loss coefficient (Cpl) increased with the inclination angle. The CRITIC weight method was employed to objectively allocate weights to Rt, Ce, and Cpl, determining the optimal solution. The results indicated that Rt and Cpl had significant and similar weights. The optimal solution was found in Case 10 (α = 5°, x = 4), with corresponding evaluation indices of Ce=2.38, Cpl=11.45, and =0.68. This study\u27s findings are of great importance for enhancing the performance of marine gas turbines and reducing external infrared radiation

Directory of Open Access Journals

HRČAK - Portal of Croatian Scientific and Professional Journals

Hrčak - Portal of scientific journals of Croatia

Large Language Models are In-Context Semantic Reasoners rather than Symbolic Reasoners

Author: Li Jiaqi
Liang Yitao
Meng Fanxu
Tang Xiaojuan
Zhang Muhan
Zheng Zilong
Zhu Song-Chun
Publication venue
Publication date: 24/05/2023
Field of study

The emergent few-shot reasoning capabilities of Large Language Models (LLMs) have excited the natural language and machine learning community over recent years. Despite of numerous successful applications, the underlying mechanism of such in-context capabilities still remains unclear. In this work, we hypothesize that the learned \textit{semantics} of language tokens do the most heavy lifting during the reasoning process. Different from human's symbolic reasoning process, the semantic representations of LLMs could create strong connections among tokens, thus composing a superficial logical chain. To test our hypothesis, we decouple semantics from the language reasoning process and evaluate three kinds of reasoning abilities, i.e., deduction, induction and abduction. Our findings reveal that semantics play a vital role in LLMs' in-context reasoning -- LLMs perform significantly better when semantics are consistent with commonsense but struggle to solve symbolic or counter-commonsense reasoning tasks by leveraging in-context new knowledge. The surprising observations question whether modern LLMs have mastered the inductive, deductive and abductive reasoning abilities as in human intelligence, and motivate research on unveiling the magic existing within the black-box LLMs. On the whole, our analysis provides a novel perspective on the role of semantics in developing and evaluating language models' reasoning abilities. Code is available at {\url{https://github.com/XiaojuanTang/ICSR}}

arXiv.org e-Print Archive

JARVIS-1: Open-World Multi-task Agents with Memory-Augmented Multimodal Language Models

Author: Cai Shaofei
He Zhaofeng
Hou Jinbing
Jin Yonggang
Liang Yitao
Lin Haowei
Liu Anji
Ma Xiaojian
Wang Zihao
Yang Yaodong
Zhang Bowei
Zheng Zilong
Publication venue
Publication date: 30/11/2023
Field of study

Achieving human-like planning and control with multimodal observations in an open world is a key milestone for more functional generalist agents. Existing approaches can handle certain long-horizon tasks in an open world. However, they still struggle when the number of open-world tasks could potentially be infinite and lack the capability to progressively enhance task completion as game time progresses. We introduce JARVIS-1, an open-world agent that can perceive multimodal input (visual observations and human instructions), generate sophisticated plans, and perform embodied control, all within the popular yet challenging open-world Minecraft universe. Specifically, we develop JARVIS-1 on top of pre-trained multimodal language models, which map visual observations and textual instructions to plans. The plans will be ultimately dispatched to the goal-conditioned controllers. We outfit JARVIS-1 with a multimodal memory, which facilitates planning using both pre-trained knowledge and its actual game survival experiences. JARVIS-1 is the existing most general agent in Minecraft, capable of completing over 200 different tasks using control and observation space similar to humans. These tasks range from short-horizon tasks, e.g., "chopping trees" to long-horizon tasks, e.g., "obtaining a diamond pickaxe". JARVIS-1 performs exceptionally well in short-horizon tasks, achieving nearly perfect performance. In the classic long-term task of

\texttt{ObtainDiamondPickaxe}

, JARVIS-1 surpasses the reliability of current state-of-the-art agents by 5 times and can successfully complete longer-horizon and more challenging tasks. The project page is available at https://craftjarvis.org/JARVIS-1Comment: update project pag

arXiv.org e-Print Archive

The First Case of Ischemia-Free Kidney Transplantation in Humans

Author: Chen Chuanbao
Chen Guodong
Cui Jun
Guo Zhiyong
Han Ming
He Xiaoshun
Huang Jiefu
Huang Shanzhou
Jiao Xingyuan
Li Xianchang
Mo Liqiu
Tang Yunhua
van Leeuwen Otto B.
Wang Changxi
Wang Linhe
Wang Xiaoping
Yuan Xiaopeng
Zhang Zhiheng
Zhao Qiang
Zheng Yitao
Zhu Zebin
Publication venue: 'Frontiers Media SA'
Publication date: 11/12/2019
Field of study

Background: Ischemia-reperfusion injury (IRI) has been considered an inevitable event in organ transplantation since the first successful kidney transplant was performed in 1954. To avoid IRI, we have established a novel procedure called ischemia-free organ transplantation. Here, we describe the first case of ischemia-free kidney transplantation (IFKT). Materials and Methods: The kidney graft was donated by a 19-year-old brain-dead donor. The recipient was a 47-year-old man with end-stage diabetic nephropathy. The graft was procured, preserved, and implanted without cessation of blood supply using normothermic machine perfusion. Results: The graft appearance, perfusion flow, and urine production suggested that the kidney was functioning well-during the whole procedure. The creatinine dropped rapidly to normal range within 3 days post-transplantation. The levels of serum renal injury markers were low post-transplantation. No rejection or vascular or infectious complications occurred. The patient had an uneventful recovery. Conclusion: This paper marks the first case of IFKT in humans. This innovation may offer a unique solution to optimizing transplant outcomes in kidney transplantation

Proceedings - University of Groningen

University of Groningen

ARTS repository - University of Groningen

Dissertations of the University of Groningen

A predictive energy management strategy for multi-mode plug-in hybrid electric vehicles based on multi neural networks

Author: Chen
Chen
Chen
Chen
Du
Guang Li
Guo
Hannan
Jiangwei Shen
Li
Li
Li
Li
Liu
Liu
Liu
Lü
Morchid
Murphey
Namwook
Onori
Peng
Qiu
Rezaei
Rezaei
Schmid
Song
Tian
Tribioli
Uebel
Wang
Wu
Xie
Xie
Xie
Xu
Yan
Yang
Yitao Wu
Yonggang Liu
Yuan
Yuanjian Zhang
Zeng
Zhang
Zhang
Zheng Chen
Zhou
Zhou
Publication venue: 'Elsevier BV'
Publication date: 01/10/2020
Field of study

Online optimal energy management of plug-in hybrid electric vehicles has been continually investigated for better fuel economy. This paper proposed a predictive energy management strategy based on multi neural networks for a multi-mode plug-in hybrid electric vehicle. To attain it, firstly, the offline optimal results prepared for knowledge learning are derived by dynamic programming and Pontryagin’s minimum principle. Then, the mode recognition neural network is trained based on the optimal results of dynamic programming and the recurrent neural network is firstly exploited to realize online co-state estimation application. Consequently, the velocity prediction-based online model predictive control framework is established with the co-state correction and slacked constraints to solve the real-time optimal control sequence. A series of numerical simulation results validate that the optimal performance yielded from global optimal strategy can be exploited online to attain the satisfied cost reduction, compared with equivalent consumption minimum strategy, with the assistance of estimated real time co-state and slacked reference. In addition, the computation duration of proposed algorithm decreases by 23.40%, compared with conventional Pontryagin’s minimum principle-based model predictive control scheme, thereby proving its online application potential

Crossref

The University of Manchester - Institutional Repository

Queen Mary Research Online

Scoliotic posture as the initial symptom in adolescents with lumbar disc herniation: its curve pattern and natural history after lumbar discectomy

Author: A Kurihara
B Fakouri
Bangping Qian
BE Finneson
Bin Wang
CC Edwards
DA Spiegel
DY Lee
FC Pinto
GXZX Zheng
H Ishihara
H Matsui
JF Martinez-Lage
KS Suk
M Bezer
MA Adams
MC Battie
MP Lorio
N Bradbury
Qinghua Zhao
R Gennuso
R Kumar
RM Puno
RW Coonrad
S Ozgen
SE Borgesen
VM Mattila
Y Qiu
Yang Yu
Yitao Ding
Yong Qiu
Zezhang Zhu
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Abstract Background There have been few studies focusing on the curve pattern of scoliosis caused by lumbar disc herniation (LDH) in adolescents and the natural history of scoliosis after discectomy. The current study was carried out to identify the curve pattern of scoliosis and investigate the effect of posterior discectomy on the curve improvement in adolescents with LDH. Methods This review focused on a group of 26 adolescents with LDH who initially presented to our clinic for evaluation of scoliosis, followed by posterior discectomy between 2000 and 2009. Radiographic measurements included curve pattern, specific curve features, trunk shift, and sagittal profile. The correlation between the side of disc herniation and the direction of lumbosacral curve and the trunk shift was evaluated. Results A typical curve pattern was initially identified in all of the patients as a short lumbosacral curve accompanied with a long thoracic or thoracolumbar curve toward the opposite side. 23 of 26 patients (88.5%) had a trunk shift more than 2.0 cm away from the midline, showing a poor coronal balance. A relatively straight sagittal profile was noted in all the patients. 84.6% (22/26) patients had a disc herniation at the convex side of lumbosacral curve. Similarly, 73.1% (19/26) patients showed a trunk shift toward the opposite side of disc herniation. All of the patients had an marked curve improvement immediately after discectomy. In the 17 patients with a more than 2-year follow-up, only two had a residual lumbosacral curve greater than or equal to 20 degrees. The mean ODI improved from 21.4% before surgery to 7.3% at the final follow-up. Conclusions A short lumbosacral curve accompanied with a long thoracic or thoracolumbar curve toward the opposite side, and a relatively straight sagittal profile have been noted in all the patients. The direction of lumbosacral curve and trunk shift was related to the side of disc herniation. A majority of patients have a small curve size while assosiated with a significant coronal imbalance. Earlier decompression can provide a greater opportunity for spontaneous correction of scoliosis.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Nucleosome organizations in induced pluripotent stem cells reprogrammed from somatic cells belonging to three different germ layers

Author: A Gaspar-Maia
A Gore
A Soufi
B Zheng
C Jiang
Cizhong Jiang
DE Schones
DH Phanstiel
Gang Chang
Guitao Ding
Hong Wang
I Albert
J Chen
K Ahmed
K Takahashi
KB Jensen
L Kang
L Kang
MG Guenther
MH Chin
MJ Boland
MP Schnetz
R Lister
RD Hawkins
S Cuddapah
S Efroni
S Ruiz
SH Orkin
SH Park
Shaorong Gao
Shuai Gao
SM Hussein
T Aoi
TN Mavrich
U Ben-David
VB Teif
VB Teif
Weisheng Zheng
Wenqiang Liu
X Chen
Xiaobai Zhang
Xiaochen Kou
Xinfeng Hou
XY Zhao
Y Gao
Y Ohi
Y Shen
Yitao Tang
Yonghua Jiang
Yu Tao
Yueying Li
Z Ghosh
Z Kou
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref