Search CORE

10 research outputs found

CycleAlign: Iterative Distillation from Black-box LLM to White-box Models for Better Human Alignment

Author: Chen Changyu
Gao Xing
Hong Jixiang
Tu Quan
Yan Rui
Zhang Ji
Publication venue
Publication date: 24/10/2023
Field of study

Language models trained on large-scale corpus often generate content that is harmful, toxic, or contrary to human preferences, making their alignment with human values a critical concern. Reinforcement learning from human feedback (RLHF) with algorithms like PPO is a prevalent approach for alignment but is often complex, unstable, and resource-intensive. Recently, ranking-based alignment methods have emerged, offering stability and effectiveness by replacing the RL framework with supervised fine-tuning, but they are costly due to the need for annotated data. Considering that existing large language models (LLMs) like ChatGPT are already relatively well-aligned and cost-friendly, researchers have begun to align the language model with human preference from AI feedback. The common practices, which unidirectionally distill the instruction-following responses from LLMs, are constrained by their bottleneck. Thus we introduce CycleAlign to distill alignment capabilities from parameter-invisible LLMs (black-box) to a parameter-visible model (white-box) in an iterative manner. With in-context learning (ICL) as the core of the cycle, the black-box models are able to rank the model-generated responses guided by human-craft instruction and demonstrations about their preferences. During iterative interaction, the white-box models also have a judgment about responses generated by them. Consequently, the agreement ranking could be viewed as a pseudo label to dynamically update the in-context demonstrations and improve the preference ranking ability of black-box models. Through multiple interactions, the CycleAlign framework could align the white-box model with the black-box model effectively in a low-resource way. Empirical results illustrate that the model fine-tuned by CycleAlign remarkably exceeds existing methods, and achieves the state-of-the-art performance in alignment with human value

arXiv.org e-Print Archive

The 5th International Conference on Biomedical Engineering and Biotechnology (ICBEB 2016)

Author: Ailong Cai
Baiying Lei
Baiying Lei
Baodong Gai
Baoliang Sun
Bin Wang
Bin Yan
Binquan Li
Changyu Tu
Chengxin Yan
Chiehhsuan Wei
Chunlan Yang
Chunlan Yang
Chunlan Yang
Cong Xu
Daisheng Luo
Daisheng Luo
Dong Ni
Dongyan Yang
Fang Han
Farnaz Farokhian
Farnaz Farokhian
Feng Shi
Feng Zhao
Fuwen Lai
Guanyu Li
Guixue Liu
Haibing Bu
Haijun Lei
Haizhu Xie
Hao Fang
Hasan Demirel
Hua Zhong
Huihong Gong
Huihui Yang
Iman Beheshti
Ioannis Manousakas
Jian Zhang
Jianping Yin
Jie Yang
Jie Yang
Jie Yang
Jiechuan Ren
Jiejue Ma
Jing Xiong
Jingke Zhang
Jingwen Zhuang
Junghua Ho
Junzheng Zheng
Juyoung Park
Ke Gan
Ke Gan
Keming Mao
Keming Mao
Kuan Li
Kyungtae Kang
Lanhua Zhang
Lei Li
Lili Zhao
Linyuan Wang
LiSha Tan
Manning Wang
Mao Wang
Mei Bai
Meixia Su
Minghua Zhao
Mingwu Jin
Mingyue Ding
Nan Fu
Nan Fu
Nan Fu
Ning Mao
Ping Sun
Preetha Phillips
Qi Mao
Qiang Liu
Qingchun Li
Qun Wang
Qun Wang
Rongmao Li
Rongmao Li
Shaode Yu
Shaode Yu
Shaode Yu
Shaomao Lv
Shaoqing Wang
Shaowu Li
Shaoyin Duan
Shengli Li
Shihou Sheng
Shuguang Zhao
Shuicai Wu
Shuicai Wu
Shuihua Wang
Shuo Li
Shuo Li
Shuo Li
Shuwen Chen
Sidan Du
Simin Lin
Siping Chen
Song Gao
Soyeun Kim
Tao Gong
Tao Gong
Tao Gong
Tianfu Wang
Tianxu Zhang
Wan Li
Wan Li
Wan Li
Wangsheng Lu
Wei Liu
Wei Peng
Wensheng Li
Wenyu Liang
Xianbin Cheng
Xiancun Yang
Xiaohui Hu
Xiaolei Song
Xiaolong Sun
Xin Zhang
Xin Zhang
Xinnuan Mu
Xuming Zhang
Y. F. Li
Yafeng Zhan
Yan Zhang
Yanchun Zhu
Yanchun Zhu
Yanchun Zhu
Yanhong Zhou
Yanhui Ding
Yaoqin Xie
Yaoqin Xie
Yaoqin Xie
Yaping Wang
Yifei Liu
Yijie Ren
Yin Chang
Yingnan Nie
Yingnan Nie
Yixian Liu
Yongchao Wang
Yonghong Liu
Yongxin Zhang
Yudong Zhang
Yulu Song
Yun Liang
Yupei Chen
Yuxiang Wu
Zeyuan Lu
Zhang Yang
Zhen Yu
Zhengchao Dong
Zhenghao Shi
Zhenghua Huang
Zhijian Song
ZhiJun Gao
Zhimin Chen
Zhuofu Deng
Publication venue: Springer Nature
Publication date: 01/01/2016
Field of study

Springer - Publisher Connector

Institutional Repository of Yantai Institute of Coastal Zone Research, CAS

PolyaryleneEther Nitrile and Barium TitanateNanocomposite Plasticized by CarboxylatedZinc Phthalocyanine Buffer

Author: Changyu Liu
Chenchen Liu
Ling Tu
Renbo Wei
Shuning Liu
Xiaobo Liu
Yong You
Publication venue: 'MDPI AG'
Publication date: 01/03/2019
Field of study

Barium titanate (BT) and polyarylene ether nitrile (PEN) nanocomposites with enhanced dielectric properties were obtained by using carboxylatedzinc phthalocyanine (ZnPc-COOH) buffer as the plasticizer. Carboxylated zinc phthalocyanine, prepared through hydrolyzing ZnPc in NaOH solution, reacted with the hydroxyl groups on the peripheral of hydrogen peroxide treated BT (BT-OH) yielding core-shell structured BT@ZnPc. Thermogravimetric analysis (TGA), transmission electron microscopy (TEM), TEM energy dispersive spectrometer mapping, scanning electron microscopy (SEM), X-ray diffraction (XRD), X-ray photoelectron spectroscopy (XPS), and Fourier transform infrared (FTIR) demonstrated successful preparation of BT@ZnPc. The fabricated BT@ZnPc was incorporated into the PEN matrix through the solution casting method. Rheological measurements demonstrated that the ZnPc-COOH buffer can improve the compatibility between BT and PEN effectively. With the existence of the ZnPc-COOH buffer, the prepared BT@ZnPc/PEN nanocomposites exhibit a high dielectric constant of 5.94 and low dielectric loss (0.016 at 1000 Hz). BT@ZnPc/PEN dielectric composite films can be easily prepared, presenting great application prospects in the field of organic film capacitors

Multidisciplinary Digital Publishing Institute

Directory of Open Access Journals

Polyarylene Ether Nitrile and Barium Titanate Nanocomposite Plasticized by Carboxylated Zinc Phthalocyanine Buffer

Author: Changyu Liu
Chenchen Liu
Chopra
Li
Ling Tu
Renbo Wei
Shuning Liu
Xiaobo Liu
Yong You
Publication venue: 'MDPI AG'
Publication date
Field of study

Crossref

Effects of Applying Different Organic Materials on Grain Yield and Soil Fertility in a Double-Season Rice Cropping System

Author: Bin Liao
Changyu Fang
Chao Li
Guozhu Ma
Jing Yang
Mohamed S. Sheteiwy
Naimei Tu
Sichao Liu
Zhenxie Yi
Publication venue: 'MDPI AG'
Publication date: 13/11/2022
Field of study

Double-cropping rice cultivation reduces soil fertility, and the extensive use of chemical fertilizers has harmful effects on both the environment and grain yield. The application of organic materials could be used as a practical strategy to maintain soil fertility and improve grain yield in a double-season rice cropping system. For this purpose, field experiments with six growing seasons over three years, from 2016 to 2018, were conducted to assess the effects of five organic materials (biochar, Chinese milk vetch, rice straw, rapeseed cake fertilizer, and manure) on the grain yield and soil fertility, aiming to save about 25% of the chemical nitrogen (N) fertilizer required for all rice growing stages. The result showed that, compared with CK (the most common dose of fertilizer in this study region; 100% chemical fertilizer without organic fertilizer), the grain yield and soil fertility of double-cropped rice were increased after applying organic fertilizers for three consecutive years. Specifically, the CRC treatment (Chinese milk vetch (10.77 t ha−1 in fresh)/rice straw (26.51 t ha−1 in fresh) + 75% chemical fertilizer) showed significantly higher rates of effective panicles (4.65–10.92%) and annual grain yield (8.00–8.82%). The total N, total phosphorus (P), total potassium (K), alkaline N, and available P content in the CRC soil were significantly increased by 11.85%, 12.22%, 15.08%, 23.32%, and 41.04%, respectively, relative to CK. The decomposition of the applied Chinese milk vetch and rice straw combined with 75% chemical fertilizer resulted in more soil humus (9.50 g kg−1), humic acid (3.19 g kg−1), fulvic acid (3.26 g kg−1), and active organic carbon (5.78 g kg−1) and a significantly higher carbon pool management index (13.5%), as well as significantly higher soil urease activity (18.10%) and acid phosphatase activity (17.64%). Therefore, in this study, Chinese milk vetch (10.77 t ha−1 in fresh) in the early rice season/rice straw (26.51 t ha−1 fresh) in the late rice season + 75% chemical fertilizer treatment was the optimal dose for the double-season rice cropping system. It resulted in higher rice yields and has the potential to be used for more sustainable soil fertility

Multidisciplinary Digital Publishing Institute