11 research outputs found

    Reflection-Tuning: Data Recycling Improves LLM Instruction-Tuning

    Recent advancements in Large Language Models (LLMs) have expanded the horizons of natural language understanding and generation. Notably, the output control and alignment with the input of LLMs can be refined through instruction tuning. However, as highlighted in several studies, low-quality data in the training set are usually detrimental to instruction tuning, resulting in inconsistent or even misleading LLM outputs. We propose a novel method, termed "reflection-tuning," which addresses the problem by leveraging the self-improvement and judging capabilities of LLMs. This approach utilizes an oracle LLM to recycle the original training data by introspecting on and enhancing the quality of the instructions and responses in the data. Extensive experiments on widely used evaluation benchmarks show that LLMs trained with our recycled data consistently outperform those trained with existing datasets.
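    The recycling loop this abstract describes can be sketched as follows. This is a minimal, hypothetical illustration, not the authors' pipeline: `oracle_improve` stands in for a call to an oracle LLM that critiques and rewrites an (instruction, response) pair, and here it only normalizes whitespace so the loop runs end to end.

```python
def oracle_improve(instruction: str, response: str) -> tuple[str, str]:
    # Placeholder for an oracle-LLM call: a real implementation would prompt
    # the oracle to introspect on the pair (clarity, correctness, coverage)
    # and return an enhanced version. Here we only normalize whitespace.
    return instruction.strip(), response.strip()

def recycle_dataset(dataset: list[tuple[str, str]]) -> list[tuple[str, str]]:
    # Pass every (instruction, response) pair through the oracle, producing
    # a "recycled" dataset of the same size but (ideally) higher quality.
    return [oracle_improve(ins, res) for ins, res in dataset]

recycled = recycle_dataset([("  Summarize the paper.  ", " The paper shows... ")])
```

The point of the sketch is the data flow: the original pairs are never discarded, only rewritten by a stronger judge model before instruction tuning.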

    From Quantity to Quality: Boosting LLM Performance with Self-Guided Data Selection for Instruction Tuning

    In the realm of Large Language Models, the balance between instruction data quality and quantity has become a focal point. Recognizing this, we introduce a self-guided methodology for LLMs to autonomously discern and select cherry samples from vast open-source datasets, effectively minimizing manual curation and the potential cost of instruction tuning an LLM. Our key innovation, the Instruction-Following Difficulty (IFD) metric, emerges as a pivotal tool to identify discrepancies between a model's expected responses and its autonomous generation prowess. Through the application of IFD, cherry samples are pinpointed, leading to a marked improvement in model training efficiency. Empirical validations on renowned datasets like Alpaca and WizardLM underpin our findings: with a mere 10% of the conventional data input, our strategy shows improved results. This synthesis of self-guided cherry-picking and the IFD metric promises both efficient and resource-conscious optimization of LLMs. Codes, data, and models are available: https://github.com/MingLiiii/Cherry_LL
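    As the abstract describes it, IFD compares how hard a response is for the model with and without its instruction; a common formulation is the ratio of the conditioned per-token loss to the unconditioned one. The sketch below assumes those two loss values have already been computed (e.g. from a causal LM's cross-entropy); the function names, the 10% fraction, and the "select the highest-IFD samples" rule are illustrative, not the paper's exact thresholds.

```python
def ifd_score(conditioned_loss: float, direct_loss: float) -> float:
    # IFD = loss of response given instruction / loss of response alone.
    # A higher ratio suggests the instruction helps the model less, i.e.
    # the sample is harder to follow and thus more informative to train on.
    return conditioned_loss / direct_loss

def select_cherry_samples(samples: list[dict], fraction: float = 0.10) -> list[dict]:
    # Keep the top `fraction` of samples ranked by IFD (highest first).
    ranked = sorted(samples, key=lambda s: s["ifd"], reverse=True)
    k = max(1, int(len(samples) * fraction))
    return ranked[:k]
```

With this, the "10% of conventional data" result corresponds to calling `select_cherry_samples(dataset, fraction=0.10)` after scoring every pair.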

    Stemphylium lycopersici Nep1-like Protein (NLP) Is a Key Virulence Factor in Tomato Gray Leaf Spot Disease

    The fungus Stemphylium lycopersici (S. lycopersici) is an economically important plant pathogen that causes gray leaf spot disease in tomato. However, functional genomic studies in S. lycopersici are lacking, and the factors influencing its pathogenicity remain largely unknown. Here, we present the first example of genetic transformation and targeted gene replacement in S. lycopersici. We functionally analyzed the NLP gene, which encodes a necrosis- and ethylene-inducing peptide 1 (Nep1)-like protein (NLP). We found that targeted disruption of the NLP gene in S. lycopersici significantly compromised its virulence on tomato. Moreover, our data suggest that NLP affects S. lycopersici conidiospore production and weakly affects its adaptation to osmotic and oxidative stress. Interestingly, we found that NLP suppressed the production of reactive oxygen species (ROS) in tomato leaves during S. lycopersici infection. Further, expressing the fungal NLP in tomato resulted in constitutive transcription of immune-responsive genes and inhibited plant growth. Through gene manipulation, we demonstrated the function of NLP in S. lycopersici virulence and development. Our work provides a paradigm for functional genomics studies in a non-model fungal pathogen system.

    A Snapshot of the Emerging Tomato Genome Sequence

    The genome of tomato (Solanum lycopersicum L.) is being sequenced by an international consortium of 10 countries (Korea, China, the United Kingdom, India, the Netherlands, France, Japan, Spain, Italy, and the United States) as part of the larger "International Solanaceae Genome Project (SOL): Systems Approach to Diversity and Adaptation" initiative. The tomato genome sequencing project uses an ordered bacterial artificial chromosome (BAC) approach to generate a high-quality tomato euchromatic genome sequence for use as a reference genome for the Solanaceae and euasterids. Sequence is deposited at GenBank and at the SOL Genomics Network (SGN). Currently, there are around 1000 BACs finished or in progress, representing more than a third of the projected euchromatic portion of the genome. An annotation effort is also underway by the International Tomato Annotation Group. The expected number of genes in the euchromatin is ∼40,000, based on an estimate from a preliminary annotation of 11% of the finished sequence. Here, we present this first snapshot of the emerging tomato genome and its annotation, a short comparison with potato (Solanum tuberosum L.) sequence data, and the tools available for researchers to exploit this new resource. In the future, whole-genome shotgun techniques will be combined with the BAC-by-BAC approach to cover the entire tomato genome. The high-quality reference euchromatic tomato sequence is expected to be near completion by 2010.