71 research outputs found

WizardLM: Empowering Large Language Models to Follow Complex Instructions

Author: Feng Jiazhan
Geng Xiubo
Jiang Daxin
Sun Qingfeng
Tao Chongyang
Xu Can
Zhao Pu
Zheng Kai
Publication date: 24/04/2023

Training large language models (LLM) with open-domain instruction following data brings colossal success. However, manually creating such instruction data is very time-consuming and labor-intensive. Moreover, humans may struggle to produce high-complexity instructions. In this paper, we show an avenue for creating large amounts of instruction data with varying levels of complexity using LLM instead of humans. Starting with an initial set of instructions, we use our proposed Evol-Instruct to rewrite them step by step into more complex instructions. Then, we mix all generated instruction data to fine-tune LLaMA. We call the resulting model WizardLM. Human evaluations on a complexity-balanced test bed show that instructions from Evol-Instruct are superior to human-created ones. By analyzing the human evaluation results of the high complexity part, we demonstrate that outputs from our WizardLM model are preferred to outputs from OpenAI ChatGPT. Even though WizardLM still lags behind ChatGPT in some aspects, our findings suggest that fine-tuning with AI-evolved instructions is a promising direction for enhancing large language models. Our codes and generated data are public at https://github.com/nlpxucan/WizardLM
Comment: large language model, instruction fine-tun

arXiv.org e-Print Archive

Synergistic Interplay between Search and Large Language Models for Information Retrieval

Author: Feng Jiazhan
Geng Xiubo
Jiang Daxin
Long Guodong
Shen Tao
Tao Chongyang
Xu Can
Zhao Dongyan
Publication date: 12/12/2023

Information retrieval (IR) plays a crucial role in locating relevant resources from vast amounts of data, and its applications have evolved from traditional knowledge bases to modern retrieval models (RMs). The emergence of large language models (LLMs) has further revolutionized the IR field by enabling users to interact with search systems in natural languages. In this paper, we explore the advantages and disadvantages of LLMs and RMs, highlighting their respective strengths in understanding user-issued queries and retrieving up-to-date information. To leverage the benefits of both paradigms while circumventing their limitations, we propose InteR, a novel framework that facilitates information refinement through synergy between RMs and LLMs. InteR allows RMs to expand knowledge in queries using LLM-generated knowledge collections and enables LLMs to enhance prompt formulation using retrieved documents. This iterative refinement process augments the inputs of RMs and LLMs, leading to more accurate retrieval. Experiments on large-scale retrieval benchmarks involving web search and low-resource retrieval tasks demonstrate that InteR achieves overall superior zero-shot retrieval performance compared to state-of-the-art methods, even those using relevance judgment. Source code is available at https://github.com/Cyril-JZ/InteR
Comment: Pre-print. Work in progres

arXiv.org e-Print Archive

LexMAE: Lexicon-Bottlenecked Pretraining for Large-Scale Retrieval

Author: Geng Xiubo
Huang Xiaolong
Jiang Daxin
Jiao Binxing
Shen Tao
Tao Chongyang
Xu Can
Yang Linjun
Publication date: 04/06/2023

In large-scale retrieval, the lexicon-weighting paradigm, which learns weighted sparse representations in vocabulary space, has shown promising results with high quality and low latency. Although it deeply exploits the lexicon-representing capability of pre-trained language models, a crucial gap remains between language modeling and lexicon-weighting retrieval: the former prefers certain or low-entropy words, whereas the latter favors pivot or high-entropy words. This gap is the main barrier to lexicon-weighting performance for large-scale retrieval. To bridge it, we propose a brand-new pre-training framework, lexicon-bottlenecked masked autoencoder (LexMAE), to learn importance-aware lexicon representations. Essentially, we present a lexicon-bottlenecked module between a normal language modeling encoder and a weakened decoder, where a continuous bag-of-words bottleneck is constructed to learn a lexicon-importance distribution in an unsupervised fashion. The pre-trained LexMAE is readily transferred to lexicon-weighting retrieval via fine-tuning. On the ad-hoc retrieval benchmark, MS-Marco, it achieves 42.6% MRR@10 with 45.8 QPS for the passage dataset and 44.4% MRR@100 with 134.8 QPS for the document dataset, on a CPU machine. LexMAE also shows state-of-the-art zero-shot transfer capability on the BEIR benchmark with 12 datasets.
Comment: Appeared at ICLR 202

arXiv.org e-Print Archive

Major Ecosystems in China: Dynamics and Challenges for Sustainable Management

Author: APJ Mol
B Güneralp
B Liu
B Noble
B Xu
B Zhang
BJ Fu
BJ Fu
BJ Fu
BL Turner
Bojie Fu
BS Cui
C Hicks
C Le
CH Yan
CL Fang
CN Shi
CT Wang
CWH Lo
D Lu
D Zhu
Department of Forest Resources Management of the State Forestry Administration (DFRMSFA)
DM Zhou
ED Ongley
F Li
FC Shi
FL He
FN He
FS Shi
GD Liu
GQ Bull
GQ Tian
GX Wang
H Guo
H He
H Ren
H Wang
H Wang
H Zhang
H Zhang
HH Zhou
HJ Yang
HW Zhang
HX Wang
J Kauffman
J Wang
J Wu
J Zhang
JF Zhao
JG Han
JG Liu
JG Liu
JH Qu
JK Gao
JN Kittinger
JP Tao
JQ Lei
JT Xu
JW Liu
JW Luan
JY Zhao
JZ Yan
JZ Zhao
K Lei
K Xu
K Zhou
KM Zhang
KQ Chen
KS Song
L Gu
L Kang
LL Jiang
LL Li
LP Song
M Li
M Shao
M Wikelski
MQ Pan
MT Bennett
N Guo
NP He
P Gong
PE McShane
PJ Shi
PR Armsworth
QF Xu
QG Wang
QH Chen
QK Wang
QY Han
QZ Gao
R Costanza
Ranhao Sun
RB Harris
RC Dong
RJ Wang
RS Yin
S Démurger
SJ Wang
SJ Xu
SL Peng
SQ Zhang
SW Wan
SX Cao
SZ Tong
T Tao
Wei Wei
WH Li
WH Ma
WJ Li
WJ Mitsch
WQ Zhu
X Li
XA Zuo
XH Zhang
Xiubo Yu
XL Xu
XL Zhang
XL Zhang
XP Wang
XP Zou
XT Li
XY Zhang
XY Zhang
XY Zong
Y Feng
Y Feng
Y Xiong
YC Li
YG Yuan
YH Lü
YH Lü
Yihe Lü
YL Xu
YQ Li
YQ Zhu
YX Wang
Z Liu
Z Shi
Z Xie
ZF Zhuang
ZL Wang
ZM Feng
ZY Shen
Publication venue: 'Springer Science and Business Media LLC'

Crossref

Introduction to the Task Force's Work

Author: Chen Yiyu
Jiang Luguang
Pittock James
Yu Xiubo
Publication venue: 'Springer Fachmedien Wiesbaden GmbH'
Publication date: 10/12/2015

The Australian National University