Search CORE

92 research outputs found

GROVE: A Retrieval-augmented Complex Story Generation Framework with A Forest of Evidence

Author: Huang Zhen
Li Dongsheng
Shi Yanqi
Tian Zhiliang
Wen Zhihua
Wu Wei
Yang Yuxin
Publication venue
Publication date: 23/10/2023
Field of study

Conditional story generation is significant in human-machine interaction, particularly in producing stories with complex plots. While Large language models (LLMs) perform well on multiple NLP tasks, including story generation, it is challenging to generate stories with both complex and creative plots. Existing methods often rely on detailed prompts to guide LLMs to meet target conditions, which inadvertently restrict the creative potential of the generated stories. We argue that leveraging information from exemplary human-written stories facilitates generating more diverse plotlines. Delving deeper into story details helps build complex and credible plots. In this paper, we propose a retrieval-au\textbf{G}mented sto\textbf{R}y generation framework with a f\textbf{O}rest of e\textbf{V}id\textbf{E}nce (GROVE) to enhance stories' complexity. We build a retrieval repository for target conditions to produce few-shot examples to prompt LLMs. Additionally, we design an ``asking-why'' prompting scheme that extracts a forest of evidence, providing compensation for the ambiguities that may occur in the generated story. This iterative process uncovers underlying story backgrounds. Finally, we select the most fitting chains of evidence from the evidence forest and integrate them into the generated story, thereby enhancing the narrative's complexity and credibility. Experimental results and numerous examples verify the effectiveness of our method.Comment: Findings of EMNLP 202

arXiv.org e-Print Archive

Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

Author: Lee Katherine
Li Wei
Liu Peter J.
Matena Michael
Narang Sharan
Raffel Colin
Roberts Adam
Shazeer Noam
Zhou Yanqi
Publication venue
Publication date: 28/07/2020
Field of study

Transfer learning, where a model is first pre-trained on a data-rich task before being fine-tuned on a downstream task, has emerged as a powerful technique in natural language processing (NLP). The effectiveness of transfer learning has given rise to a diversity of approaches, methodology, and practice. In this paper, we explore the landscape of transfer learning techniques for NLP by introducing a unified framework that converts all text-based language problems into a text-to-text format. Our systematic study compares pre-training objectives, architectures, unlabeled data sets, transfer approaches, and other factors on dozens of language understanding tasks. By combining the insights from our exploration with scale and our new ``Colossal Clean Crawled Corpus'', we achieve state-of-the-art results on many benchmarks covering summarization, question answering, text classification, and more. To facilitate future work on transfer learning for NLP, we release our data set, pre-trained models, and code.Comment: Final version as published in JML

arXiv.org e-Print Archive

SpikingJelly: An open-source machine learning infrastructure platform for spike-based intelligence

Author: Chen Ding
Chen Yanqi
Ding Jianhao
Fang Wei
Huang Liwei
Li Guoqi
Masquelier Timothée
Tian Yonghong
Yu Zhaofei
Zhou Huihui
Publication venue
Publication date: 25/10/2023
Field of study

Spiking neural networks (SNNs) aim to realize brain-inspired intelligence on neuromorphic chips with high energy efficiency by introducing neural dynamics and spike properties. As the emerging spiking deep learning paradigm attracts increasing interest, traditional programming frameworks cannot meet the demands of the automatic differentiation, parallel computation acceleration, and high integration of processing neuromorphic datasets and deployment. In this work, we present the SpikingJelly framework to address the aforementioned dilemma. We contribute a full-stack toolkit for pre-processing neuromorphic datasets, building deep SNNs, optimizing their parameters, and deploying SNNs on neuromorphic chips. Compared to existing methods, the training of deep SNNs can be accelerated

11\times

, and the superior extensibility and flexibility of SpikingJelly enable users to accelerate custom models at low costs through multilevel inheritance and semiautomatic code generation. SpikingJelly paves the way for synthesizing truly energy-efficient SNN-based machine intelligence systems, which will enrich the ecology of neuromorphic computing.Comment: Accepted in Science Advances (https://www.science.org/doi/10.1126/sciadv.adi1480

arXiv.org e-Print Archive

Reverse regulation of soluble receptor for advanced glycation end products and proinflammatory factor resistin and S100A12 in Kawasaki disease

Author: Chunhong Xie
Fangqi Gong
Qing Zhang
Songling Fu
Wei Wang
Yanqi Qi
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2012
Field of study

Crossref

Springer - Publisher Connector

Recommended from our members

Periplasmic biomineralization for semi-artificial photosynthesis

Author: Chen Si
Feng Wei
Gao Xiang
Hu Huicheng
Jiang Yuanwen
Lin Yiliang
Luo Yanqi
Prominski Aleksander
Shi Fengyuan
Shi Jiuyun
Talapin Dmitri V.
Tian Bozhi
Xiong Wei
Yang Bin
Yue Jiping
Zhou Chenkun
Publication venue
Publication date: 22/07/2023
Field of study

Semiconductor-based biointerfaces are typically established either on the surface of the plasma membrane or within the cytoplasm. In Gram-negative bacteria, the periplasmic space, characterized by its confinement and the presence of numerous enzymes and peptidoglycans, offers additional opportunities for biomineralization, allowing for nongenetic modulation interfaces. We demonstrate semiconductor nanocluster precipitation containing single- and multiple-metal elements within the periplasm, as observed through various electron- and x-ray-based imaging techniques. The periplasmic semiconductors are metastable and display defect-dominant fluorescent properties. Unexpectedly, the defect-rich (i.e., the low-grade) semiconductor nanoclusters produced in situ can still increase adenosine triphosphate levels and malate production when coupled with photosensitization. We expand the sustainability levels of the biohybrid system to include reducing heavy metals at the primary level, building living bioreactors at the secondary level, and creating semi-artificial photosynthesis at the tertiary level. The biomineralization-enabled periplasmic biohybrids have the potential to serve as defect-tolerant platforms for diverse sustainable applications

Knowledge UChicago

Conditional Adapters: Parameter-efficient Transfer Learning with Fast Inference

Author: Ainslie Joshua
Bai Junwen
Brahma Siddhartha
Chang Ming-Wei
Du Nan
Lee Kenton
Lei Tao
Li Bo
Wu Yuexin
Zhang Yu
Zhao Vincent Y.
Zhou Yanqi
Publication venue
Publication date: 10/04/2023
Field of study

We propose Conditional Adapter (CoDA), a parameter-efficient transfer learning method that also improves inference efficiency. CoDA generalizes beyond standard adapter approaches to enable a new way of balancing speed and accuracy using conditional computation. Starting with an existing dense pretrained model, CoDA adds sparse activation together with a small number of new parameters and a light-weight training phase. Our experiments demonstrate that the CoDA approach provides an unexpectedly efficient way to transfer knowledge. Across a variety of language, vision, and speech tasks, CoDA achieves a 2x to 8x inference speed-up compared to the state-of-the-art Adapter approach with moderate to no accuracy loss and the same parameter efficiency

arXiv.org e-Print Archive