8 research outputs found

    When Do Program-of-Thoughts Work for Reasoning?

    Full text link
    The reasoning capabilities of Large Language Models (LLMs) play a pivotal role in embodied artificial intelligence. Although effective methods exist, such as program-of-thought prompting, which has LLMs use a programming language to tackle complex reasoning tasks, the specific impact of code data on the improvement of reasoning capabilities remains under-explored. To address this gap, we propose the complexity-impacted reasoning score (CIRS), which combines structural and logical attributes, to measure the correlation between code and reasoning abilities. Specifically, we use the abstract syntax tree to encode structural information and calculate logical complexity by considering both difficulty and cyclomatic complexity. Through an empirical analysis, we find that LLMs cannot learn or understand code data of arbitrary complexity: an optimal level of complexity is critical to improving reasoning abilities through program-aided prompting. We then design an auto-synthesizing and stratifying algorithm and apply it to instruction generation for mathematical reasoning and to code data filtering for code generation tasks. Extensive results demonstrate the effectiveness of our proposed approach. Code will be integrated into the EasyInstruct framework at https://github.com/zjunlp/EasyInstruct.
    Comment: Work in progress.
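    As a rough illustration of the kind of AST-based measurement the abstract describes (not the paper's actual CIRS formula, whose difficulty term and weighting are defined in the paper), a minimal Python sketch might score a code snippet by combining tree depth with a McCabe-style cyclomatic count:

        import ast

        # Illustrative only: the weights and the blend below are placeholders,
        # not the authors' CIRS implementation.
        BRANCHES = (ast.If, ast.IfExp, ast.For, ast.While, ast.BoolOp,
                    ast.ExceptHandler, ast.comprehension)

        def cyclomatic_complexity(tree: ast.AST) -> int:
            """McCabe-style count: one path plus one per branching construct."""
            return 1 + sum(isinstance(node, BRANCHES) for node in ast.walk(tree))

        def max_depth(node: ast.AST, depth: int = 0) -> int:
            """Depth of the AST, a rough proxy for structural complexity."""
            children = list(ast.iter_child_nodes(node))
            if not children:
                return depth
            return max(max_depth(child, depth + 1) for child in children)

        def toy_reasoning_score(code: str) -> float:
            """Hypothetical stand-in for CIRS: blend structural and logical signals."""
            tree = ast.parse(code)
            return 0.5 * max_depth(tree) + 0.5 * cyclomatic_complexity(tree)

        print(toy_reasoning_score("for i in range(3):\n    if i % 2:\n        print(i)"))

    A stratifying algorithm like the one the abstract mentions could then bucket training samples by such a score and keep only the middle band.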

    OceanGPT: A Large Language Model for Ocean Science Tasks

    Full text link
    Ocean science, which delves into the oceans that are reservoirs of life and biodiversity, is of great significance given that oceans cover over 70% of our planet's surface. Recently, advances in Large Language Models (LLMs) have transformed scientific research paradigms. Despite their success in other domains, current LLMs often fall short of the needs of domain experts such as oceanographers, and the potential of LLMs for ocean science remains under-explored. The intrinsic reason may be the immense and intricate nature of ocean data, as well as the need for greater granularity and richness of knowledge. To alleviate these issues, we introduce OceanGPT, the first-ever LLM in the ocean domain, which is adept at various ocean science tasks. We propose DoInstruct, a novel framework that automatically obtains a large volume of ocean-domain instruction data by generating instructions through multi-agent collaboration. Additionally, we construct the first oceanography benchmark, OceanBench, to evaluate the capabilities of LLMs in the ocean domain. Through comprehensive experiments, OceanGPT not only shows a higher level of knowledge expertise on ocean science tasks but also gains preliminary embodied intelligence capabilities in ocean technology. Code, data, and checkpoints will soon be available at https://github.com/zjunlp/KnowLM.
    Comment: Work in progress. Project website: https://zjunlp.github.io/project/OceanGPT
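    The abstract leaves DoInstruct's mechanics to the paper; purely as a hypothetical sketch of multi-agent instruction generation in general (the agent roles, prompts, and call_llm helper below are all invented for illustration and are not the paper's algorithm), the idea could look like:

        # Hypothetical sketch: one "expert" agent drafts instruction-answer pairs
        # per ocean subdomain, a second agent filters them for quality.
        SUBDOMAINS = ["physical oceanography", "marine biology", "ocean engineering"]

        def call_llm(prompt: str) -> str:
            """Placeholder for any LLM API call; swap in a real client here."""
            return "yes" if prompt.startswith("Is this pair") else "Instruction: ... Answer: ..."

        def generate_instructions(per_domain: int = 5) -> list[dict]:
            corpus = []
            for domain in SUBDOMAINS:
                for _ in range(per_domain):
                    draft = call_llm(f"As a {domain} expert, write one instruction "
                                     f"and its answer for training an ocean LLM.")
                    verdict = call_llm(f"Is this pair factually sound? yes/no:\n{draft}")
                    if verdict.strip().lower().startswith("yes"):
                        corpus.append({"domain": domain, "pair": draft})
            return corpus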

    EasyEdit: An Easy-to-use Knowledge Editing Framework for Large Language Models

    Full text link
    Large Language Models (LLMs) usually suffer from knowledge cutoff or fallacy issues: they are unaware of unseen events or generate text with incorrect facts owing to outdated or noisy data. To this end, many knowledge editing approaches for LLMs have emerged, aiming to subtly inject or edit updated knowledge, or to adjust undesired behavior, while minimizing the impact on unrelated inputs. Nevertheless, due to significant differences among knowledge editing methods and variations in task setups, no standard implementation framework is available to the community, which hinders practitioners from applying knowledge editing in applications. To address these issues, we propose EasyEdit, an easy-to-use knowledge editing framework for LLMs. It supports various cutting-edge knowledge editing approaches and can be readily applied to many well-known LLMs such as T5, GPT-J, and LLaMA. Empirically, we report knowledge editing results on LLaMA-2 with EasyEdit, demonstrating that knowledge editing surpasses traditional fine-tuning in terms of reliability and generalization. We have released the source code on GitHub at https://github.com/zjunlp/EasyEdit, along with Google Colab tutorials and comprehensive documentation for beginners. We also provide an online system for real-time knowledge editing and a demo video at http://knowlm.zjukg.cn/easyedit.mp4.
    Comment: The project website is https://github.com/zjunlp/EasyEdit
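    For concreteness, here is a usage sketch in the shape of the examples in the EasyEdit README (paraphrased from memory, so treat the exact argument names and hyperparameter paths as assumptions that may differ across versions):

        from easyeditor import BaseEditor, ROMEHyperParams

        # Load hyperparameters for one editing method (ROME) and build an editor.
        # The YAML path follows the README layout and may vary by release.
        hparams = ROMEHyperParams.from_hparams('./hparams/ROME/llama-7b')
        editor = BaseEditor.from_hparams(hparams)

        # Rewrite one fact in the model; the counterfactual target is the
        # standard demo pattern. `metrics` reports reliability/generalization.
        metrics, edited_model, _ = editor.edit(
            prompts=['Who was the designer of the Eiffel Tower?'],
            ground_truth=['Gustave Eiffel'],
            target_new=['Albert Einstein'],
            subject=['Eiffel Tower'],
        )

    Swapping the hyperparameter class and path is, per the abstract's framing, how the framework exposes different editing methods behind one interface.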

    Dart: A Framework for Grid-Based Database Resource Access and Discovery

    No full text
    The Data Grid serves as a data management solution widely adopted by existing data-intensive Grid applications. However, we argue that the core Grid data management demands can be better satisfied with the introduction of databases. In this paper, we provide a database-oriented resource management framework intended to integrate database resources with the Grid infrastructure. We begin by outlining the proposed Database Grid architecture and then focus on two base-level services, remote database access and database discovery, which we believe should be established first as a necessary foundation for other, more application-driven, high-level database services. We discuss how the design principles behind these base-level services can adapt to the characteristics of the Grid environment and how the services can be nested within the OGSA paradigm.

    DeepKE: A Deep Learning Based Knowledge Extraction Toolkit for Knowledge Base Population

    Full text link
    We present DeepKE, an open-source and extensible knowledge extraction toolkit supporting complicated low-resource, document-level, and multimodal scenarios in knowledge base population. DeepKE implements various information extraction tasks, including named entity recognition, relation extraction, and attribute extraction. With a unified framework, DeepKE allows developers and researchers to customize datasets and models to extract information from unstructured data according to their requirements. Specifically, DeepKE not only provides various functional modules and model implementations for different tasks and scenarios but also organizes all components within consistent frameworks to maintain sufficient modularity and extensibility. We release the source code on GitHub at https://github.com/zjunlp/DeepKE, with Google Colab tutorials and comprehensive documentation for beginners. We also present an online system at http://deepke.openkg.cn/EN/re_doc_show.html for real-time extraction across tasks, along with a demo video.
    Comment: Work in progress. The project website is http://deepke.zjukg.cn
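    As a toy illustration of the structured outputs these three tasks produce (plain Python data, not DeepKE's actual API; see the project's Colab tutorials for real usage):

        # Toy example of the three task outputs DeepKE targets. The values are
        # invented for illustration; a real toolkit predicts them from text.
        sentence = "Marie Curie was born in Warsaw and won the Nobel Prize."

        # Named entity recognition: spans tagged with entity types.
        entities = [("Marie Curie", "PERSON"), ("Warsaw", "LOCATION"),
                    ("Nobel Prize", "AWARD")]

        # Relation extraction: typed links between recognized entities.
        relations = [("Marie Curie", "born_in", "Warsaw"),
                     ("Marie Curie", "won", "Nobel Prize")]

        # Attribute extraction: (entity, attribute, value) triples.
        attributes = [("Marie Curie", "birthplace", "Warsaw")]

        # Together, records like these populate a knowledge base from raw text.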

    Discovery of a Novel, Orally Efficacious Liver X Receptor (LXR) β Agonist

    No full text
    This article describes the application of Contour to the design and discovery of a novel, potent, orally efficacious liver X receptor β (LXRβ) agonist (17). Contour is a structure-based drug design platform that generates molecules using a context-perceptive growth algorithm guided by a contact-sensitive scoring function. The growth engine uses binding-site perception and programmable growth capability to create drug-like molecules by assembling fragments that naturally complement hydrophilic and hydrophobic features of the protein binding site. Starting with a crystal structure of LXRβ and a docked 2-(methylsulfonyl)benzyl alcohol fragment (6), Contour was used to design agonists containing a piperazine core. Compound 17 binds to LXRβ with high affinity, binds to LXRα to a lesser extent, and induces the expression of LXR target genes in vitro and in vivo. This molecule served as a starting point for further optimization and for the generation of a candidate that is currently in human clinical trials for treating atopic dermatitis.