Search CORE

17 research outputs found

Scaling Relationship on Learning Mathematical Reasoning with Large Language Models

Author: Dong Guanting
Li Chengpeng
Lu Keming
Tan Chuanqi
Yuan Hongyi
Yuan Zheng
Zhou Chang
Zhou Jingren
Publication venue
Publication date: 12/09/2023
Field of study

Mathematical reasoning is a challenging task for large language models (LLMs), while the scaling relationship of it with respect to LLM capacity is under-explored. In this paper, we investigate how the pre-training loss, supervised data amount, and augmented data amount influence the reasoning performances of a supervised LLM. We find that pre-training loss is a better indicator of the model's performance than the model's parameter count. We apply supervised fine-tuning (SFT) with different amounts of supervised data and empirically find a log-linear relation between data amount and model performance, and we find better models improve less with enlarged supervised datasets. To augment more data samples for improving model performances without any human effort, we propose to apply Rejection sampling Fine-Tuning (RFT). RFT uses supervised models to generate and collect correct reasoning paths as augmented fine-tuning datasets. We find with augmented samples containing more distinct reasoning paths, RFT improves mathematical reasoning performance more for LLMs. We also find RFT brings more improvement for less performant LLMs. Furthermore, we combine rejection samples from multiple models which push LLaMA-7B to an accuracy of 49.3\% on GSM8K which outperforms the supervised fine-tuning (SFT) accuracy of 35.9\% significantly.Comment: Working in Progres

arXiv.org e-Print Archive

A rare nonsynonymous variant in the lipid metabolic gene HELZ2 related to primary biliary cirrhosis in Chinese Han

Author: Fengchun Zhang
Guanting Lu
Haoze Zhang
Jing Li
Li Wang
Ping Li
Shijie Mu
Si Chen
Xiaoting Wen
Ying Cui
Yongzhe Li
Ziyan Wu
Publication venue: Springer Nature
Publication date: 01/01/2016
Field of study

Springer - Publisher Connector

Prognostic and therapeutic significance of microbial cell-free DNA in plasma of people with acutely decompensated cirrhosis

Author: Cai Shumin
Chen Jinjun
Cheng Xiao
Fan Zhiping
Gu Yixiu
He Qinjun
Hong Changze
Jalan Rajiv
Ji Yali
Lai Qintao
Lan Xiaoqin
Li Beiling
Li Junying
Li Shaochuan
Liu Miaoxia
Liu Qifa
Lu Guanting
Luo Wenfan
Niu Xiaoyun
Wang Yali
Weng Xing
Publication venue: 'Elsevier BV'
Publication date: 01/02/2023
Field of study

BACKGROUND AND AIMS: Although the effect of bacterial infection on cirrhosis has been well-described, the effect of non-hepatotropic virus (NHV) infection is unknown. This study evaluated the genome fragments of circulating microorganisms using metagenomic next-generation sequencing (mNGS) in cirrhosis patients with acute decompensation (AD), focusing on NHVs and related the findings to clinical outcomes. METHODS: Plasma mNGS was performed in 129 cirrhosis patients with AD in study cohort. Ten healthy volunteers and 20, 39, and 81 patients with stable cirrhosis, severe sepsis and hematological malignancies, respectively, were enrolled as controls. Validation assays for human cytomegalovirus (CMV) reactivation in a validation cohort (n = 58) were performed and exploratory treatment instituted. RESULTS: In study cohort, 188 microorganisms were detected in 74.4% (96/129) patients, including viruses (58.0%), bacteria (34.1%), fungi (7.4%) and chlamydia (0.5%). Patients with AD had an NHV signature, and CMV was the most frequent NHV, which correlated with the clinical effect of empirical antibiotic treatment, progression to acute-on-chronic liver failure (ACLF), and 90-day mortality. The NHV signature in ACLF patients was similar to patients with sepsis and hematological malignancies. The treatable NHV, CMV was detected in 24.1% (14/58) patients in the validation cohort. Of the 14 cases with detectable CMV by mNGS, 9 were further validated by DNA RT-PCR or pp65 antigenemia testing. Three patients with CMV reactivation received ganciclovir therapy in exploratory manner with clinical resolutions. CONCLUSIONS: The results of this study suggests that NHVs may have a pathogenic role in complicating the course of AD. Further validation is needed to define whether this should be incorporated in the routine management of AD patients. IMPACT AND IMPLICATIONS: ●Cirrhosis patients with acute decompensation have a non-hepatotropic virus (NHV) signature, which is similar to that in sepsis and hematological malignancies patients. ●The detected viral signature had clinical correlates, including clinical efficacy of empirical antibiotic treatment, progression to acute-on-chronic liver failure and short-term mortality. ●The treatable NHV, CMV reactivation may be involved in the clinical outcomes of decompensated cirrhosis. ●Routine screening for NHVs, especially CMV, may be useful for the management of patients with acutely decompensated cirrhosis

UCL Discovery

DeepSeek LLM: Scaling Open-Source Language Models with Longtermism

Author: :
Bi Xiao
Chen Deli
Chen Guanting
Chen Shanhuang
Dai Damai
DeepSeek-AI
Deng Chengqi
Ding Honghui
Dong Kai
Du Qiushi
Fu Zhe
Gao Huazuo
Gao Kaige
Gao Wenjun
Ge Ruiqi
Guan Kang
Guo Daya
Guo Jianzhong
Hao Guangbo
Hao Zhewen
He Ying
Hu Wenjie
Huang Panpan
Li Erhang
Li Guowei
Li Jiashi
Li Y. K.
Li Yao
Liang Wenfeng
Lin Fangyun
Liu A. X.
Liu Bo
Liu Wen
Liu Xiaodong
Liu Xin
Liu Yiyuan
Lu Haoyu
Lu Shanghao
Luo Fuli
Ma Shirong
Nie Xiaotao
Pei Tian
Piao Yishi
Qiu Junjie
Qu Hui
Ren Tongzheng
Ren Zehui
Ruan Chong
Sha Zhangli
Shao Zhihong
Song Junxiao
Su Xuecheng
Sun Jingxiang
Sun Yaofeng
Tang Minghui
Wang Bingxuan
Wang Peiyi
Wang Shiyu
Wang Yaohui
Wang Yongji
Wu Tong
Wu Y.
Xie Xin
Xie Zhenda
Xie Ziwei
Xiong Yiliang
Xu Hanwei
Xu R. X.
Xu Yanhong
Yang Dejian
You Yuxiang
Yu Shuiping
Yu Xingkai
Zhang B.
Zhang Haowei
Zhang Lecong
Zhang Liyue
Zhang Mingchuan
Zhang Minghua
Zhang Wentao
Zhang Yichao
Zhao Chenggang
Zhao Yao
Zhou Shangyan
Zhou Shunfeng
Zhu Qihao
Zou Yuheng
Publication venue
Publication date: 05/01/2024
Field of study

The rapid development of open-source large language models (LLMs) has been truly remarkable. However, the scaling law described in previous literature presents varying conclusions, which casts a dark cloud over scaling LLMs. We delve into the study of scaling laws and present our distinctive findings that facilitate scaling of large scale models in two commonly used open-source configurations, 7B and 67B. Guided by the scaling laws, we introduce DeepSeek LLM, a project dedicated to advancing open-source language models with a long-term perspective. To support the pre-training phase, we have developed a dataset that currently consists of 2 trillion tokens and is continuously expanding. We further conduct supervised fine-tuning (SFT) and Direct Preference Optimization (DPO) on DeepSeek LLM Base models, resulting in the creation of DeepSeek Chat models. Our evaluation results demonstrate that DeepSeek LLM 67B surpasses LLaMA-2 70B on various benchmarks, particularly in the domains of code, mathematics, and reasoning. Furthermore, open-ended evaluations reveal that DeepSeek LLM 67B Chat exhibits superior performance compared to GPT-3.5

arXiv.org e-Print Archive

A rare nonsynonymous variant in the lipid metabolic gene HELZ2 related to primary biliary cirrhosis in Chinese Han

Author: A Fernandez-Rodriguez
A Katano-Toki
B Zhang
BP Fairfax
C Berruyer
C Zhao
EJ Heathcote
Fengchun Zhang
G Pascual
GF Mells
GM Hirschfield
Guanting Lu
Haoze Zhang
Jing Li
JR MacDonald
K Amr
K Tamura
Li Wang
M Nakamura
MM Kaplan
N Viswakarma
Ping Li
RR Miles
S Surapureddi
Shijie Mu
Si Chen
T Tomaru
X Liu
Xiaoting Wen
Ying Cui
Yongzhe Li
Ziyan Wu
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Comprehensive Analysis of the Structure and Function of Peptide:N-Glycanase 1 and Relationship with Congenital Disorder of Deglycosylation

Author: Guanting Lu
Hongping Chen
Jin Wu
Xiangguang Miao
Publication venue: 'MDPI AG'
Publication date: 01/04/2022
Field of study

The cytosolic PNGase (peptide:N-glycanase), also known as peptide-N4-(N-acetyl-β-glucosaminyl)-asparagine amidase, is a well-conserved deglycosylation enzyme (EC 3.5.1.52) which catalyzes the non-lysosomal hydrolysis of an N(4)-(acetyl-β-d-glucosaminyl) asparagine residue (Asn, N) into a N-acetyl-β-d-glucosaminyl-amine and a peptide containing an aspartate residue (Asp, D). This enzyme (NGLY1) plays an essential role in the clearance of misfolded or unassembled glycoproteins through a process named ER-associated degradation (ERAD). Accumulating evidence also points out that NGLY1 deficiency can cause an autosomal recessive (AR) human genetic disorder associated with abnormal development and congenital disorder of deglycosylation. In addition, the loss of NGLY1 can affect multiple cellular pathways, including but not limited to NFE2L1 pathway, Creb1/Atf1-AQP pathway, BMP pathway, AMPK pathway, and SLC12A2 ion transporter, which might be the underlying reasons for a constellation of clinical phenotypes of NGLY1 deficiency. The current comprehensive review uncovers the NGLY1’ssdetailed structure and its important roles for participation in ERAD, involvement in CDDG and potential treatment for NGLY1 deficiency

Multidisciplinary Digital Publishing Institute

Directory of Open Access Journals

PubMed Central

GAAD: A Gene and Autoimmiune Disease Association Database

Author: Guanting Lu
Shijie Mu
Wei-Hua Chen
Xiaowen Hao
Publication venue: 'Elsevier BV'
Publication date: 01/08/2018
Field of study

Autoimmune diseases (ADs) arise from an abnormal immune response of the body against substances and tissues normally present in the body. More than a hundred of ADs have been described in the literature so far. Although their etiology remains largely unclear, various types of ADs tend to share more associated genes with other types of ADs than with non-AD types. Here we present GAAD, a gene and AD association database. In GAAD, we collected 44,762 associations between 49 ADs and 4249 genes from public databases and MEDLINE documents. We manually verified the associations to ensure the quality and credibility. We reconstructed and recapitulated the relationships among ADs using their shared genes, which further validated the quality of our data. We also provided a list of significantly co-occurring gene pairs among ADs; with embedded tools, users can query gene co-occurrences and construct customized co-occurrence network with genes of interest. To make GAAD more straightforward to experimental biologists and medical scientists, we extracted additional information describing the associations through text mining, including the putative diagnostic value of the associations, type and position of gene polymorphisms, expression changes of implicated genes, as well as the phenotypical consequences, and grouped the associations accordingly. GAAD is freely available at http://gaad.medgenius.info. Keywords: Autoimmune diseases, Disease–gene association, Database, Text minin

Directory of Open Access Journals

Visual and optical quality outcomes of SMILE and FS-LASIK for myopia in the very early phase after surgery

Author: Guanting Lu
Ji Bai
Kaijian Chen
Qiuxia Kan
Ting Liu
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/04/2019
Field of study

Abstract Background Small incision lenticule extraction (SMILE) and femtosecond laser-assisted in situ keratomileusis (FS-LASIK) are frequently used to treat myopia. However, little is known about the impact on recovery of these approaches in the very early postsurgical phase (within 24 h). Methods To compare the efficacy of these two procedures for the treatment of myopia in the early phase after surgery, differences in visual acuity, OSI (objective scattering index), cutoff for modulation transfer function (MTF), and SR (Strehl ratio) between SMILE and FS-LASIK were evaluated at 0, 2, 4 and 24 h postoperatively using two-way analysis of variance (ANOVA). Results No significant differences between SMILE and FS-LASIK in the MTF cutoff and SR were found (p > 0.05). However, at 2 h and 4 h after surgery, OSI values in the SMILE group were significantly higher than those in the FS-LASIK group, and visual acuity scores in the SMILE group were significantly poorer than those in the FS-LASIK group (p < 0.05). Regarding subjective symptoms, the number of patients complaining of eye dryness, blurred vision, foreign body sensation and eye soreness in the SMILE group were lower than the number in the FS-LASIK group. Conclusions In conclusion, visual and optical quality outcomes of FS-LASIK for myopia were better than those of SMILE in the very early phase after surgery, a difference that is attributable to the formation of interface haze. Trial registration ChiCTR1900021451

Directory of Open Access Journals

Selection for energy efficiency drives strand-biased gene distribution in prokaryotes

Author: Chen Wei-Hua
Gao Na
Lercher Martin J.
Lu Guanting
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2017
Field of study

Lagging-strand genes accumulate more deleterious mutations. Genes are thus preferably located on the leading strand, an observation known as strand-biased gene distribution (SGD). Despite of this mechanistic understanding, a satisfactory quantitative model is still lacking. Replication-transcription-collisions induce stalling of the replication machinery, expose DNA to various attacks, and are followed by error-prone repairs. We found that mutational biases in non-transcribed regions can explain similar to 71% of the variations in SGDs in 1,552 genomes, supporting the mutagenesis origin of SGD. Mutational biases introduce energetically cheaper nucleotides on the lagging strand, and result in more expensive protein products; consistently, the cost difference between the two strands explains similar to 50% of the variance in SGDs. Protein costs decrease with increasing gene expression. At similar expression levels, protein products of leading-strand genes are generally cheaper than lagging-strand genes; however, highly-expressed lagging genes are still cheaper than lowly-expressed leading genes. Selection for energy efficiency thus drives some genes to the leading strand, especially those highly expressed and essential, but certainly not all genes. Stronger mutational biases are often associated with low-GC genomes; as low-GC genes encode expensive proteins, low-GC genomes thus tend to have stronger SGDs to alleviate the stronger pressure on efficient energy usage

Kölner UniversitätsPublikationsServer

Directory of Open Access Journals

OGEE v2: an update of the online gene essentiality database with special focus on differentially essential genes in human cancer cell lines

Author: Guanting Lu
Peer Bork
Wei-Hua Chen
Xiao Chen
Xing-Ming Zhao
Publication venue: 'Oxford University Press (OUP)'
Publication date: 30/10/2016
Field of study

OGEE is an Online GEne Essentiality database. To enhance our understanding of the essentiality of genes, in OGEE we collected experimentally tested essential and non-essential genes, as well as associated gene properties known to contribute to gene essentiality. We focus on large-scale experiments, and complement our data with text-mining results. We organized tested genes into data sets according to their sources, and tagged those with variable essentiality statuses across data sets as conditionally essential genes, intending to highlight the complex interplay between gene functions and environments/experimental perturbations. Developments since the last public release include increased numbers of species and gene essentiality data sets, inclusion of non-coding essential sequences and genes with intermediate essentiality statuses. In addition, we included 16 essentiality data sets from cancer cell lines, corresponding to 9 human cancers; with OGEE, users can easily explore the shared and differentially essential genes within and between cancer types. These genes, especially those derived from cell lines that are similar to tumor samples, could reveal the oncogenic drivers, paralogous gene expression pattern and chromosomal structure of the corresponding cancer types, and can be further screened to identify targets for cancer therapy and/or new drug development. OGEE is freely available at http://ogee.medgenius.info

Crossref

PubMed Central

MDC Repository

Online-Publikations-Server der Universität Würzburg