Making Linear MDPs Practical via Contrastive Representation Learning
It is common to address the curse of dimensionality in Markov decision
processes (MDPs) by exploiting low-rank representations. This motivates much of
the recent theoretical study on linear MDPs. However, most approaches require a
given representation under unrealistic assumptions about the normalization of
the decomposition or introduce unresolved computational challenges in practice.
Instead, we consider an alternative definition of linear MDPs that
automatically ensures normalization while allowing efficient representation
learning via contrastive estimation. The framework also admits
confidence-adjusted index algorithms, enabling an efficient and principled
approach to incorporating optimism or pessimism in the face of uncertainty. To
the best of our knowledge, this provides the first practical representation
learning method for linear MDPs that achieves both strong theoretical
guarantees and empirical performance. Theoretically, we prove that the proposed
algorithm is sample efficient in both the online and offline settings.
Empirically, we demonstrate superior performance over existing state-of-the-art
model-based and model-free algorithms on several benchmarks.
Comment: ICML 2022. The first two authors contributed equally.
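The contrastive estimation idea described above can be illustrated with a small, hypothetical sketch. This is not the paper's actual objective, but a generic InfoNCE-style loss under the assumption that the transition kernel is modeled as an inner product of learned features, P(s' | s, a) ≈ ⟨φ(s, a), μ(s')⟩; the function and argument names are illustrative only.

```python
import math

def contrastive_loss(phi_sa, mu_pos, mu_negs):
    """InfoNCE-style contrastive loss for learning linear MDP features.

    phi_sa : feature vector for a (state, action) pair.
    mu_pos : embedding of the actually observed next state (positive).
    mu_negs: embeddings of sampled "negative" next states.

    Training pushes <phi(s, a), mu(s')> up for observed transitions and
    down for negatives, so the inner product tracks P(s' | s, a).
    """
    dot = lambda u, v: sum(a * b for a, b in zip(u, v))
    pos = dot(phi_sa, mu_pos)
    logits = [pos] + [dot(phi_sa, mu) for mu in mu_negs]
    # Softmax cross-entropy with the observed next state as the label.
    return -pos + math.log(sum(math.exp(z) for z in logits))
```

Minimizing this over transition data yields features whose inner products are normalized like probabilities, which is the property that makes the alternative linear MDP definition tractable.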
DeepSeek-VL: Towards Real-World Vision-Language Understanding
We present DeepSeek-VL, an open-source Vision-Language (VL) Model designed
for real-world vision and language understanding applications. Our approach is
structured around three key dimensions:

Data: We strive to ensure our data is diverse, scalable, and extensively
covers real-world scenarios, including web screenshots, PDFs, OCR, charts, and
knowledge-based content, aiming for a comprehensive representation of practical
contexts. Further, we create a use-case taxonomy from real user scenarios and
construct an instruction-tuning dataset accordingly. Fine-tuning with this
dataset substantially improves the model's user experience in practical
applications.

Model architecture: Considering efficiency and the demands of most real-world
scenarios, DeepSeek-VL incorporates a hybrid vision encoder that efficiently
processes high-resolution images (1024 x 1024) while maintaining relatively
low computational overhead. This design choice ensures the model's ability to
capture critical semantic and detailed information across various visual tasks.

Training strategy: We posit that a proficient Vision-Language Model should,
foremost, possess strong language abilities. To preserve LLM capabilities
during pretraining, we investigate an effective VL pretraining strategy that
integrates LLM training from the beginning and carefully manages the
competitive dynamics observed between the vision and language modalities.
The DeepSeek-VL family (both 1.3B and 7B models) showcases superior user
experiences as a vision-language chatbot in real-world applications, achieving
state-of-the-art or competitive performance across a wide range of
visual-language benchmarks at the same model size while maintaining robust
performance on language-centric benchmarks. We have made both 1.3B and 7B
models publicly accessible to foster innovations based on this foundation
model.
Comment: https://github.com/deepseek-ai/DeepSeek-V
DeepSeek LLM: Scaling Open-Source Language Models with Longtermism
The rapid development of open-source large language models (LLMs) has been
truly remarkable. However, the scaling laws described in previous literature
reach varying conclusions, which casts doubt on how best to scale LLMs. We
delve into the study of scaling laws and present our distinctive findings that
facilitate the scaling of large-scale models in two commonly used open-source
configurations, 7B and 67B. Guided by the scaling laws, we introduce DeepSeek
LLM, a project dedicated to advancing open-source language models with a
long-term perspective. To support the pre-training phase, we have developed a
dataset that currently consists of 2 trillion tokens and is continuously
expanding. We further conduct supervised fine-tuning (SFT) and Direct
Preference Optimization (DPO) on DeepSeek LLM Base models, resulting in the
creation of DeepSeek Chat models. Our evaluation results demonstrate that
DeepSeek LLM 67B surpasses LLaMA-2 70B on various benchmarks, particularly in
the domains of code, mathematics, and reasoning. Furthermore, open-ended
evaluations reveal that DeepSeek LLM 67B Chat exhibits superior performance
compared to GPT-3.5.
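The DPO step mentioned above optimizes the policy directly on preference pairs, without a separate reward model. A minimal sketch of the standard per-pair DPO loss follows; the function and argument names are illustrative, not DeepSeek's implementation.

```python
import math

def dpo_loss(logp_chosen, logp_rejected,
             ref_logp_chosen, ref_logp_rejected, beta=0.1):
    """Direct Preference Optimization loss for one preference pair.

    logp_*     : the policy's total log-probabilities of the chosen and
                 rejected responses.
    ref_logp_* : the same quantities under a frozen reference model.
    beta       : strength of the implicit KL constraint to the reference.
    """
    # Implicit reward margin: how much the policy has shifted probability
    # toward the chosen response, relative to the reference model.
    margin = beta * ((logp_chosen - ref_logp_chosen)
                     - (logp_rejected - ref_logp_rejected))
    # Logistic loss: pushes the chosen response above the rejected one.
    return -math.log(1.0 / (1.0 + math.exp(-margin)))
```

When the policy matches the reference model the margin is zero and the loss is log 2; it falls as the policy assigns relatively more probability to the chosen response.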
Prevalence and drug resistance of Salmonella in dogs and cats in Xuzhou, China
Salmonellosis is a zoonotic disease, and Salmonella spp. can sometimes be found in dogs and cats, posing a risk to human health. In this study, the prevalence and antimicrobial susceptibility of faecal Salmonella were investigated in pet dogs and cats in Xuzhou, Jiangsu Province, China.
Cyclic Response of Additive Manufactured 316L Stainless Steel: The Role of Cell Structures
We report the effect of cell structures on the fatigue behavior of additively manufactured (AM) 316L stainless steel (316LSS). Compared with the cell-free samples, the fatigue process of the fully cellular samples consists only of steady and overload stages, without an initial softening stage. Moreover, the fully cellular samples possess higher strength, a lower cyclic softening rate, and a longer lifetime. Microscopic analyses show no difference in grain orientations, dimensions, or shapes. However, the fully cellular samples show planar dislocation structures, whereas the cell-free samples display wavy dislocation structures. The existence of cell structures promotes the activation of planar slip, delays strain localization, and ultimately enhances the fatigue performance of AM 316LSS.
Funding: Swedish Governmental Agency for Innovation Systems (Vinnova) [2016-05175]; Science Foundation Ireland (SFI) [16/RC/3872]; European Regional Development Fund (European Commission); I-Form industry partners; Ji Hua Laboratory [X210141TL210]; Center for Additive Manufacturing-metal (CAM2)
The Autophagy-Related Protein ATG8 Orchestrates Asexual Development and AFB1 Biosynthesis in Aspergillus flavus
Autophagy, a conserved cellular recycling process, plays a crucial role in maintaining homeostasis under stress conditions. It also regulates the development and virulence of numerous filamentous fungi. In this study, we investigated the specific function of ATG8, a reliable autophagic marker, in the opportunistic pathogen Aspergillus flavus. To investigate the role of atg8 in A. flavus, deletion and complemented mutants of atg8 were generated by homologous recombination. The atg8 deletion strain showed a significant decrease in conidiation, spore germination, and sclerotia formation compared to the WT and atg8C strains. Additionally, aflatoxin production was severely impaired in the ∆atg8 mutant. Stress assays demonstrated that ATG8 is important for the response of A. flavus to oxidative stress. Fluorescence microscopy showed increased levels of reactive oxygen species in ∆atg8 mutant cells, and transcriptional analysis also indicated that genes related to the antioxidant system were significantly downregulated in the ∆atg8 mutant. We further found that ATG8 participates in regulating the pathogenicity of A. flavus on crop seeds. These results reveal the biological role of ATG8 in A. flavus and might provide a potential target for the control of A. flavus and AFB1 biosynthesis.