Can LLMs Deeply Detect Complex Malicious Queries? A Framework for Jailbreaking via Obfuscating Intent
To demonstrate and address the underlying maliciousness, we propose a
theoretical hypothesis and analytical approach, and introduce a new black-box
jailbreak attack methodology named IntentObfuscator, which exploits this
identified flaw by obfuscating the true intentions behind user prompts. This
approach compels LLMs to inadvertently generate restricted content, bypassing
their built-in content security measures. We detail two implementations under
this framework, "Obscure Intention" and "Create Ambiguity", which manipulate
query complexity and ambiguity to effectively evade malicious-intent detection. We
empirically validate the effectiveness of the IntentObfuscator method across
several models, including ChatGPT-3.5, ChatGPT-4, Qwen, and Baichuan, achieving
an average jailbreak success rate of 69.21%. Notably, our tests on
ChatGPT-3.5, which reportedly has 100 million weekly active users, achieved a
remarkable success rate of 83.65%. We also extend our validation to diverse
types of sensitive content, including graphic violence, racism, sexism, political
sensitivity, cybersecurity threats, and criminal skills, further demonstrating the
substantial impact of our findings on enhancing 'Red Team' strategies against
LLM content security frameworks.
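
For context on how such numbers can be reported, the sketch below tallies an average jailbreak success rate from per-model judgments. The boolean judgments and the unweighted averaging over models are illustrative assumptions, not the paper's exact evaluation procedure.

def success_rate(judgments):
    # Percentage of attack attempts judged successful for one target model.
    return 100.0 * sum(judgments) / len(judgments)

def average_success_rate(judgments_by_model):
    # Unweighted mean of the per-model success rates.
    rates = [success_rate(j) for j in judgments_by_model.values()]
    return sum(rates) / len(rates)

# Hypothetical judged attempts (True = restricted content was produced).
example = {
    "ChatGPT-3.5": [True, True, False, True],
    "ChatGPT-4": [True, False, False, True],
    "Qwen": [True, True, False, False],
    "Baichuan": [False, True, True, False],
}
print(f"Average jailbreak success rate: {average_success_rate(example):.2f}%")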
TeleChat Technical Report
In this technical report, we present TeleChat, a collection of large language
models (LLMs) with 3 billion, 7 billion, and 12 billion parameters. It
includes pretrained language models as well as fine-tuned chat models that are
aligned with human preferences. TeleChat is initially pretrained on an
extensive corpus comprising trillions of tokens of diverse English and
Chinese text. Subsequently, the model
undergoes fine-tuning to align with human preferences, following a detailed
methodology that we describe. We evaluate the performance of TeleChat on
various tasks, including language understanding, mathematics, reasoning, code
generation, and knowledge-based question answering. Our findings indicate that
TeleChat achieves comparable performance to other open-source models of similar
size across a wide range of public benchmarks. To support future research and
applications utilizing LLMs, we release the fine-tuned model checkpoints of
TeleChat's 7B and 12B variants, along with code and a portion of our pretraining
data, to the public community.
Comments: 28 pages, 2 figures
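
For readers who want to try the released checkpoints, below is a minimal sketch of loading a TeleChat chat model with the Hugging Face transformers library. The repository id "Tele-AI/telechat-7B" and the use of trust_remote_code are assumptions about how the public release is hosted, not details taken from the report; adjust them to the actual location given in the TeleChat code release.

# Minimal sketch: load an assumed TeleChat 7B chat checkpoint and generate text.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Tele-AI/telechat-7B"  # assumed repository id for the 7B variant

tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    trust_remote_code=True,  # custom model code, if shipped with the checkpoint
    device_map="auto",
)

prompt = "Briefly explain what a large language model is."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=128)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))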