Search CORE

202 research outputs found

Bias Assessment and Mitigation in LLM-based Code Generation

Author: Bu Qingwen
Chen Junjie
Cui Heming
Huang Dong
Xie Xiaofei
Zhang Jie
Publication venue
Publication date: 03/09/2023
Field of study

Utilizing state-of-the-art Large Language Models (LLMs), automatic code generation models play a pivotal role in enhancing the productivity and efficiency of software development coding procedures. As the adoption of LLMs becomes more widespread in software coding ecosystems, a pressing issue has emerged: does the generated code contain social biases, such as those related to age, gender, and race? This issue concerns the integrity, fairness, and ethical foundation of software applications that depend on the code generated by these models, yet is under-explored in the literature. This paper presents a novel bias assessment framework that is specifically designed for code generation tasks. Based on this framework, we conduct an extensive evaluation on the bias of nine state-of-the-art LLM-based code generation models. Our findings reveal that first, 31.45\% to 79.93\% code functions generated by our evaluated code generation models are biased, and 9.68\% to 37.37\% code functions' functionality are affected by the bias, which means biases not only exist in code generation models but in some cases, directly affect the functionality of the generated code, posing risks of unintended and possibly harmful software behaviors. To mitigate bias from code generation models, we propose three mitigation strategies, which can decrease the biased code ratio to a very low level of 0.4\% to 4.57\%

arXiv.org e-Print Archive

FT2Ra: A Fine-Tuning-Inspired Approach to Retrieval-Augmented Code Completion

Author: Bu Lei
Feng Ruitao
Ge Jidong
Guo Qi
Li Xiaohong
Liu Shangqing
Tang Ze
Wang Junjie
Xie Xiaofei
Publication venue
Publication date: 01/04/2024
Field of study

The rise of code pre-trained models has significantly enhanced various coding tasks, such as code completion, and tools like GitHub Copilot. However, the substantial size of these models, especially large models, poses a significant challenge when it comes to fine-tuning them for specific downstream tasks. As an alternative approach, retrieval-based methods have emerged as a promising solution, augmenting model predictions without the need for fine-tuning. Despite their potential, a significant challenge is that the designs of these methods often rely on heuristics, leaving critical questions about what information should be stored or retrieved and how to interpolate such information for augmenting predictions. To tackle this challenge, we first perform a theoretical analysis of the fine-tuning process, highlighting the importance of delta logits as a catalyst for improving model predictions. Building on this insight, we develop a novel retrieval-based method, FT2Ra, which aims to mimic genuine fine-tuning. While FT2Ra adopts a retrieval-based mechanism, it uniquely adopts a paradigm with a learning rate and multi-epoch retrievals, which is similar to fine-tuning.In token-level completion, which represents a relatively easier task, FT2Ra achieves a 4.29% improvement in accuracy compared to the best baseline method on UniXcoder. In the more challenging line-level completion task, we observe a substantial more than twice increase in Exact Match (EM) performance, indicating the significant advantages of our theoretical analysis. Notably, even when operating without actual fine-tuning, FT2Ra exhibits competitive performance compared to the models with real fine-tuning.Comment: ISSTA 202

arXiv.org e-Print Archive

Low-Theta Electroencephalography Coherence Predicts Cigarette Craving in Nicotine Addiction

Author: Chialun Liu
Chuan Fan
Chuan Fan
Junjie Bu
Junjie Bu
Pengyu Zhang
Ru Ma
Shinan Sun
Xiaochu Zhang
Xiaochu Zhang
Xiaochu Zhang
Xiaochu Zhang
Yan Cheng
Yi Piao
Publication venue: 'Frontiers Media SA'
Publication date: 01/05/2019
Field of study

Addicts are often vulnerable to drug use in the presence of drug cues, which elicit significant drug cue reactivity. Mounting neuroimaging evidence suggests an association between functional magnetic resonance imaging connectivity networks and smoking cue reactivity; however, there is still little understanding of the electroencephalography (EEG) coherence basis of smoking cue reactivity. We therefore designed two independent experiments wherein nicotine-dependent smokers performed a smoking cue reactivity task during EEG recording. Experiment I showed that a low-theta EEG coherence network occurring 400–600 ms after onset during long-range (mainly between frontal and parieto-occipital) scalp regions, which was involved in smoking cue reactivity. Moreover, the average coherence of this network was significantly correlated with participants’ level of cigarette craving. In experiment II, we tested an independent group of smokers and demonstrated that the low-theta coherence network significantly predicted changes in individuals’ cigarette craving. Thus, the low-theta EEG coherence in smokers’ brains might be a biomarker of smoking cue reactivity and can predict addiction behavior

Directory of Open Access Journals

Neuroimaging Studies Reveal the Subtle Difference Among Social Network Size Measurements and Shed Light on New Directions

Author: Junjie Bu
Ru Ma
Ruiqi Huang
Shen Liu
Xiaochu Zhang
Xiaochu Zhang
Xiaochu Zhang
Xiaochu Zhang
Xiaoming Liu
Xiaoming Liu
Xueli Chen
Yunlu Xie
Yuzhi Luo
Publication venue: 'Frontiers Media SA'
Publication date: 01/07/2018
Field of study

Social network size is a key feature when we explore the constructions of human social networks. Despite the disparate understanding of individuals’ social networks, researchers have reached a consensus that human’s social networks are hierarchically organized with different layers, which represent emotional bonds and interaction frequency. Social brain hypothesis emphasizes the significance of complex and demanding social interaction environments and assumes that the cognitive constraints may have an impact on the social network size. This paper reviews neuroimaging studies on social networks that explored the connection between individuals’ social network size and neural mechanisms and finds that Social Network Index (SNI) and Social Network Questionnaires (SNQs) are the mostly-adopted measurements of one’s social network size. The two assessments have subtle difference in essence as they measure the different sublayers of one’s social network. The former measures the relatively outer sub-layer of one’s stable social relationship, similar to the sympathy group, while the latter assesses the innermost layer—the core of one’s social network, often referred to as support clique. This subtle difference is also corroborated by neuroimaging studies, as SNI-measured social network size is largely correlated with the amygdala, while SNQ-assessed social network size is closely related to both the amygdala and the orbitofrontal cortex. The two brain regions respond to disparate degrees of social closeness, respectively. Finally, it proposes a careful choice among the measurements for specific purposes and some new approaches to assess individuals’ social network size

Directory of Open Access Journals

A Variable Ionized Disk Wind in the Black Hole Candidate EXO 1846–031

Author: Altamirano Diego
Bu Q. C.
Cai C.
Cao X. L.
Chang Z.
Chen L.
Chen T. X.
Chen Y.
Chen Y. P.
Cui W. W.
Dauser Thomas
Du Y. Y.
Gao G. H.
García Javier A.
Ge M. Y.
Gu Y. D.
Guan J.
Guo C. C.
Han D. W.
Huang Y.
Huo J.
Ji Long
Jia S. M.
Jiang W. C.
Jin J.
Kong L. D.
Li B.
Li C. K.
Li G.
Li T. P.
Li W.
Li X.
Li X. B.
Li X. F.
Li Z. W.
Liang X. H.
Liao J. Y.
Liu C. Z.
Liu H. W.
Liu X. J.
Lu F. J.
Lu X. F.
Luo Q.
Luo T.
Ma X.
Maggi Pierre
Mao Junjie
Meng B.
Méndez Mariano
Nang Y.
Nie J. Y.
Ou G.
Qu J. L.
Sai N.
Shang R. C.
Song L. M.
Song X. Y.
Sun L.
Tan Y.
Tao L.
Tuo Y. L.
Wang W. S.
Wang Y. D.
Wang Y. S.
Wang Yanan
Wen X. Y.
Wu B. B.
Wu B. Y.
Wu M.
Xiao G. C.
Xiao S.
Xiong S. L.
Xu Y. P.
Yang S.
Yang Y. J.
Yi Q. B.
Yin Q. Q.
You Y.
Zhang F.
Zhang H. M.
Zhang J.
Zhang L.
Zhang S.
Zhang S. N.
Zhang W.
Zhang W. C.
Zhang Y. F.
Zhao H. S.
Zhao X. F.
Zheng S. J.
Zhou D. K.
Publication venue: 'American Astronomical Society'
Publication date: 01/01/2021
Field of study

After 34 yr, the black hole candidate EXO 1846–031 went into outburst again in 2019. We investigate its spectral properties in the hard intermediate and the soft states with NuSTAR and Insight-HXMT. A reflection component has been detected in the two spectral states but possibly originating from different illumination spectra: in the intermediate state, the illuminating source is attributed to a hard coronal component, which has been commonly observed in other X-ray binaries, whereas in the soft state, the reflection is probably produced by disk self-irradiation. Both cases support EXO 1846–031 as a low-inclination system of ~40°. An absorption line is clearly detected at ~7.2 keV in the hard intermediate state, corresponding to a highly ionized disk wind (log} ξ > 6.1) with a velocity of up to 0.06c. Meanwhile, quasi-simultaneous radio emissions have been detected before and after the X-rays, implying the coexistence of disk winds and jets in this system. If only the high-flux segment of the NuSTAR observation is considered, the observed wind appears to be magnetically driven. The absorption line disappeared in the soft state and a narrow emission line appeared at ~6.7 keV on top of the reflection component, which may be evidence for disk winds, but data with higher spectral resolution are required to examine this

Вихретоковый анизотропный термоэлектрический первичный преобразователь лучистого потока

Author: An She-Juan
Bassig Bryan A
Berndt Sonja I
Burdett Laurie
Cai Qiuyin
Caporaso Neil E
Cawthon Richard
Chan John K C
Chang Gee-Chen
Chang I-Shou
Chang Jiang
Chanock Stephen J
Chatterjee Nilanjan
Chen Chien-Jen
Chen Chih-Yi
Chen Chung-Hsing
Chen Hongyan
Chen Kexin
Chen Kuan-Yu
Chen Kun-Chieh
Chen Ying
Chen Ying-Hsiang
Chen Yuh-Min
Choi Jin Eun
Choi Yi Young
Chow Wong-Ho
Chu Minjie
Chung Charles C
Dong Jing
Fraumeni Joseph F
Gao Yu-Tang
Guan Peng
Guo Huan
He Qincheng
Hong Yun-Chul
Hosgood H Dean
Hsiao Chin-Fu
Hsiung Chao Agnes
Hu Lingmin
Hu Wei
Hu Zhibin
Huang Ming-Shyan
Hung Jen-Yu
Hutchinson Amy
Jeon Hyo-Sung
Ji Bu-Tian
Jin Li
Jung Yoo Jin
Kang Chang Hyun
Kim Christopher
Kim Hee Nam
Kim Jin Hee
Kim Jun Suk
Kim Yeul Hong
Kim Young Tae
Kim Young-Chul
Klein Robert J
Kohno Takashi
Kunitoh Hideo
Kweon Sun-Seog
Lan Qing
Landi Maria Teresa
Lawrence Charles
Lee Victor Ho Fun
Li Haixin
Li Jihua
Li Yao-Jen
Li Yuqing
Lim Wei-Yen
Lin Chien-Chung
Lin Dongxin
Lin Hsien-Chih
Liu Jianjun
Liu Li
Lo Yen-Li
Lu Daru
Machiela Mitchell J
Matsuo Keitaro
Mitsudomi Tetsuya
Oh In-Jae
Pao William
Park In Kyu
Park Jae Yong
Park Kyong Hwa
Perng Reury-Perng
Qian Biyun
Rothman Nathaniel
Seow Adeline
Seow Wei Jie
Shen Hongbing
Shin Min-Ho
Shiraishi Kouya
Shu Xiao-Ou
Sihoe Alan Dart Loon
Su Jian
Su Wu-Chou
Sung Jae Sook
Sung Sook Whan
Tan Wen
Tsai Ying-Huang
Tucker Margaret
Vermeulen Roel C H
Wang Chih-Liang
Wang Jiu-Cun
Wang Junwen
Wang Wen-Chang
Wang Zhaoming
Wei Fusheng
Wen Wanqing
Wong Maria Pik
Wu Chen
Wu Guoping
Wu Junjie
Wu Tangchun
Wu Wei
Wu Yi-Long
Xiang Yong-Bing
Xu Jun
Xu Ping
Yang Pan-Chyr
Yang Tsung-Ying
Yeager Meredith
Yin Zhihua
Yokota Jun
Yoon Ho-Il
Yu Chong-Jen
Zhang Xu-Chao
Zhao Xueying
Zhao Zhenhong
Zheng Hong
Zheng Wei
Zhou Baosen
Publication venue: Інститут фізики напівпровідників імені В.Є. Лашкарьова НАН України
Publication date: 01/01/2010
Field of study

Представлена оригинальная конструкция первичного преобразователя лучистого потока, который может служить основой для создания приемника неселективного излучения с повышенной чувствительностью

Наукова електронна бібліотека періодичних видань НАН України (Vernadsky National Library of Ukraine)

Crossref

PubMed Central

Utrecht University Repository