202 research outputs found
Bias Assessment and Mitigation in LLM-based Code Generation
Utilizing state-of-the-art Large Language Models (LLMs), automatic code
generation models play a pivotal role in enhancing the productivity and
efficiency of software development coding procedures. As the adoption of LLMs
becomes more widespread in software coding ecosystems, a pressing issue has
emerged: does the generated code contain social biases, such as those related
to age, gender, and race? This issue concerns the integrity, fairness, and
ethical foundation of software applications that depend on the code generated
by these models, yet is under-explored in the literature. This paper presents a
novel bias assessment framework that is specifically designed for code
generation tasks. Based on this framework, we conduct an extensive evaluation
on the bias of nine state-of-the-art LLM-based code generation models. Our
findings reveal that first, 31.45\% to 79.93\% code functions generated by our
evaluated code generation models are biased, and 9.68\% to 37.37\% code
functions' functionality are affected by the bias, which means biases not only
exist in code generation models but in some cases, directly affect the
functionality of the generated code, posing risks of unintended and possibly
harmful software behaviors. To mitigate bias from code generation models, we
propose three mitigation strategies, which can decrease the biased code ratio
to a very low level of 0.4\% to 4.57\%
FT2Ra: A Fine-Tuning-Inspired Approach to Retrieval-Augmented Code Completion
The rise of code pre-trained models has significantly enhanced various coding
tasks, such as code completion, and tools like GitHub Copilot. However, the
substantial size of these models, especially large models, poses a significant
challenge when it comes to fine-tuning them for specific downstream tasks. As
an alternative approach, retrieval-based methods have emerged as a promising
solution, augmenting model predictions without the need for fine-tuning.
Despite their potential, a significant challenge is that the designs of these
methods often rely on heuristics, leaving critical questions about what
information should be stored or retrieved and how to interpolate such
information for augmenting predictions.
To tackle this challenge, we first perform a theoretical analysis of the
fine-tuning process, highlighting the importance of delta logits as a catalyst
for improving model predictions. Building on this insight, we develop a novel
retrieval-based method, FT2Ra, which aims to mimic genuine fine-tuning. While
FT2Ra adopts a retrieval-based mechanism, it uniquely adopts a paradigm with a
learning rate and multi-epoch retrievals, which is similar to fine-tuning.In
token-level completion, which represents a relatively easier task, FT2Ra
achieves a 4.29% improvement in accuracy compared to the best baseline method
on UniXcoder. In the more challenging line-level completion task, we observe a
substantial more than twice increase in Exact Match (EM) performance,
indicating the significant advantages of our theoretical analysis. Notably,
even when operating without actual fine-tuning, FT2Ra exhibits competitive
performance compared to the models with real fine-tuning.Comment: ISSTA 202
Low-Theta Electroencephalography Coherence Predicts Cigarette Craving in Nicotine Addiction
Addicts are often vulnerable to drug use in the presence of drug cues, which elicit significant drug cue reactivity. Mounting neuroimaging evidence suggests an association between functional magnetic resonance imaging connectivity networks and smoking cue reactivity; however, there is still little understanding of the electroencephalography (EEG) coherence basis of smoking cue reactivity. We therefore designed two independent experiments wherein nicotine-dependent smokers performed a smoking cue reactivity task during EEG recording. Experiment I showed that a low-theta EEG coherence network occurring 400–600 ms after onset during long-range (mainly between frontal and parieto-occipital) scalp regions, which was involved in smoking cue reactivity. Moreover, the average coherence of this network was significantly correlated with participants’ level of cigarette craving. In experiment II, we tested an independent group of smokers and demonstrated that the low-theta coherence network significantly predicted changes in individuals’ cigarette craving. Thus, the low-theta EEG coherence in smokers’ brains might be a biomarker of smoking cue reactivity and can predict addiction behavior
Neuroimaging Studies Reveal the Subtle Difference Among Social Network Size Measurements and Shed Light on New Directions
Social network size is a key feature when we explore the constructions of human social networks. Despite the disparate understanding of individuals’ social networks, researchers have reached a consensus that human’s social networks are hierarchically organized with different layers, which represent emotional bonds and interaction frequency. Social brain hypothesis emphasizes the significance of complex and demanding social interaction environments and assumes that the cognitive constraints may have an impact on the social network size. This paper reviews neuroimaging studies on social networks that explored the connection between individuals’ social network size and neural mechanisms and finds that Social Network Index (SNI) and Social Network Questionnaires (SNQs) are the mostly-adopted measurements of one’s social network size. The two assessments have subtle difference in essence as they measure the different sublayers of one’s social network. The former measures the relatively outer sub-layer of one’s stable social relationship, similar to the sympathy group, while the latter assesses the innermost layer—the core of one’s social network, often referred to as support clique. This subtle difference is also corroborated by neuroimaging studies, as SNI-measured social network size is largely correlated with the amygdala, while SNQ-assessed social network size is closely related to both the amygdala and the orbitofrontal cortex. The two brain regions respond to disparate degrees of social closeness, respectively. Finally, it proposes a careful choice among the measurements for specific purposes and some new approaches to assess individuals’ social network size
A Variable Ionized Disk Wind in the Black Hole Candidate EXO 1846–031
After 34 yr, the black hole candidate EXO 1846–031 went into outburst again in 2019. We investigate its spectral properties in the hard intermediate and the soft states with NuSTAR and Insight-HXMT. A reflection component has been detected in the two spectral states but possibly originating from different illumination spectra: in the intermediate state, the illuminating source is attributed to a hard coronal component, which has been commonly observed in other X-ray binaries, whereas in the soft state, the reflection is probably produced by disk self-irradiation. Both cases support EXO 1846–031 as a low-inclination system of ~40°. An absorption line is clearly detected at ~7.2 keV in the hard intermediate state, corresponding to a highly ionized disk wind (log} ξ > 6.1) with a velocity of up to 0.06c. Meanwhile, quasi-simultaneous radio emissions have been detected before and after the X-rays, implying the coexistence of disk winds and jets in this system. If only the high-flux segment of the NuSTAR observation is considered, the observed wind appears to be magnetically driven. The absorption line disappeared in the soft state and a narrow emission line appeared at ~6.7 keV on top of the reflection component, which may be evidence for disk winds, but data with higher spectral resolution are required to examine this
Вихретоковый анизотропный термоэлектрический первичный преобразователь лучистого потока
Представлена оригинальная конструкция первичного преобразователя лучистого потока, который может служить основой для создания приемника неселективного излучения с повышенной чувствительностью
- …