202 research outputs found

    Bias Assessment and Mitigation in LLM-based Code Generation

    Full text link
    Utilizing state-of-the-art Large Language Models (LLMs), automatic code generation models play a pivotal role in enhancing the productivity and efficiency of software development coding procedures. As the adoption of LLMs becomes more widespread in software coding ecosystems, a pressing issue has emerged: does the generated code contain social biases, such as those related to age, gender, and race? This issue concerns the integrity, fairness, and ethical foundation of software applications that depend on the code generated by these models, yet is under-explored in the literature. This paper presents a novel bias assessment framework that is specifically designed for code generation tasks. Based on this framework, we conduct an extensive evaluation on the bias of nine state-of-the-art LLM-based code generation models. Our findings reveal that first, 31.45\% to 79.93\% code functions generated by our evaluated code generation models are biased, and 9.68\% to 37.37\% code functions' functionality are affected by the bias, which means biases not only exist in code generation models but in some cases, directly affect the functionality of the generated code, posing risks of unintended and possibly harmful software behaviors. To mitigate bias from code generation models, we propose three mitigation strategies, which can decrease the biased code ratio to a very low level of 0.4\% to 4.57\%

    FT2Ra: A Fine-Tuning-Inspired Approach to Retrieval-Augmented Code Completion

    Full text link
    The rise of code pre-trained models has significantly enhanced various coding tasks, such as code completion, and tools like GitHub Copilot. However, the substantial size of these models, especially large models, poses a significant challenge when it comes to fine-tuning them for specific downstream tasks. As an alternative approach, retrieval-based methods have emerged as a promising solution, augmenting model predictions without the need for fine-tuning. Despite their potential, a significant challenge is that the designs of these methods often rely on heuristics, leaving critical questions about what information should be stored or retrieved and how to interpolate such information for augmenting predictions. To tackle this challenge, we first perform a theoretical analysis of the fine-tuning process, highlighting the importance of delta logits as a catalyst for improving model predictions. Building on this insight, we develop a novel retrieval-based method, FT2Ra, which aims to mimic genuine fine-tuning. While FT2Ra adopts a retrieval-based mechanism, it uniquely adopts a paradigm with a learning rate and multi-epoch retrievals, which is similar to fine-tuning.In token-level completion, which represents a relatively easier task, FT2Ra achieves a 4.29% improvement in accuracy compared to the best baseline method on UniXcoder. In the more challenging line-level completion task, we observe a substantial more than twice increase in Exact Match (EM) performance, indicating the significant advantages of our theoretical analysis. Notably, even when operating without actual fine-tuning, FT2Ra exhibits competitive performance compared to the models with real fine-tuning.Comment: ISSTA 202

    Low-Theta Electroencephalography Coherence Predicts Cigarette Craving in Nicotine Addiction

    Get PDF
    Addicts are often vulnerable to drug use in the presence of drug cues, which elicit significant drug cue reactivity. Mounting neuroimaging evidence suggests an association between functional magnetic resonance imaging connectivity networks and smoking cue reactivity; however, there is still little understanding of the electroencephalography (EEG) coherence basis of smoking cue reactivity. We therefore designed two independent experiments wherein nicotine-dependent smokers performed a smoking cue reactivity task during EEG recording. Experiment I showed that a low-theta EEG coherence network occurring 400–600 ms after onset during long-range (mainly between frontal and parieto-occipital) scalp regions, which was involved in smoking cue reactivity. Moreover, the average coherence of this network was significantly correlated with participants’ level of cigarette craving. In experiment II, we tested an independent group of smokers and demonstrated that the low-theta coherence network significantly predicted changes in individuals’ cigarette craving. Thus, the low-theta EEG coherence in smokers’ brains might be a biomarker of smoking cue reactivity and can predict addiction behavior

    Neuroimaging Studies Reveal the Subtle Difference Among Social Network Size Measurements and Shed Light on New Directions

    Get PDF
    Social network size is a key feature when we explore the constructions of human social networks. Despite the disparate understanding of individuals’ social networks, researchers have reached a consensus that human’s social networks are hierarchically organized with different layers, which represent emotional bonds and interaction frequency. Social brain hypothesis emphasizes the significance of complex and demanding social interaction environments and assumes that the cognitive constraints may have an impact on the social network size. This paper reviews neuroimaging studies on social networks that explored the connection between individuals’ social network size and neural mechanisms and finds that Social Network Index (SNI) and Social Network Questionnaires (SNQs) are the mostly-adopted measurements of one’s social network size. The two assessments have subtle difference in essence as they measure the different sublayers of one’s social network. The former measures the relatively outer sub-layer of one’s stable social relationship, similar to the sympathy group, while the latter assesses the innermost layer—the core of one’s social network, often referred to as support clique. This subtle difference is also corroborated by neuroimaging studies, as SNI-measured social network size is largely correlated with the amygdala, while SNQ-assessed social network size is closely related to both the amygdala and the orbitofrontal cortex. The two brain regions respond to disparate degrees of social closeness, respectively. Finally, it proposes a careful choice among the measurements for specific purposes and some new approaches to assess individuals’ social network size

    A Variable Ionized Disk Wind in the Black Hole Candidate EXO 1846–031

    Get PDF
    After 34 yr, the black hole candidate EXO 1846–031 went into outburst again in 2019. We investigate its spectral properties in the hard intermediate and the soft states with NuSTAR and Insight-HXMT. A reflection component has been detected in the two spectral states but possibly originating from different illumination spectra: in the intermediate state, the illuminating source is attributed to a hard coronal component, which has been commonly observed in other X-ray binaries, whereas in the soft state, the reflection is probably produced by disk self-irradiation. Both cases support EXO 1846–031 as a low-inclination system of ~40°. An absorption line is clearly detected at ~7.2 keV in the hard intermediate state, corresponding to a highly ionized disk wind (log} ξ > 6.1) with a velocity of up to 0.06c. Meanwhile, quasi-simultaneous radio emissions have been detected before and after the X-rays, implying the coexistence of disk winds and jets in this system. If only the high-flux segment of the NuSTAR observation is considered, the observed wind appears to be magnetically driven. The absorption line disappeared in the soft state and a narrow emission line appeared at ~6.7 keV on top of the reflection component, which may be evidence for disk winds, but data with higher spectral resolution are required to examine this

    Вихретоковый анизотропный термоэлектрический первичный преобразователь лучистого потока

    Get PDF
    Представлена оригинальная конструкция первичного преобразователя лучистого потока, который может служить основой для создания приемника неселективного излучения с повышенной чувствительностью
    corecore