205 research outputs found

    LEGION: Harnessing Pre-trained Language Models for GitHub Topic Recommendations with Distribution-Balance Loss

    Full text link
    Open-source development has revolutionized the software industry by promoting collaboration, transparency, and community-driven innovation. Today, a vast amount of various kinds of open-source software, which form networks of repositories, is often hosted on GitHub - a popular software development platform. To enhance the discoverability of the repository networks, i.e., groups of similar repositories, GitHub introduced repository topics in 2017 that enable users to more easily explore relevant projects by type, technology, and more. It is thus crucial to accurately assign topics for each GitHub repository. Current methods for automatic topic recommendation rely heavily on TF-IDF for encoding textual data, presenting challenges in understanding semantic nuances. This paper addresses the limitations of existing techniques by proposing Legion, a novel approach that leverages Pre-trained Language Models (PTMs) for recommending topics for GitHub repositories. The key novelty of Legion is three-fold. First, Legion leverages the extensive capabilities of PTMs in language understanding to capture contextual information and semantic meaning in GitHub repositories. Second, Legion overcomes the challenge of long-tailed distribution, which results in a bias toward popular topics in PTMs, by proposing a Distribution-Balanced Loss (DB Loss) to better train the PTMs. Third, Legion employs a filter to eliminate vague recommendations, thereby improving the precision of PTMs. Our empirical evaluation on a benchmark dataset of real-world GitHub repositories shows that Legion can improve vanilla PTMs by up to 26% on recommending GitHubs topics. Legion also can suggest GitHub topics more precisely and effectively than the state-of-the-art baseline with an average improvement of 20% and 5% in terms of Precision and F1-score, respectively.Comment: Accepted to EASE'2

    Impacts of Foreign Direct Investment on Economic Development: Does Institutional Quality Matter?

    Get PDF
    The linkage between foreign direct investment (FDI) and economic development has been demonstrated in economic literature. In this study, we analyze the impact of FDI on economic development, considering the role of institutional quality in 63 provinces/cities in Vietnam in the period 2005–2022. Applying various regression methods, such as Pooled OLS, FEM, REM, GMM, and PVAR, the results confirm that foreign direct investment and institutional quality have a positive impact on economic development. Findings also provide evidence that institutional quality is an important factor in attracting FDI, determining both the quality and quantity of inflows from other countries into Vietnam. Some policy implications are given to promote the role of institutions and attract foreign direct investment, thereby promoting the economic development of provinces and cities in Vietnam. Doi: 10.28991/ESJ-2023-07-06-05 Full Text: PD

    Beyond Traditional Approaches: Multi-Task Network for Breast Ultrasound Diagnosis

    Full text link
    Breast Ultrasound plays a vital role in cancer diagnosis as a non-invasive approach with cost-effective. In recent years, with the development of deep learning, many CNN-based approaches have been widely researched in both tumor localization and cancer classification tasks. Even though previous single models achieved great performance in both tasks, these methods have some limitations in inference time, GPU requirement, and separate fine-tuning for each model. In this study, we aim to redesign and build end-to-end multi-task architecture to conduct both segmentation and classification. With our proposed approach, we achieved outstanding performance and time efficiency, with 79.8% and 86.4% in DeepLabV3+ architecture in the segmentation task.Comment: 7 pages, 3 figure

    Investigation of the local environment of SnO2 in an applied magnetic field

    Get PDF
    This paper presents the results of time-differential perturbed gamma–gamma angular correlation measurements of SnO2 thin films carried out in an applied magnetic field. The measurements were performed upon the implantation of Fe at 80 keV and 111In (111Cd) at 160 keV. The samples were further characterized by energydispersive X-ray spectroscopy. The hyperfine parameters were studied at room temperature with and without an applied magnetic field. The results indicate the presence of two distinct local environments for the probe nuclei. Both occupy a paramagnetic state and correspond to a substitutional Sn site in the rutile phase of SnO2 with different numbers of electrons added to SnO2:Cd0. In addition, the crystal homogeneity of the site 1 increases upon applying the magnetic field

    Investigation of sorption of Cu₂₊, Zn₂₊ and Cd₂₊ ions by a composite adsorbent obtained from bentonite-like clay and hydroxyapatite

    Get PDF
    The article analyses the results of the measuring of pHpzc (point zero charge) of the surface of adsorbents by potentiometric titration are presente

    Medication Use and Adherence in Patients with Hypertension: A Prospective Study in Vietnam

    Get PDF
    Objective: to document patients’ antihypertensive agents, determine their medication adherence, and identify factors associated with the adherence.  Material and Methods: A prospective study was performed on a group of hypertensive outpatients, with social health insurance, in Can Tho, Vietnam. The study included 330 patients over 18 years old, who agreed to participate and could listen, speak and answer questions in Vietnamese. The data collection method was based on prescriptions and patient interviews. Data were analyzed using descriptive statistics, and Generalized Estimating Equations with Poisson-log linear distribution.  Results: Among the drug use characteristics, 76.1% were prescribed beta-blockers, 91.5% polytherapy, and 63.0% changed drugs at the third follow-up visit. The percentage of patients who adhered to medication ranged from 70.0% to 91.2%. Factors that improved drug adherence included: the academic level at high school or higher (39.0% increase), living in urban areas (15.0% increase), having a job related to social interaction (11.2%), and having a family history of hypertension (9.0% increase). Factors that reduced adherence included: advanced age (22.0% decrease), prolonged disease duration (16.0% decrease), prolonged treatment duration (11.0% decrease), and changes in at least one type of antihypertensive drug (8.0% decrease).  Conclusion: The highlight of this study is the demonstration of an inverse relationship between the adherence rate and the number of follow-up visits: the higher the number of visits, the lower the adherence rate. The 3rd follow-up adherence rate was 70.0%, and the decreased adherence rate is related to older age, higher education levels, and a longer duration of treatment

    Class based Influence Functions for Error Detection

    Full text link
    Influence functions (IFs) are a powerful tool for detecting anomalous examples in large scale datasets. However, they are unstable when applied to deep networks. In this paper, we provide an explanation for the instability of IFs and develop a solution to this problem. We show that IFs are unreliable when the two data points belong to two different classes. Our solution leverages class information to improve the stability of IFs. Extensive experiments show that our modification significantly improves the performance and stability of IFs while incurring no additional computational cost.Comment: Thang Nguyen-Duc, Hoang Thanh-Tung, and Quan Hung Tran are co-first authors of this paper. 12 pages, 12 figures. Accepted to ACL 202
    corecore