Search CORE

140 research outputs found

DocChecker: Bootstrapping Code Large Language Model for Detecting and Resolving Code-Comment Inconsistencies

Author: Bui Nghi D. Q.
Dau Anh T. V.
Guo Jin L. C.
Publication venue
Publication date: 02/02/2024
Field of study

Comments within source code are essential for developers to comprehend the code's purpose and ensure its correct usage. However, as codebases evolve, maintaining an accurate alignment between the comments and the code becomes increasingly challenging. Recognizing the growing interest in automated solutions for detecting and correcting differences between code and its accompanying comments, current methods rely primarily on heuristic rules. In contrast, this paper presents DocChecker, a tool powered by deep learning. DocChecker is adept at identifying inconsistencies between code and comments, and it can also generate synthetic comments. This capability enables the tool to detect and correct instances where comments do not accurately reflect their corresponding code segments. We demonstrate the effectiveness of DocChecker using the Just-In-Time and CodeXGlue datasets in different settings. Particularly, DocChecker achieves a new State-of-the-art result of 72.3% accuracy on the Inconsistency Code-Comment Detection (ICCD) task and 33.64 BLEU-4 on the code summarization task against other Large Language Models (LLMs), even surpassing GPT 3.5 and CodeLlama. DocChecker is accessible for use and evaluation. It can be found on our GitHub https://github.com/FSoft-AI4Code/DocChecker and as an Online Tool http://4.193.50.237:5000/. For a more comprehensive understanding of its functionality, a demonstration video is available on YouTube https://youtu.be/FqnPmd531xw

arXiv.org e-Print Archive

Effect of arbuscular mycorrhizal fungus on the growth and polyphenol production of medicinal plants: Ehretia asperula and Solanum procumben

Author: BUI Cuong V.
LE Quang D.
TRAN Lam D.
VO Anh T. K.
Publication venue: University of Agricultural Sciences and Veterinary Medicine (UASVM), Cluj-Napoca, Romania
Publication date: 24/02/2022
Field of study

The study was conducted to evaluate the influence of arbuscular mycorrhizal fungus (Rhizophagus intradices) on growth and polyphenol production of the two important and popular medicinal plants in Vietnam: Ehretia asperula Zoll. & Mor. and Solanum procumbens Lour. The results showed a significant effect of the fungus on the growth of these two species with the growth indices such as height, weight and P content that were all higher than those of non-AM plants; although the indices of AM symbiosis in the plant roots were not as high as other plants in previous studies. The effect of AM fungus on polyphenol production was different between the two species. In E. asperula, the effect of AM fungi on polyphenol production was not significant; whereas in S. procumbens, AM symbiosis significantly increased polyphenol production in plant biomass, especially in roots. The different growth times of the two species might cause the different effects of AM fungus on polyphenol production

Notulae Botanicae Horti Agrobotanici Cluj-Napoca

UIT-Saviors at MEDVQA-GI 2023: Improving Multimodal Learning with Image Enhancement for Gastrointestinal Visual Question Answering

Author: Bui Linh N. P.
Nguyen Thien T. B.
Thai Triet M.
Tieu Hao K.
Vo Anh T.
Publication venue
Publication date: 19/11/2023
Field of study

In recent years, artificial intelligence has played an important role in medicine and disease diagnosis, with many applications to be mentioned, one of which is Medical Visual Question Answering (MedVQA). By combining computer vision and natural language processing, MedVQA systems can assist experts in extracting relevant information from medical image based on a given question and providing precise diagnostic answers. The ImageCLEFmed-MEDVQA-GI-2023 challenge carried out visual question answering task in the gastrointestinal domain, which includes gastroscopy and colonoscopy images. Our team approached Task 1 of the challenge by proposing a multimodal learning method with image enhancement to improve the VQA performance on gastrointestinal images. The multimodal architecture is set up with BERT encoder and different pre-trained vision models based on convolutional neural network (CNN) and Transformer architecture for features extraction from question and endoscopy image. The result of this study highlights the dominance of Transformer-based vision models over the CNNs and demonstrates the effectiveness of the image enhancement process, with six out of the eight vision models achieving better F1-Score. Our best method, which takes advantages of BERT+BEiT fusion and image enhancement, achieves up to 87.25% accuracy and 91.85% F1-Score on the development test set, while also producing good result on the private test set with accuracy of 82.01%.Comment: ImageCLEF2023 published version: https://ceur-ws.org/Vol-3497/paper-129.pd

arXiv.org e-Print Archive

Structure-Function Relationships Affecting the Sensing Mechanism of Monolayer-Protected Cluster Doped Xerogel Amperometric Glucose Biosensors

Author: Bui Tram Anh
DiPasquale Luke T.
Hall Jackson R.
Leopold Michael C.
Minocha Aastha
Poulos Nicholas G.
Publication venue: UR Scholarship Repository
Publication date: 16/03/2015
Field of study

A systematic study of the structure–function relationships critical to understanding the sensing mechanism of 1st generation amperometric glucose biosensors with an embedded nanoparticle (NP) network is presented. Xerogel-based films featuring embedded glucose oxidase enzyme and doped with alkanethiolate-protected gold NPs, known as monolayer protected clusters (MPCs), exhibit significantly enhanced performance compared to analogous systems without NPs including higher sensitivity, faster response time, and extended linear/dynamic ranges. The proposed mechanism involves diffusion of the glucose to glucose oxidase within the xerogel, enzymatic reaction production of H2O2 with subsequent diffusion to the embedded network of MPCs where it is oxidized, an event immediately reported via fast electron transfer (ET) through the MPC system to the working electrode. Various aspects of the film construct and strategy are systematically probed using amperometry, voltammetry, and solid-state electronic conductivity measurements, including the effects of MPC peripheral chain length, MPC functionalization via place-exchange reaction, MPC core size, and the MPC density or concentration within the xerogel composite films. The collective results of these experiments support the proposed mechanism and identify interparticle spacing and the electronic communication through the MPC network is the most significant factor in the sensing scheme with the diffusional aspects of the mechanism that may be affected by film/MPC hydrophobicity and functionality (i.e., glucose and H2O2 diffusion) shown to be less substantial contributors to the overall enhanced performance. Understanding the structure–function relationships of effective sensing schemes allows for the employment of the strategy for future biosensor design toward clinically relevant targets

Crossref

University of Richmond

LEGION: Harnessing Pre-trained Language Models for GitHub Topic Recommendations with Distribution-Balance Loss

Author: Bui Anh M. T.
Cong Thanh-Le
Dang Yen-Trang
Huynh Quyet-Thang
Le Bach
Nguyen Phuc-Thanh
Nguyen Phuong T.
Publication venue
Publication date: 09/03/2024
Field of study

Open-source development has revolutionized the software industry by promoting collaboration, transparency, and community-driven innovation. Today, a vast amount of various kinds of open-source software, which form networks of repositories, is often hosted on GitHub - a popular software development platform. To enhance the discoverability of the repository networks, i.e., groups of similar repositories, GitHub introduced repository topics in 2017 that enable users to more easily explore relevant projects by type, technology, and more. It is thus crucial to accurately assign topics for each GitHub repository. Current methods for automatic topic recommendation rely heavily on TF-IDF for encoding textual data, presenting challenges in understanding semantic nuances. This paper addresses the limitations of existing techniques by proposing Legion, a novel approach that leverages Pre-trained Language Models (PTMs) for recommending topics for GitHub repositories. The key novelty of Legion is three-fold. First, Legion leverages the extensive capabilities of PTMs in language understanding to capture contextual information and semantic meaning in GitHub repositories. Second, Legion overcomes the challenge of long-tailed distribution, which results in a bias toward popular topics in PTMs, by proposing a Distribution-Balanced Loss (DB Loss) to better train the PTMs. Third, Legion employs a filter to eliminate vague recommendations, thereby improving the precision of PTMs. Our empirical evaluation on a benchmark dataset of real-world GitHub repositories shows that Legion can improve vanilla PTMs by up to 26% on recommending GitHubs topics. Legion also can suggest GitHub topics more precisely and effectively than the state-of-the-art baseline with an average improvement of 20% and 5% in terms of Precision and F1-score, respectively.Comment: Accepted to EASE'2

arXiv.org e-Print Archive

Pathogenicity of an H5N1 avian influenza virus isolated in Vietnam in 2012 and reliability of conjunctival samples for diagnosis of infection

Author: Bui Anh N.
Bui Vuong N.
Dao Tung D.
Imai Kunitoshi
Inui Kenjiro
Nguyen Khong V.
Nguyen Lien T.
Nguyen Tham T.H.
Ogawa Haruko
Pham Nga T.
Runstadler Jonathan
Trinh Dai Q.
Publication venue: 'Elsevier BV'
Publication date: 01/10/2013
Field of study

The continued spread of highly pathogenic avian influenza virus (HPAIV) subtype H5N1 among poultry in Vietnam poses a potential threat to animals and public health. To evaluate the pathogenicity of a 2012 H5N1 HPAIV isolate and to assess the utility of conjunctival swabs for viral detection and isolation in surveillance, an experimental infection with HPAIV subtype H5N1 was carried out in domestic ducks. Ducks were infected with 10[superscript 7.2] TCID[subscript 50] of A/duck/Vietnam/QB1207/2012 (H5N1), which was isolated from a moribund domestic duck. In the infected ducks, clinical signs of disease, including neurological disorder, were observed. Ducks started to die at 3 days-post-infection (dpi), and the study mortality reached 67%. Viruses were recovered from oropharyngeal and conjunctival swabs until 7 dpi and from cloacal swabs until 4 dpi. In the ducks that died or were sacrificed on 3, 5, or 6 dpi, viruses were recovered from lung, brain, heart, pancreas and intestine, among which the highest virus titers were in the lung, brain or heart. Results of virus titration were confirmed by real-time RT-PCR. Genetic and phylogenetic analysis of the HA gene revealed that the isolate belongs to clade 2.3.2.1 similarly to the H5N1 viruses isolated in Vietnam in 2012. The present study demonstrated that this recent HPAI H5N1 virus of clade 2.3.2.1 could replicate efficiently in the systemic organs, including the brain, and cause severe disease with neurological symptoms in domestic ducks. Therefore, this HPAI H5N1 virus seems to retain the neurotrophic feature and has further developed properties of shedding virus from the oropharynx and conjunctiva in addition to the cloaca, potentially posing a higher risk of virus spread through cross-contact and/or environmental transmission. Continued surveillance and diagnostic programs using conjunctival swabs in the field would further verify the apparent reliability of conjunctival samples for the detection of AIV.Japan Society for the Promotion of Science (Grant-in-Aid for Bilateral Joint Projects)Heiwa Nakajima FoundationNational Institute of Allergy and Infectious Diseases (U.S.) (Contract HHSN2662007000010C

DSpace@MIT

Crossref

PubMed Central

Retinal Alpha-Synuclein Accumulation Correlates with Retinal Dysfunction and Structural Thinning in the A53T Mouse Model of Parkinson’s Disease

Author: Bui Bang V
Finkelstein David I
Hoang Anh
Nguyen Christine T O
Tran Katie K N
Wong Vickie H Y
Publication venue: DigitalCommons@TMC
Publication date: 01/01/2023
Field of study

Abnormal alpha-synuclein (α-SYN) protein deposition has long been recognized as one of the pathological hallmarks of Parkinson\u27s disease\u27s (PD). This study considers the potential utility of PD retinal biomarkers by investigating retinal changes in a well characterized PD model of α-SYN overexpression and how these correspond to the presence of retinal α-SYN. Transgenic A53T homozygous (HOM) mice overexpressing human α-SYN and wildtype (WT) control littermates were assessed at 4, 6, and 14 months of age (male and female

DigitalCommons@The Texas Medical Center

Clinically relevant preservation conditions for mesenchymal stem/stromal cells derived from perinatal and adult tissue sources

Author: Bach Trung Q.
Bui Anh V.
Bui Hue T.H.
Hoang Duc M.
Hoang Van T.
Jun Adriel Peng Guo
Le Hang M.
Ngo Anh T.L.
Nguyen Liem T.
Trinh Nhung T.H.
Publication venue: Wiley
Publication date: 01/11/2021
Field of study

The interplay between mesenchymal stem/stromal cells (MSCs) and preservation conditions is critical to maintain the viability and functionality of these cells before administration. We observed that Ringer lactate (RL) maintained high viability of bone marrow–derived MSCs for up to 72 h at room temperature (18°C–22°C), whereas adipose-derived and umbilical cord-derived MSCs showed the highest viability for 72 h at a cold temperature (4°C–8°C). These cells maintained their adherence ability with an improved recovery rate and metabolic profiles (glycolysis and mitochondrial respiration) similar to those of freshly harvested cells. Growth factor and cytokine analyses revealed that the preserved cells released substantial amounts of leukaemia inhibitory factors (LIFs), hepatocyte growth factor (HGF) and vascular endothelial growth factor-A (VEGF-A), as well as multiple cytokines (eg IL-4, IL-6, IL-8, MPC-1 and TNF-α). Our data provide the simplest clinically relevant preservation conditions that maintain the viability, stemness and functionality of MSCs from perinatal and adult tissue sources

Repository@Nottingham

PubMed Central

A New Approach and Tool in Verifying Asynchronous Circuits

Author: Bui Thang H.
Dinh-Duc Anh-Vu
Le-Huu Khoi-Nguyen
Nguyen Tin T.
Publication venue: 'The Radio and Electronics Association of Vietnam (REV)'
Publication date: 01/06/2013
Field of study

Research in asynchronous circuit approach has been carried out recently when asynchronous circuits are presented more widely in electronic systems. As they are more important in human life, their correctness should be considered carefully. Although there are some EDA tools for design and synthesis of asynchronous circuits, they are lack of methods for verifying the correctness of the produced circuits. In this work, we are about to propose a verification method and apply it in making a new version of the PAiD tool that can enable engineers to design, synthesize and verify asynchronous circuits. Experiments in verifying circuits have been also provided in this work

Crossref

REV Journal on Electronics and Communications

Class based Influence Functions for Error Detection

Author: Bui Nghi D. Q.
Dau Anh T. V.
Huu-Tien Dang
Nguyen Hieu Ngoc
Nguyen-Duc Thang
Thanh-Tung Hoang
Tran Quan Hung
Publication venue
Publication date: 02/05/2023
Field of study

Influence functions (IFs) are a powerful tool for detecting anomalous examples in large scale datasets. However, they are unstable when applied to deep networks. In this paper, we provide an explanation for the instability of IFs and develop a solution to this problem. We show that IFs are unreliable when the two data points belong to two different classes. Our solution leverages class information to improve the stability of IFs. Extensive experiments show that our modification significantly improves the performance and stability of IFs while incurring no additional computational cost.Comment: Thang Nguyen-Duc, Hoang Thanh-Tung, and Quan Hung Tran are co-first authors of this paper. 12 pages, 12 figures. Accepted to ACL 202

arXiv.org e-Print Archive