Search CORE

136 research outputs found

Define, Evaluate, and Improve Task-Oriented Cognitive Capabilities for Instruction Generation Models

Author: Daumé III Hal
Nguyen Khanh
Zhao Lingjun
Publication venue
Publication date: 28/05/2023
Field of study

Recent work studies the cognitive capabilities of language models through psychological tests designed for humans. While these studies are helpful for understanding the general capabilities of these models, there is no guarantee that a model possessing sufficient capabilities to pass those tests would actually use those capabilities in performing real-life tasks. In this work, we formulate task-oriented cognitive capabilities, which are human-like cognitive capabilities that language models leverage to perform tasks. These capabilities are (i) the ability to quickly generate good candidate utterances (the search capability) (ii) the ability to predict how a listener interprets those utterances and choose the most appropriate one (the pragmatic capability). We design an evaluation scheme for comparing these capabilities of a language model with those of a human. Applying this scheme to examine various models in a navigation instruction generation problem, we find that their pragmatic capability is severely lacking. This insight leads us to augment them with better models of the listener and obtain a significant boost of 11% in success rate in guiding real humans. Our work advocates for having a principled procedure for aligning language models with humans that involves (i) formulating task-oriented capabilities, (ii) devising a method to quantify their deficiency, and (iii) iteratively improving them.Comment: Findings of ACL 202

arXiv.org e-Print Archive

Hallucination Detection for Grounded Instruction Generation

Author: Daumé III Hal
Nguyen Khanh
Zhao Lingjun
Publication venue
Publication date: 23/10/2023
Field of study

We investigate the problem of generating instructions to guide humans to navigate in simulated residential environments. A major issue with current models is hallucination: they generate references to actions or objects that are inconsistent with what a human follower would perform or encounter along the described path. We develop a model that detects these hallucinated references by adopting a model pre-trained on a large corpus of image-text pairs, and fine-tuning it with a contrastive loss that separates correct instructions from instructions containing synthesized hallucinations. Our final model outperforms several baselines, including using word probability estimated by the instruction-generation model, and supervised models based on LSTM and Transformer

arXiv.org e-Print Archive

Successfully Guiding Humans with Imperfect Instructions by Highlighting Potential Errors and Suggesting Corrections

Author: Daumé III Hal
Nguyen Khanh
Zhao Lingjun
Publication venue
Publication date: 26/02/2024
Field of study

This paper addresses the challenge of leveraging imperfect language models to guide human decision-making in the context of a grounded navigation task. We show that an imperfect instruction generation model can be complemented with an effective communication mechanism to become more successful at guiding humans. The communication mechanism we build comprises models that can detect potential hallucinations in instructions and suggest practical alternatives, and an intuitive interface to present that information to users. We show that this approach reduces the human navigation error by up to 29% with no additional cognitive burden. This result underscores the potential of integrating diverse communication channels into AI systems to compensate for their imperfections and enhance their utility for humans

arXiv.org e-Print Archive

CRKD: Enhanced Camera-Radar Object Detection with Cross-modality Knowledge Distillation

Author: Skinner Katherine A.
Song Jingyu
Zhao Lingjun
Publication venue
Publication date: 27/03/2024
Field of study

In the field of 3D object detection for autonomous driving, LiDAR-Camera (LC) fusion is the top-performing sensor configuration. Still, LiDAR is relatively high cost, which hinders adoption of this technology for consumer automobiles. Alternatively, camera and radar are commonly deployed on vehicles already on the road today, but performance of Camera-Radar (CR) fusion falls behind LC fusion. In this work, we propose Camera-Radar Knowledge Distillation (CRKD) to bridge the performance gap between LC and CR detectors with a novel cross-modality KD framework. We use the Bird's-Eye-View (BEV) representation as the shared feature space to enable effective knowledge distillation. To accommodate the unique cross-modality KD path, we propose four distillation losses to help the student learn crucial features from the teacher model. We present extensive evaluations on the nuScenes dataset to demonstrate the effectiveness of the proposed CRKD framework. The project page for CRKD is https://song-jingyu.github.io/CRKD.Comment: Accepted to CVPR 202

arXiv.org e-Print Archive

YUAN 2.0: A Large Language Model with Localized Filtering-based Attention

Author: Chen Xi
Li Lingjun
Luo Jiangang
Wang Chao
Wang Shenling
Wang Wei
Wu Shaohua
Yu Tong
Zhang Jiahua
Zhang Rongguo
Zhao Bing
Zhao Xudong
Publication venue
Publication date: 18/12/2023
Field of study

In this work, we develop and release Yuan 2.0, a series of large language models with parameters ranging from 2.1 billion to 102.6 billion. The Localized Filtering-based Attention (LFA) is introduced to incorporate prior knowledge of local dependencies of natural language into Attention. A data filtering and generating system is presented to build pre-training and fine-tuning dataset in high quality. A distributed training method with non-uniform pipeline parallel, data parallel, and optimizer parallel is proposed, which greatly reduces the bandwidth requirements of intra-node communication, and achieves good performance in large-scale distributed training. Yuan 2.0 models display impressive ability in code generation, math problem-solving, and chatting compared with existing models. The latest version of YUAN 2.0, including model weights and source code, is accessible at Github

arXiv.org e-Print Archive

SYNERGISTIC EFFECT OF RADIATION AND TRADITIONAL CHINESE MEDICINE RHIZOMA TYPHONII ETHANOL EXTRACTS DEPENDS ON P53 EXPRESSION IN TREATMENT OF LEWIS MOUSE LUNG CANCER CELLS

Author: Li Xin
Liu Xiaodong
Luo Yungang
Luo Yunxiao
Zhang Yifan
Zhao Lingjun
Zhao Yinlong
Zhong Lili
Publication venue: 'African Traditional Herbal Medicine Supporters Initiative (ATHMSI)'
Publication date: 29/11/2014
Field of study

Background: Lung cancer is the leading cause of cancer-related death, and it is the most common cancer in terms of both incidence and mortality. There is an urgent need on novel therapeutic strategies for lung cancer. Traditional Chinese herbal medicines (CHM) have potential valuable for cancer treatment. Materials and Methods: Lewis mouse lung cancer cell line and Lewis cells tumors xenograft were used in this experiment. MTT assay was used to detect cell proliferation, flow cytometry (FCM) analysis and Western blotting to detect cell apoptosis, and colony formation assay to evaluate the effect of combined therapy of RT+IR. Results: Our data showed that Rhizoma typhonii (RT) obviously inhibited the proliferations of Lewis cells in time and dose dependent manners by MTT assay and enhanced radiosensitivity by colony formation assay. The effects of RT to Ionizing radiation (IR) therapy were demonstrated radiosensitivity on tumors xenograft experiment. In our study, RT induced apoptotic in Lewis cells directly and enhanced the pro-apoptotic effect of IR by regulating the expression of p53. Conclusions: These data suggested that RT may be a great potential anti-tumor medicine and the combination of RT and IR may provide a new therapeutic strategy for the treatment of Lewis lung cancer

ATHMSI Journals

Block-Sparse Coding-Based Machine Learning Approach for Dependable Device-Free Localization in IoT Environment

Author: Ding Shuxue
Huang Huakun
Huang Huawei
Li Zhenni
Su Chunhua
Tan Zhiyuan
Zhao Lingjun
Publication venue: Institute of Electrical and Electronics Engineers
Publication date: 01/03/2021
Field of study

Device-free localization (DFL) locates targets without equipping with wireless devices or tag under the Internet-of-Things (IoT) architectures. As an emerging technology, DFL has spawned extensive applications in IoT environment, such as intrusion detection, mobile robot localization, and location-based services. Current DFL-related machine learning (ML) algorithms still suffer from low localization accuracy and weak dependability/robustness because the group structure has not been considered in their location estimation, which leads to a undependable process. To overcome these challenges, we propose in this work a dependable block-sparse scheme by particularly considering the group structure of signals. An accurate and robust ML algorithm named block-sparse coding with the proximal operator (BSCPO) is proposed for DFL. In addition, a severe Gaussian noise is added in the original sensing signals for preserving network-related privacy as well as improving the dependability of model. The real-world data-driven experimental results show that the proposed BSCPO achieves robust localization and signal-recovery performance even under severely noisy conditions and outperforms state-of-the-art DFL methods. For single-target localization, BSCPO retains high accuracy when the signal-to-noise ratio exceeds-10 dB. BSCPO is also able to localize accurately under most multitarget localization test cases

Repository@Napier

A five-collagen-based risk model in lung adenocarcinoma: prognostic significance and immune landscape

Author: Guangmao Yu
Jiandong Zhang
Jianyi Ding
Junjun Zhao
Lingjun Dong
Linhai Fu
Ting Zhu
Xiang Wang
Yuanlin Wu
Zhupeng Li
Publication venue: 'Frontiers Media SA'
Publication date: 01/07/2023
Field of study

As part of the tumor microenvironment (TME), collagen plays a significant role in cancer fibrosis formation. However, the collagen family expression profile and clinical features in lung adenocarcinoma (LUAD) are poorly understood. The objective of the present work was to investigate the expression pattern of genes from the collagen family in LUAD and to develop a predictive signature based on collagen family. The Cancer Genome Atlas (TCGA) samples were used as the training set, and five additional cohort samples obtained from the Gene Expression Omnibus (GEO) database were used as the validation set. A predictive model based on five collagen genes, including COL1A1, COL4A3, COL5A1, COL11A1, and COL22A1, was created by analyzing samples from the TCGA cohort using LASSO Cox analysis and univariate/multivariable Cox regression. Using Collagen-Risk scores, LUAD patients were then divided into high- and low-risk groups. KM survival analysis showed that collagen signature presented a robust prognostic power. GO and KEGG analyses confirmed that collagen signature was associated with extracellular matrix organization, ECM-receptor interaction, PI3K-Akts and AGE-RAGE signaling activation. High-risk patients exhibited a considerable activation of the p53 pathway and cell cycle, according to GSEA analysis. The Collage-Risk model showed unique features in immune cell infiltration and tumor-associated macrophage (TAM) polarization of the TME. Additionally, we deeply revealed the association of collagen signature with immune checkpoints (ICPs), tumor mutation burden (TMB), and tumor purity. We first constructed a reliable prognostic model based on TME principal component—collagen, which would enable clinicians to treat patients with LUAD more individually

Directory of Open Access Journals

Enhancing regulatory T-cell function via inhibition of high mobility group box 1 protein signaling in immune thrombocytopenia

Author: Guosheng Li
Haoyi Wang
Jun Peng
Lingjun Wang
Ming Hou
Ning An
Panpan Han
Pengcheng Xu
Tianshu Yu
Xiaofei Ni
Yajing Zhao
Yanfeng Liu
Yu Hou
Yubin Li
Yunqi Sun
Publication venue: Ferrata Storti Foundation
Publication date: 01/10/2022
Field of study

Primary immune thrombocytopenia (ITP) is the most common acquired autoimmune bleeding disorder. Abnormally increased levels of High Mobility Group Box 1 (HMGB1) protein associate with thrombocytopenia and therapeutic outcome in ITP. Previous studies proposed that a natural inhibitor of HMGB1, 18β-glycyrrhetinic acid (18β-GA), could be used for its anti-inflammatory and immune-modulatory effects, although its ability to correct immune balance in ITP is unclear. In this study, we showed that plasma HMGB1 correlated negatively with platelet counts in ITP patients, and confirmed that 18β-GA stimulated the production of regulatory T cells (Treg), restored the balance of CD4+ T-cell subsets and enhanced the suppressive function of Treg through blocking the effect on HMGB1 in patients with ITP. HMGB1 short hairpin RNA interference masked the effect of 18β-GA in Treg of ITP patients. Furthermore, we found that 18β-GA alleviated thrombocytopenia in mice with ITP. Briefly, anti-CD61 immune-sensitized splenocytes were transferred into severe combined immunodeficient mice to induce a murine model of severe ITP. The proportion of circulating Treg increased significantly, while the level of plasma HMGB1 and serum antiplatelet antibodies decreased significantly in ITP mice along 18β-GA treatment. In addition, 18β-GA reduced phagocytic activity of macrophages towards platelets both in ITP patients and ITP mice. These results indicate that 18β-GA has the potential to restore immune balance in ITP via inhibition of HMGB1 signaling. In short, this study reveals the role of HMGB1 in ITP, which may serve as a potential target for thrombocytopenia therapy

Directory of Open Access Journals