Search CORE

179 research outputs found

TableLlama: Towards Open Large Generalist Models for Tables

Author: Li Yifei
Sun Huan
Yue Xiang
Zhang Tianshu
Publication venue
Publication date: 15/11/2023
Field of study

Semi-structured tables are ubiquitous. There has been a variety of tasks that aim to automatically interpret, augment, and query tables. Current methods often require pretraining on tables or special model architecture design, are restricted to specific table types, or have simplifying assumptions about tables and tasks. This paper makes the first step towards developing open-source large language models (LLMs) as generalists for a diversity of table-based tasks. Towards that end, we construct TableInstruct, a new dataset with a variety of realistic tables and tasks, for instruction tuning and evaluating LLMs. We further develop the first open-source generalist model for tables, TableLlama, by fine-tuning Llama 2 (7B) with LongLoRA to address the long context challenge. We experiment under both in-domain setting and out-of-domain setting. On 7 out of 8 in-domain tasks, TableLlama achieves comparable or better performance than the SOTA for each task, despite the latter often has task-specific design. On 6 out-of-domain datasets, it achieves 6-48 absolute point gains compared with the base model, showing that training on TableInstruct enhances the model's generalizability. We will open-source our dataset and trained model to boost future work on developing open generalist models for tables

arXiv.org e-Print Archive

AttributionBench: How Hard is Automatic Attribution Evaluation?

Author: Li Yifei
Liao Zeyi
Sun Huan
Yue Xiang
Publication venue
Publication date: 22/02/2024
Field of study

Modern generative search engines enhance the reliability of large language model (LLM) responses by providing cited evidence. However, evaluating the answer's attribution, i.e., whether every claim within the generated responses is fully supported by its cited evidence, remains an open problem. This verification, traditionally dependent on costly human evaluation, underscores the urgent need for automatic attribution evaluation methods. To bridge the gap in the absence of standardized benchmarks for these methods, we present AttributionBench, a comprehensive benchmark compiled from various existing attribution datasets. Our extensive experiments on AttributionBench reveal the challenges of automatic attribution evaluation, even for state-of-the-art LLMs. Specifically, our findings show that even a fine-tuned GPT-3.5 only achieves around 80% macro-F1 under a binary classification formulation. A detailed analysis of more than 300 error cases indicates that a majority of failures stem from the model's inability to process nuanced information, and the discrepancy between the information the model has access to and that human annotators do

arXiv.org e-Print Archive

Rosmarinic Acid Prevents Cisplatin-Induced Liver and Kidney Injury by Inhibiting Inflammatory Responses and Enhancing Total Antioxidant Capacity, Thereby Activating the Nrf2 Signaling Pathway

Author: Ji Min
Lv Li
Wu Liqin
Xiang Yifei
Publication venue: eGrove
Publication date: 01/11/2022
Field of study

Drug-induced liver and kidney damage is an emergent clinical issue that should be addressed. Rosmarinic acid (RA) has obvious anti-inflammatory and antioxidant effects, so we evaluated the anti-inflammatory and antioxidant effects of RA pretreatment on serum and liver and kidney tissues of cisplatin (CP)-treated mice and explored the possible mechanisms. The results showed that RA pretreatment effectively downregulated the serum, liver, and kidney levels of ALT, AST, BUN, and CRE and the inflammatory factors IL-1β, IL-6, and TNF-α, and simultaneously enhanced the total antioxidant capacity of the liver and kidney. RA pretreatment significantly reduced the levels of MPO, MDA, and NO in liver and kidney tissue, inhibited the mRNA expression of IL-1β, IL-6, and TNF-α in liver and kidney tissue, activated the Nrf2 signaling pathway, and upregulated the mRNA expression of downstream target genes. Our findings show that RA could effectively prevent and alleviate acute liver and kidney injury caused by CP

eGrove (Univ. of Mississippi)

Acceleration compensation of a novel piezoelectric balance for the short duration impulse measurement: a time series analysis approach

Author: Wanting Dai
Yifei Gao
Zhenyuan Jia
Zhongtian Xiang
Zongjin Ren
Publication venue: 'JVE International Ltd.'
Publication date: 30/12/2015
Field of study

A novel piezoelectric balance was developed to measure the six-component forces for the complex aircraft scaled model in the impulse combustion wind tunnel at a short duration airloads Mach number of 5. The piezoelectric balance using four triaxial piezoelectric load cells yields the high stiffness, sensitive and good dynamic response characteristics. The dynamic model-balance system was built to analyze the vibration characteristic. The time series analysis approach was developed on the basis of the system transfer function and the natural frequency, and the accelerated forces which induce the airloads overshooting oscillations had been obtained by the second order derivatives function. The experimental results have shown that the problem of overshooting oscillations effect of the impulse can be effectively solved by the acceleration compensation technology for the complex test model with the novel piezoelectric balance

Journal of Vibroengineering

Fully distributed consensus for high-order strict-feedback nonlinear multiagent systems with switched topologies

Author: Wang Ronghao
Wu Yifei
Xiang Zhengrong
Xu Rui
Zheng Sunyu
Publication venue: Institute of Information Theory and Automation AS CR
Publication date: 01/01/2023
Field of study

summary:This paper studies the distributed consensus problem of high-order strict-feedback nonlinear multiagent systems. By employing the adaptive backstepping technique and switched system theory, a novel protocol is proposed for MASs with switched topologies. Global information such as the number of agents and communication topology is not used. In addition, the communication topology between agents can be switched between possible topologies at any time. Based on the Lyapunov function method, the proposed adaptive protocol guarantees the complete consensus of multiagent systems without restricting the dwell time of the switched signal. Finally, two numerical examples are provided to illustrate the effectiveness and advantages of the given protocol

Institute of Mathematics AS CR, v. v. i.

Label Propagation for Graph Label Noise

Author: Cheng Yao
Li Dongsheng
Li Xiang
Luo Siqiang
Shan Caihua
Shen Yifei
Publication venue
Publication date: 25/10/2023
Field of study

Label noise is a common challenge in large datasets, as it can significantly degrade the generalization ability of deep neural networks. Most existing studies focus on noisy labels in computer vision; however, graph models encompass both node features and graph topology as input, and become more susceptible to label noise through message-passing mechanisms. Recently, only a few works have been proposed to tackle the label noise on graphs. One major limitation is that they assume the graph is homophilous and the labels are smoothly distributed. Nevertheless, real-world graphs may contain varying degrees of heterophily or even be heterophily-dominated, leading to the inadequacy of current methods. In this paper, we study graph label noise in the context of arbitrary heterophily, with the aim of rectifying noisy labels and assigning labels to previously unlabeled nodes. We begin by conducting two empirical analyses to explore the impact of graph homophily on graph label noise. Following observations, we propose a simple yet efficient algorithm, denoted as LP4GLN. Specifically, LP4GLN is an iterative algorithm with three steps: (1) reconstruct the graph to recover the homophily property, (2) utilize label propagation to rectify the noisy labels, (3) select high-confidence labels to retain for the next iteration. By iterating these steps, we obtain a set of correct labels, ultimately achieving high accuracy in the node classification task. The theoretical analysis is also provided to demonstrate its remarkable denoising "effect". Finally, we conduct experiments on 10 benchmark datasets under varying graph heterophily levels and noise types, comparing the performance of LP4GLN with 7 typical baselines. Our results illustrate the superior performance of the proposed LP4GLN

arXiv.org e-Print Archive

Generating by Understanding: Neural Visual Generation with Logical Symbol Groundings

Author: Dai Wang-Zhou
Ding Yao-Xiang
Jin Yu
Luo Zhexu
Peng Yifei
Ren Zhong
Zhou Kun
Publication venue
Publication date: 26/10/2023
Field of study

Despite the great success of neural visual generative models in recent years, integrating them with strong symbolic knowledge reasoning systems remains a challenging task. The main challenges are two-fold: one is symbol assignment, i.e. bonding latent factors of neural visual generators with meaningful symbols from knowledge reasoning systems. Another is rule learning, i.e. learning new rules, which govern the generative process of the data, to augment the knowledge reasoning systems. To deal with these symbol grounding problems, we propose a neural-symbolic learning approach, Abductive Visual Generation (AbdGen), for integrating logic programming systems with neural visual generative models based on the abductive learning framework. To achieve reliable and efficient symbol assignment, the quantized abduction method is introduced for generating abduction proposals by the nearest-neighbor lookups within semantic codebooks. To achieve precise rule learning, the contrastive meta-abduction method is proposed to eliminate wrong rules with positive cases and avoid less-informative rules with negative cases simultaneously. Experimental results on various benchmark datasets show that compared to the baselines, AbdGen requires significantly fewer instance-level labeling information for symbol assignment. Furthermore, our approach can effectively learn underlying logical generative rules from data, which is out of the capability of existing approaches

arXiv.org e-Print Archive