Search CORE

200 research outputs found

Progressive-Hint Prompting Improves Reasoning in Large Language Models

Author: Li Yu
Li Zhenguo
Liu Zhengying
Xie Enze
Zheng Chuanyang
Publication venue
Publication date: 19/04/2023
Field of study

The performance of Large Language Models (LLMs) in reasoning tasks depends heavily on prompt design, with Chain-of-Thought (CoT) and self-consistency being critical methods that enhance this ability. However, these methods do not fully exploit the answers generated by the LLM to guide subsequent responses. This paper proposes a new prompting method, named Progressive-Hint Prompting (PHP), that enables automatic multiple interactions between users and LLMs by using previously generated answers as hints to progressively guide toward the correct answers. PHP is orthogonal to CoT and self-consistency, making it easy to combine with state-of-the-art techniques to further improve performance. We conducted an extensive and comprehensive evaluation to demonstrate the effectiveness of the proposed method. Our experimental results on six benchmarks show that combining CoT and self-consistency with PHP significantly improves accuracy while remaining highly efficient. For instance, with text-davinci-003, we observed a 4.2% improvement on GSM8K with greedy decoding compared to Complex CoT, and a 46.17% reduction in sample paths with self-consistency. With GPT-4 and PHP, we achieve state-of-the-art performances on SVAMP (91.9%), GSM8K (95.5%) and AQuA (79.9%).Comment: Tech Repor

arXiv.org e-Print Archive

An adaptive model checking test for functional linear model

Author: Kong Linglong
Li Lingzhu
Liu Yi
Shi Enze
Sun Ke
Publication venue
Publication date: 04/04/2022
Field of study

Numerous studies have been devoted to the estimation and inference problems for functional linear models (FLM). However, few works focus on model checking problem that ensures the reliability of results. Limited tests in this area do not have tractable null distributions or asymptotic analysis under alternatives. Also, the functional predictor is usually assumed to be fully observed, which is impractical. To address these problems, we propose an adaptive model checking test for FLM. It combines regular moment-based and conditional moment-based tests, and achieves model adaptivity via the dimension of a residual-based subspace. The advantages of our test are manifold. First, it has a tractable chi-squared null distribution and higher powers under the alternatives than its components. Second, asymptotic properties under different underlying models are developed, including the unvisited local alternatives. Third, the test statistic is constructed upon finite grid points, which incorporates the discrete nature of collected data. We develop the desirable relationship between sample size and number of grid points to maintain the asymptotic properties. Besides, we provide a data-driven approach to estimate the dimension leading to model adaptivity, which is promising in sufficient dimension reduction. We conduct comprehensive numerical experiments to demonstrate the advantages the test inherits from its two simple components

arXiv.org e-Print Archive

T2I-CompBench: A Comprehensive Benchmark for Open-world Compositional Text-to-image Generation

Author: Huang Kaiyi
Li Zhenguo
Liu Xihui
Sun Kaiyue
Xie Enze
Publication venue
Publication date: 12/07/2023
Field of study

Despite the stunning ability to generate high-quality images by recent text-to-image models, current approaches often struggle to effectively compose objects with different attributes and relationships into a complex and coherent scene. We propose T2I-CompBench, a comprehensive benchmark for open-world compositional text-to-image generation, consisting of 6,000 compositional text prompts from 3 categories (attribute binding, object relationships, and complex compositions) and 6 sub-categories (color binding, shape binding, texture binding, spatial relationships, non-spatial relationships, and complex compositions). We further propose several evaluation metrics specifically designed to evaluate compositional text-to-image generation. We introduce a new approach, Generative mOdel fine-tuning with Reward-driven Sample selection (GORS), to boost the compositional text-to-image generation abilities of pretrained text-to-image models. Extensive experiments and evaluations are conducted to benchmark previous methods on T2I-CompBench, and to validate the effectiveness of our proposed evaluation metrics and GORS approach. Project page is available at https://karine-h.github.io/T2I-CompBench/.Comment: Project page: https://karine-h.github.io/T2I-CompBench

arXiv.org e-Print Archive

DiffFit: Unlocking Transferability of Large Diffusion Models via Simple Parameter-Efficient Fine-Tuning

Author: Li Jiawei
Li Zhenguo
Liu Zhaoqiang
Liu Zhili
Shi Han
Xie Enze
Yao Lewei
Zhou Daquan
Publication venue
Publication date: 13/04/2023
Field of study

Diffusion models have proven to be highly effective in generating high-quality images. However, adapting large pre-trained diffusion models to new domains remains an open challenge, which is critical for real-world applications. This paper proposes DiffFit, a parameter-efficient strategy to fine-tune large pre-trained diffusion models that enable fast adaptation to new domains. DiffFit is embarrassingly simple that only fine-tunes the bias term and newly-added scaling factors in specific layers, yet resulting in significant training speed-up and reduced model storage costs. Compared with full fine-tuning, DiffFit achieves 2

\times

training speed-up and only needs to store approximately 0.12\% of the total model parameters. Intuitive theoretical analysis has been provided to justify the efficacy of scaling factors on fast adaptation. On 8 downstream datasets, DiffFit achieves superior or competitive performances compared to the full fine-tuning while being more efficient. Remarkably, we show that DiffFit can adapt a pre-trained low-resolution generative model to a high-resolution one by adding minimal cost. Among diffusion-based methods, DiffFit sets a new state-of-the-art FID of 3.02 on ImageNet 512

\times

512 benchmark by fine-tuning only 25 epochs from a public pre-trained ImageNet 256

\times

256 checkpoint while being 30

\times

more training efficient than the closest competitor.Comment: Tech Repor

arXiv.org e-Print Archive

Pathology Steered Stratification Network for Subtype Identification in Alzheimer's Disease

Author: Chen Minghan
Li Jiadi
Song Qianqian
Wu Guorong
Xu Enze
Yang Defu
Zhang Jingwen
Publication venue
Publication date: 25/08/2023
Field of study

Alzheimer's disease (AD) is a heterogeneous, multifactorial neurodegenerative disorder characterized by beta-amyloid, pathologic tau, and neurodegeneration. There are no effective treatments for Alzheimer's disease at a late stage, urging for early intervention. However, existing statistical inference approaches of AD subtype identification ignore the pathological domain knowledge, which could lead to ill-posed results that are sometimes inconsistent with the essential neurological principles. Integrating systems biology modeling with machine learning, we propose a novel pathology steered stratification network (PSSN) that incorporates established domain knowledge in AD pathology through a reaction-diffusion model, where we consider non-linear interactions between major biomarkers and diffusion along brain structural network. Trained on longitudinal multimodal neuroimaging data, the biological model predicts long-term trajectories that capture individual progression pattern, filling in the gaps between sparse imaging data available. A deep predictive neural network is then built to exploit spatiotemporal dynamics, link neurological examinations with clinical profiles, and generate subtype assignment probability on an individual basis. We further identify an evolutionary disease graph to quantify subtype transition probabilities through extensive simulations. Our stratification achieves superior performance in both inter-cluster heterogeneity and intra-cluster homogeneity of various clinical scores. Applying our approach to enriched samples of aging populations, we identify six subtypes spanning AD spectrum, where each subtype exhibits a distinctive biomarker pattern that is consistent with its clinical outcome. PSSN provides insights into pre-symptomatic diagnosis and practical guidance on clinical treatments, which may be further generalized to other neurodegenerative diseases

arXiv.org e-Print Archive

SIAD: Self-supervised Image Anomaly Detection System

Author: Chen Bin
Huo Enze
Jiang Bolin
Lan Chenxi
Li Jiawei
Li Naiqi
Li Yaowei
Liu Yan
Xie Yuqiu
Zhang Xinyi
Publication venue
Publication date: 08/10/2023
Field of study

Recent trends in AIGC effectively boosted the application of visual inspection. However, most of the available systems work in a human-in-the-loop manner and can not provide long-term support to the online application. To make a step forward, this paper outlines an automatic annotation system called SsaA, working in a self-supervised learning manner, for continuously making the online visual inspection in the manufacturing automation scenarios. Benefit from the self-supervised learning, SsaA is effective to establish a visual inspection application for the whole life-cycle of manufacturing. In the early stage, with only the anomaly-free data, the unsupervised algorithms are adopted to process the pretext task and generate coarse labels for the following data. Then supervised algorithms are trained for the downstream task. With user-friendly web-based interfaces, SsaA is very convenient to integrate and deploy both of the unsupervised and supervised algorithms. So far, the SsaA system has been adopted for some real-life industrial applications.Comment: 4 pages, 3 figures, ICCV 2023 Demo Trac

arXiv.org e-Print Archive

A review on heterogeneous solid catalysts and related catalytic mechanisms for epoxidation of olefins with H2O2

Author: Du Zexue
Fan Yongxian
Ge Zhonghua
Li Xiaonian
Min Enze
Xie Huali
Zhou Chunhui
Publication venue: 'Croatian Society of Chemical Engineers/HDKI'
Publication date: 01/01/2008
Field of study

The epoxidation reaction using heterogeneous solid catalysts with H2O2 as oxidants are environmentally friendly routes to produce extensively useful epoxides which are traditionally obtained from capital-intensive or environmentally polluted processes. In this paper, various types of solid catalysts for the epoxidation of olefins with H2O2 as oxidants are reviewed. The efficient catalysts reported include microporous and mesoporous molecular sieves, layered-type materials, inorganic oxides, supported catalysts, zeolite encapsulated metal complexes, polyoxometalates, and supported organometallic catalysts. The proposed reaction mechanisms over different solid catalysts are summarized. The problems and perspectives to further efficiently improve the catalytic performances of the concerned heterogeneous catalysts for epoxidation reaction are remarked

HRČAK - Portal of Croatian Scientific and Professional Journals

Hrčak - Portal of scientific journals of Croatia