196 research outputs found

    Explanation Regeneration via Information Bottleneck

    Full text link
    Explaining the black-box predictions of NLP models naturally and accurately is an important open problem in natural language generation. These free-text explanations are expected to contain sufficient and carefully-selected evidence to form supportive arguments for predictions. Due to the superior generative capacity of large pretrained language models, recent work built on prompt engineering enables explanation generation without specific training. However, explanation generated through single-pass prompting often lacks sufficiency and conciseness. To address this problem, we develop an information bottleneck method EIB to produce refined explanations that are sufficient and concise. Our approach regenerates the free-text explanation by polishing the single-pass output from the pretrained language model but retaining the information that supports the contents being explained. Experiments on two out-of-domain tasks verify the effectiveness of EIB through automatic evaluation and thoroughly-conducted human evaluation.Comment: Accepted in ACL2023 Finding

    DiffuSeq: Sequence to Sequence Text Generation with Diffusion Models

    Full text link
    Recently, diffusion models have emerged as a new paradigm for generative models. Despite the success in domains using continuous signals such as vision and audio, adapting diffusion models to natural language is under-explored due to the discrete nature of texts, especially for conditional generation. We tackle this challenge by proposing DiffuSeq: a diffusion model designed for sequence-to-sequence (Seq2Seq) text generation tasks. Upon extensive evaluation over a wide range of Seq2Seq tasks, we find DiffuSeq achieving comparable or even better performance than six established baselines, including a state-of-the-art model that is based on pre-trained language models. Apart from quality, an intriguing property of DiffuSeq is its high diversity during generation, which is desired in many Seq2Seq tasks. We further include a theoretical analysis revealing the connection between DiffuSeq and autoregressive/non-autoregressive models. Bringing together theoretical analysis and empirical evidence, we demonstrate the great potential of diffusion models in complex conditional language generation tasks. Code is available at \url{https://github.com/Shark-NLP/DiffuSeq}Comment: ICLR 2023 camera read

    Compositional Exemplars for In-context Learning

    Full text link
    Large pretrained language models (LMs) have shown impressive In-Context Learning (ICL) ability, where the model learns to do an unseen task via a prompt consisting of input-output examples as the demonstration, without any parameter updates. The performance of ICL is highly dominated by the quality of the selected in-context examples. However, previous selection methods are mostly based on simple heuristics, leading to sub-optimal performance. In this work, we formulate in-context example selection as a subset selection problem. We propose CEIL (Compositional Exemplars for In-context Learning), which is instantiated by Determinantal Point Processes (DPPs) to model the interaction between the given input and in-context examples, and optimized through a carefully-designed contrastive learning objective to obtain preference from LMs. We validate CEIL on 12 classification and generation datasets from 7 distinct NLP tasks, including sentiment analysis, paraphrase detection, natural language inference, commonsense reasoning, open-domain question answering, code generation, and semantic parsing. Extensive experiments demonstrate not only the state-of-the-art performance but also the transferability and compositionality of CEIL, shedding new light on effective and efficient in-context learning. Our code is released at https://github.com/HKUNLP/icl-ceil.Comment: Accepted in ICML 202

    Unsupervised Explanation Generation via Correct Instantiations

    Full text link
    While large pre-trained language models (PLM) have shown their great skills at solving discriminative tasks, a significant gap remains when compared with humans for explanation-related tasks. Among them, explaining the reason why a statement is wrong (e.g., against commonsense) is incredibly challenging. The major difficulty is finding the conflict point, where the statement contradicts our real world. This paper proposes Neon, a two-phrase, unsupervised explanation generation framework. Neon first generates corrected instantiations of the statement (phase I), then uses them to prompt large PLMs to find the conflict point and complete the explanation (phase II). We conduct extensive experiments on two standard explanation benchmarks, i.e., ComVE and e-SNLI. According to both automatic and human evaluations, Neon outperforms baselines, even for those with human-annotated instantiations. In addition to explaining a negative prediction, we further demonstrate that Neon remains effective when generalizing to different scenarios.Comment: Accepted to AAAI-2

    OS-Copilot: Towards Generalist Computer Agents with Self-Improvement

    Full text link
    Autonomous interaction with the computer has been a longstanding challenge with great potential, and the recent proliferation of large language models (LLMs) has markedly accelerated progress in building digital agents. However, most of these agents are designed to interact with a narrow domain, such as a specific software or website. This narrow focus constrains their applicability for general computer tasks. To this end, we introduce OS-Copilot, a framework to build generalist agents capable of interfacing with comprehensive elements in an operating system (OS), including the web, code terminals, files, multimedia, and various third-party applications. We use OS-Copilot to create FRIDAY, a self-improving embodied agent for automating general computer tasks. On GAIA, a general AI assistants benchmark, FRIDAY outperforms previous methods by 35%, showcasing strong generalization to unseen applications via accumulated skills from previous tasks. We also present numerical and quantitative evidence that FRIDAY learns to control and self-improve on Excel and Powerpoint with minimal supervision. Our OS-Copilot framework and empirical findings provide infrastructure and insights for future research toward more capable and general-purpose computer agents.Comment: Project page: https://os-copilot.github.i

    Chitosan-Alginate Sponge: Preparation and Application in Curcumin Delivery for Dermal Wound Healing in Rat

    Get PDF
    A biodegradable sponge, composed of chitosan (CS) and sodium alginate (SA), was successfully obtained in this work. The sponge was ethereal and pliable. The chemical structure and morphology of the sponges was characterized by FTIR and SEM. The swelling ability, in vitro drug release and degradation behaviors, and an in vivo animal test were employed to confirm the applicability of this sponge as a wound dressing material. As the chitosan content in the sponge decreased, the swelling ability decreased. All types of the sponges exhibited biodegradable properties. The release of curcumin from the sponges could be controlled by the crosslinking degree. Curcumin could be released from the sponges in an extended period for up to 20 days. An in vivo animal test using SD rat showed that sponge had better effect than cotton gauze, and adding curcumin into the sponge enhanced the therapeutic healing effect

    Precipitable water vapour retrieval from GPS precise point positioning and NCEP CFSv2 dataset during typhoon events

    Get PDF
    Radiosonde is extensively used for understanding meteorological parameters in the vertical direction. Four typhoon events, including three landfalls (MERANTI, NEPARTAK, and MEGI) and one non-landfall (MALAKAS), were chosen in analysing the precipitable water vapour (PWV) characteristics in this study. The spatial distribution of the three radiosonde stations in Zhejiang province does not meet the requirement in analysing changes in PWV during typhoon event. Global position system (GPS) observations are an alternative method for deriving the PWV. This enables improvements in the temporal⁻spatial resolution of PWV computed by the radiosonde measurements. The National Centers for Environmental Prediction (NCEP) re-analysed data were employed for interpolating temperature and atmosphere pressure at the GPS antennas height. The PWV computed from GPS observations and NCEP re-analysed data were then compared with the true PWV. The maximum difference of radiosonde and GPS PWV was not more than 30 mm at Taiz station. The Root-Mean-Square (RMS) of PWV differences between radiosonde and GPS was not more than 5 mm in January, February, March, November, and December. It was slightly greater than 5 mm in April. High RMS in May, June, July, August, September, and October implies that differences in GPS and radiosonde PWVs are evident in these months. Correlation coefficients of GPS and radiosonde PWVs were more than 0.9, indicating that the changes in GPS and radiosonde PWVs are similar. Radiosonde calculated PWVs were used for GPS PWV calibration for understanding the PWV changes during the period of a typhoon event. The results from three landfall typhoons show that the average PWV over Zhejiang province is increasing and approaching China mainland. In contrast, MALAKAS did not make landfall and shows a decreasing PWV trend, although it was heading to China mainland. Generally, the PWV change can be used to predict whether the typhoon will make landfall in these cases. PWV spatial distribution of MERANTI shows that PWV peaks change along the typhoon epicenter over Zhejiang province
    corecore