90 research outputs found

    Object-aware Inversion and Reassembly for Image Editing

    Full text link
    By comparing the original and target prompts in editing task, we can obtain numerous editing pairs, each comprising an object and its corresponding editing target. To allow editability while maintaining fidelity to the input image, existing editing methods typically involve a fixed number of inversion steps that project the whole input image to its noisier latent representation, followed by a denoising process guided by the target prompt. However, we find that the optimal number of inversion steps for achieving ideal editing results varies significantly among different editing pairs, owing to varying editing difficulties. Therefore, the current literature, which relies on a fixed number of inversion steps, produces sub-optimal generation quality, especially when handling multiple editing pairs in a natural image. To this end, we propose a new image editing paradigm, dubbed Object-aware Inversion and Reassembly (OIR), to enable object-level fine-grained editing. Specifically, we design a new search metric, which determines the optimal inversion steps for each editing pair, by jointly considering the editability of the target and the fidelity of the non-editing region. We use our search metric to find the optimal inversion step for each editing pair when editing an image. We then edit these editing pairs separately to avoid concept mismatch. Subsequently, we propose an additional reassembly step to seamlessly integrate the respective editing results and the non-editing region to obtain the final edited image. To systematically evaluate the effectiveness of our method, we collect two datasets for benchmarking single- and multi-object editing, respectively. Experiments demonstrate that our method achieves superior performance in editing object shapes, colors, materials, categories, etc., especially in multi-object editing scenarios.Comment: Project Page: https://aim-uofa.github.io/OIR-Diffusion

    Pruning Meets Low-Rank Parameter-Efficient Fine-Tuning

    Full text link
    Large pre-trained models (LPMs), such as LLaMA and ViT-G, have shown exceptional performance across various tasks. Although parameter-efficient fine-tuning (PEFT) has emerged to cheaply fine-tune these large models on downstream tasks, their deployment is still hindered by the vast model scale and computational costs. Neural network pruning offers a solution for model compression by removing redundant parameters, but most existing methods rely on computing parameter gradients. However, obtaining the gradients is computationally prohibitive for LPMs, which necessitates the exploration of alternative approaches. To this end, we propose a unified framework for efficient fine-tuning and deployment of LPMs, termed LoRAPrune. We first design a PEFT-aware pruning criterion, which utilizes the values and gradients of Low-Rank Adaption (LoRA), rather than the gradients of pre-trained parameters for importance estimation. We then propose an iterative pruning procedure to remove redundant parameters while maximizing the advantages of PEFT. Thus, our LoRAPrune delivers an accurate, compact model for efficient inference in a highly cost-effective manner. Experimental results on various tasks demonstrate that our method achieves state-of-the-art results. For instance, in the VTAB-1k benchmark, LoRAPrune utilizes only 0.76% of the trainable parameters and outperforms magnitude and movement pruning methods by a significant margin, achieving a mean Top-1 accuracy that is 5.7% and 4.3% higher, respectively. Moreover, our approach achieves comparable performance to PEFT methods, highlighting its efficacy in delivering high-quality results while benefiting from the advantages of pruning

    Evaluation of precipitable water vapor from five reanalysis products with ground-based GNSS observations

    Get PDF
    At present, the global reliability and accuracy of Precipitable Water Vapor (PWV) from different reanalysis products have not been comprehensively evaluated. In this study, PWV values derived by 268 Global Navigation Satellite Systems (GNSS) stations around the world covering the period from 2016 to 2018 are used to evaluate the accuracies of PWV values from five reanalysis products. The temporal and spatial evolution is not taken into account in this analysis, although the temporal and spatial evolution of atmospheric flows is one of the most important information elements available in numerical weather prediction products. The evaluation results present that five reanalysis products with PWV accuracy from high to low are in the order of the fifth generation of European Centre for Medium-Range Weather Forecasts (ECMWF) Reanalysis (ERA5), ERA-Interim, Japanese 55-year Reanalysis (JRA-55), National Centers for Environmental Prediction/National Center for Atmospheric Research (NCEP/NCAR), and NCEP/DOE (Department of Energy) according to root mean square error (RMSE), bias and correlation coefficient. The ERA5 has the smallest RMSE value of 1.84 mm, while NCEP/NCAR and NCEP/DOE have bigger RMSE values of 3.34 mm and 3.51 mm, respectively. The findings demonstrate that ERA5 and two NCEP reanalysis products have the best and worst performance, respectively, among five reanalysis products. The differences in the accuracy of the five reanalysis products are mainly attributed to the differences in the spatial resolution of reanalysis products. There are some large absolute biases greater than 4 mm between GNSS PWV values and the PWV values of five reanalysis products in the southwest of South America and western China due to the limit of terrains and fewer observations. The accuracies of five reanalysis products are compared in different climatic zones. The results indicate that the absolute accuracies of five reanalysis products are highest in the polar regions and lowest in the tropics. Furthermore, the effects of different seasons on the accuracies of five reanalysis products are also analyzed, which indicates that RMSE values of five reanalysis products in summer and in winter are the largest and the smallest in the temperate regions. Evaluation results from five reanalysis products can help us to learn more about the advantages and disadvantages of the five released water vapor products and promote their applications.Peer ReviewedPostprint (published version

    HPC-GPT: Integrating Large Language Model for High-Performance Computing

    Full text link
    Large Language Models (LLMs), including the LLaMA model, have exhibited their efficacy across various general-domain natural language processing (NLP) tasks. However, their performance in high-performance computing (HPC) domain tasks has been less than optimal due to the specialized expertise required to interpret the model responses. In response to this challenge, we propose HPC-GPT, a novel LLaMA-based model that has been supervised fine-tuning using generated QA (Question-Answer) instances for the HPC domain. To evaluate its effectiveness, we concentrate on two HPC tasks: managing AI models and datasets for HPC, and data race detection. By employing HPC-GPT, we demonstrate comparable performance with existing methods on both tasks, exemplifying its excellence in HPC-related scenarios. Our experiments on open-source benchmarks yield extensive results, underscoring HPC-GPT's potential to bridge the performance gap between LLMs and HPC-specific tasks. With HPC-GPT, we aim to pave the way for LLMs to excel in HPC domains, simplifying the utilization of language models in complex computing applications.Comment: 9 page

    In situ time-resolved FTIRS study of adsorption and oxidation of ethylene glycol on Pt(100) electrode

    Get PDF
    Adsorption and oxidation of ethylene glycol (EG) on Pt(100) electrode were studied by in situ time-resolved FTIRS (TRFTIRS). The TRFTIR spectra recorded at 0.10 V illustrate that an IR band appears near 2050 cm(-1) at t > 5 s, corresponding to linearly bonded CO formed in dissociative adsorption of EG The TRFTIR results have confirmed also that CO species are distributed uniformly on Pt(100) surface. Another band appears near 2342 cm(-1) at t < 70 s, associating with IR absorption of CO2 produced in the direct oxidation of EG With the increase of electrode potential, the direct oxidation of EG becomes gradually the main reaction. When the potential is above 0.40 V, the oxidation of EG occurs mainly via the reactive intermediates, i.e. species containing -COOH determined by in situ TRFTIRS
    • …
    corecore