137 research outputs found

    DAQE: Enhancing the Quality of Compressed Images by Finding the Secret of Defocus

    Full text link
    Image defocus is inherent in the physics of image formation caused by the optical aberration of lenses, providing plentiful information on image quality. Unfortunately, the existing quality enhancement approaches for compressed images neglect the inherent characteristic of defocus, resulting in inferior performance. This paper finds that in compressed images, the significantly defocused regions are with better compression quality and two regions with different defocus values possess diverse texture patterns. These findings motivate our defocus-aware quality enhancement (DAQE) approach. Specifically, we propose a novel dynamic region-based deep learning architecture of the DAQE approach, which considers the region-wise defocus difference of compressed images in two aspects. (1) The DAQE approach employs fewer computational resources to enhance the quality of significantly defocused regions, while more resources on enhancing the quality of other regions; (2) The DAQE approach learns to separately enhance diverse texture patterns for the regions with different defocus values, such that texture-wise one-on-one enhancement can be achieved. Extensive experiments validate the superiority of our DAQE approach in terms of quality enhancement and resource-saving, compared with other state-of-the-art approaches

    Adversarial Training with Fast Gradient Projection Method against Synonym Substitution based Text Attacks

    Full text link
    Adversarial training is the most empirically successful approach in improving the robustness of deep neural networks for image classification.For text classification, however, existing synonym substitution based adversarial attacks are effective but not efficient to be incorporated into practical text adversarial training. Gradient-based attacks, which are very efficient for images, are hard to be implemented for synonym substitution based text attacks due to the lexical, grammatical and semantic constraints and the discrete text input space. Thereby, we propose a fast text adversarial attack method called Fast Gradient Projection Method (FGPM) based on synonym substitution, which is about 20 times faster than existing text attack methods and could achieve similar attack performance. We then incorporate FGPM with adversarial training and propose a text defense method called Adversarial Training with FGPM enhanced by Logit pairing (ATFL). Experiments show that ATFL could significantly improve the model robustness and block the transferability of adversarial examples.Comment: Accepted by AAAI 2021, code is available at https://github.com/JHL-HUST/FGP

    Evaluation Index System for Railway Hub Logistics Base System Layout Planning, Taking Hefei Railway Hub in China as an Example

    Get PDF
    The construction of a railway hub logistics base system has many influencing factors and a high construction cost. There is an extremely important social and economic significance to the evaluation of its planning scheme. This study has obtained practical experience from a large number of existing railway hub planning schemes in China, using the analytic hierarchy process. Then, macro- and micro-level layout planning principles were analyzed. Moreover, 16 evaluation indicators were established at the macro and micro levels. The analytic hierarchy process and a comprehensive evaluation index method were used to deal with all indicators and give the score of the planning scheme. Lastly, Hefei railway hub in China was taken as an example to test the theory above

    ReFusion: Learning Image Fusion from Reconstruction with Learnable Loss via Meta-Learning

    Full text link
    Image fusion aims to combine information from multiple source images into a single one with more comprehensive informational content. The significant challenges for deep learning-based image fusion algorithms are the lack of a definitive ground truth as well as the corresponding distance measurement, with current manually given loss functions constrain the flexibility of model and generalizability for unified fusion tasks. To overcome these limitations, we introduce a unified image fusion framework based on meta-learning, named ReFusion, which provides a learning paradigm that obtains the optimal fusion loss for various fusion tasks based on reconstructing the source images. Compared to existing methods, ReFusion employs a parameterized loss function, dynamically adjusted by the training framework according to the specific scenario and task. ReFusion is constituted by three components: a fusion module, a loss proposal module, and a source reconstruction module. To ensure the fusion module maximally preserves the information from the source images, enabling the reconstruction of the source images from the fused image, we adopt a meta-learning strategy to train the loss proposal module using reconstruction loss. The update of the fusion module relies on the fusion loss proposed by the loss proposal module. The alternating updates of the three modules mutually facilitate each other, aiming to propose an appropriate fusion loss for different tasks and yield satisfactory fusion results. Extensive experiments demonstrate that ReFusion is capable of adapting to various tasks, including infrared-visible, medical, multi-focus, and multi-exposure image fusion. The code will be released

    Uncertainty Guided Adaptive Warping for Robust and Efficient Stereo Matching

    Full text link
    Correlation based stereo matching has achieved outstanding performance, which pursues cost volume between two feature maps. Unfortunately, current methods with a fixed model do not work uniformly well across various datasets, greatly limiting their real-world applicability. To tackle this issue, this paper proposes a new perspective to dynamically calculate correlation for robust stereo matching. A novel Uncertainty Guided Adaptive Correlation (UGAC) module is introduced to robustly adapt the same model for different scenarios. Specifically, a variance-based uncertainty estimation is employed to adaptively adjust the sampling area during warping operation. Additionally, we improve the traditional non-parametric warping with learnable parameters, such that the position-specific weights can be learned. We show that by empowering the recurrent network with the UGAC module, stereo matching can be exploited more robustly and effectively. Extensive experiments demonstrate that our method achieves state-of-the-art performance over the ETH3D, KITTI, and Middlebury datasets when employing the same fixed model over these datasets without any retraining procedure. To target real-time applications, we further design a lightweight model based on UGAC, which also outperforms other methods over KITTI benchmarks with only 0.6 M parameters.Comment: Accepted by ICCV202

    VersaT2I: Improving Text-to-Image Models with Versatile Reward

    Full text link
    Recent text-to-image (T2I) models have benefited from large-scale and high-quality data, demonstrating impressive performance. However, these T2I models still struggle to produce images that are aesthetically pleasing, geometrically accurate, faithful to text, and of good low-level quality. We present VersaT2I, a versatile training framework that can boost the performance with multiple rewards of any T2I model. We decompose the quality of the image into several aspects such as aesthetics, text-image alignment, geometry, low-level quality, etc. Then, for every quality aspect, we select high-quality images in this aspect generated by the model as the training set to finetune the T2I model using the Low-Rank Adaptation (LoRA). Furthermore, we introduce a gating function to combine multiple quality aspects, which can avoid conflicts between different quality aspects. Our method is easy to extend and does not require any manual annotation, reinforcement learning, or model architecture changes. Extensive experiments demonstrate that VersaT2I outperforms the baseline methods across various quality criteria

    Farm-waste-derived recyclable photothermal evaporator

    Get PDF
    © The Author(s), 2021. This article is distributed under the terms of the Creative Commons Attribution License. The definitive version was published in Tian, Y., Liu, X., Li, J., Deng, Y., DeGiorgis, J. A., Zhou, S., Caratenuto, A., Minus, M. L., Wan, Y., Xiao, G., & Zheng, Y. Farm-waste-derived recyclable photothermal evaporator. Cell Reports Physical Science, 2(9), (2021): 100549, https://doi.org/10.1016./j.xcrp.2021.100549Interfacial solar steam generation is emerging as a promising technique for efficient desalination. Although increasing efforts have been made, challenges exist for achieving a balance among a plethora of performance indicators—for example, rapid evaporation, durability, low-cost deployment, and salt rejection. Here, we demonstrate that carbonized manure can convert 98% of sunlight into heat, and the strong capillarity of porous carbon fibers networks pumps sufficient water to evaporation interfaces. Salt diffusion within microchannels enables quick salt drainage to the bulk seawater to prevent salt accumulation. With these advantages, this biomass-derived evaporator is demonstrated to feature a high evaporation rate of 2.81 kg m−2 h−1 under 1 sun with broad robustness to acidity and alkalinity. These advantages, together with facial deployment, offer an approach for converting farm waste to energy with high efficiency and easy implementation, which is particularly well suited for developing regions.This project is supported by the National Science Foundation through grant no. CBET-1941743. This project is based upon work supported in part by the National Science Foundation under EPSCoR Cooperative Agreement no. OIA-1655221

    Characteristics of slamming pressure and force for trimaran hull

    Get PDF
    In this paper, the characteristics of the impact pressure and force of a trimaran section was studied by Computational Fluid Dynamics (CFD). The time domain features of the slamming pressure or force showed a strong correlation with the penetration depth regardless of the specific ways of water entry. The effects of velocity and acceleration on the impact pressure and force were analyzed. It was found that the initial impact of the main hull and the wet-deck slamming were predominantly affected by the entry velocities, whilst the acceleration had almost no effect for initial impact. The impact velocity presented a quadratic relation with slamming pressure/forces, and the relation between acceleration and wet-deck slamming pressure/force was linear. These were consistent with the patterns implied by analytical models such as the Wagner or MLM (Modified Logvinovich model) theories
    corecore