56 research outputs found

    Provably Improved Context-Based Offline Meta-RL with Attention and Contrastive Learning

    Full text link
    Meta-learning for offline reinforcement learning (OMRL) is an understudied problem with tremendous potential impact by enabling RL algorithms in many real-world applications. A popular solution to the problem is to infer task identity as augmented state using a context-based encoder, for which efficient learning of robust task representations remains an open challenge. In this work, we provably improve upon one of the SOTA OMRL algorithms, FOCAL, by incorporating intra-task attention mechanism and inter-task contrastive learning objectives, to robustify task representation learning against sparse reward and distribution shift. Theoretical analysis and experiments are presented to demonstrate the superior performance and robustness of our end-to-end and model-free framework compared to prior algorithms across multiple meta-RL benchmarks.Comment: 21 pages, 7 figure

    Antarctic snow-covered sea ice topography derivation from TanDEM-X using polarimetric SAR interferometry

    Get PDF
    Single-pass interferometric synthetic aperture radar (InSAR) enables the possibility for sea ice topographic retrieval despite the inherent dynamics of sea ice. InSAR digital elevation models (DEMs) are measuring the radar scattering center height. The height bias induced by the penetration of electromagnetic waves into snow and ice leads to inaccuracies of the InSAR DEM, especially for thick and deformed sea ice with snow cover. In this study, an elevation difference between the satellite-measured InSAR DEM and the airborne-measured optical DEM is observed from a coordinated campaign over the western Weddell Sea in Antarctica. The objective is to correct the penetration bias and generate a precise sea ice topographic map from the single-pass InSAR data. With the potential of retrieving sea ice geophysical information by the polarimetric-interferometry (Pol-InSAR) technique, a two-layer-plus-volume model is proposed to represent the sea ice vertical structure and its scattering mechanisms. Furthermore, a simplified version of the model is derived, to allow its inversion with limited a priori knowledge, which is then applied to a topographic retrieval scheme. The experiments are performed across four polarizations: HH, VV, Pauli 1 (HH + VV), and Pauli 2 (HH − VV). The model-retrieved performance is validated with the optically derived DEM of the sea ice topography, showing an excellent performance with root-mean-square error as low as 0.26 m in Pauli-1 (HH + VV) polarization

    ShadowDiffusion: When Degradation Prior Meets Diffusion Model for Shadow Removal

    Full text link
    Recent deep learning methods have achieved promising results in image shadow removal. However, their restored images still suffer from unsatisfactory boundary artifacts, due to the lack of degradation prior embedding and the deficiency in modeling capacity. Our work addresses these issues by proposing a unified diffusion framework that integrates both the image and degradation priors for highly effective shadow removal. In detail, we first propose a shadow degradation model, which inspires us to build a novel unrolling diffusion model, dubbed ShandowDiffusion. It remarkably improves the model's capacity in shadow removal via progressively refining the desired output with both degradation prior and diffusive generative prior, which by nature can serve as a new strong baseline for image restoration. Furthermore, ShadowDiffusion progressively refines the estimated shadow mask as an auxiliary task of the diffusion generator, which leads to more accurate and robust shadow-free image generation. We conduct extensive experiments on three popular public datasets, including ISTD, ISTD+, and SRD, to validate our method's effectiveness. Compared to the state-of-the-art methods, our model achieves a significant improvement in terms of PSNR, increasing from 31.69dB to 34.73dB over SRD dataset

    Boosting Visual-Language Models by Exploiting Hard Samples

    Full text link
    Contrastive Language-Image Pre-training (CLIP) has become the standard for learning cross-modal representations between images and text. Efforts to improve its capabilities typically demand the collection of additional data and retraining with new loss functions. While effective, the added requirements limit their practical use due to the increased resource and time investments needed. In this work, we present HELIP, a cost-effective strategy tailored to enhance the performance of existing CLIP models without the need for training a model from scratch or collecting additional data. Our method allows for effortless integration with existing models' training pipelines, providing an instant boost by training them with selected challenging text-image pairs from their original training datasets. HELIP treats each text-image pair as a single point in the joint vision-language space, identifying those in close proximity as hard pairs. By incorporating the challenging data, pre-trained CLIP models are refined using both the traditional contrastive loss and the newly introduced hard negative margin loss, ensuring the challenging data is fully utilized. On comprehensive benchmarks, HELIP consistently boosts existing models to achieve leading performance. In particular, it improves the zero-shot classification accuracy on ImageNet for SLIP models pre-trained on CC3M, CC12M and YFCC15M datasets. The improvements are 3.05%, 4.47%, and 10.1% respectively, achieved within two epochs of training. In addition, across fine-grained classification datasets, HELIP improves the zero-shot performance of pre-trained CLIP and SLIP by an average of 8.4% and 18.6%, and their linear probe performance by an average of 9.5% and 3.0%.Comment: The code is publicly available at https://github.com/haonan3/HELI

    Make a Cheap Scaling: A Self-Cascade Diffusion Model for Higher-Resolution Adaptation

    Full text link
    Diffusion models have proven to be highly effective in image and video generation; however, they still face composition challenges when generating images of varying sizes due to single-scale training data. Adapting large pre-trained diffusion models for higher resolution demands substantial computational and optimization resources, yet achieving a generation capability comparable to low-resolution models remains elusive. This paper proposes a novel self-cascade diffusion model that leverages the rich knowledge gained from a well-trained low-resolution model for rapid adaptation to higher-resolution image and video generation, employing either tuning-free or cheap upsampler tuning paradigms. Integrating a sequence of multi-scale upsampler modules, the self-cascade diffusion model can efficiently adapt to a higher resolution, preserving the original composition and generation capabilities. We further propose a pivot-guided noise re-schedule strategy to speed up the inference process and improve local structural details. Compared to full fine-tuning, our approach achieves a 5X training speed-up and requires only an additional 0.002M tuning parameters. Extensive experiments demonstrate that our approach can quickly adapt to higher resolution image and video synthesis by fine-tuning for just 10k steps, with virtually no additional inference time.Comment: Project Page: https://guolanqing.github.io/Self-Cascade

    Nomenclatural and taxonomic notes on Rubus davidianus Kuntze and R. viburnifolius Franch

    Get PDF
    Critical examinations of specimens, with literature reviews, have shown that Rubus davidianus is conspecific with R. lambertianus. Therefore, we treat R. davidianus as a new synonym within Rubus. We propose a new name, Rubus loirensis Ti R. Huang nom. nov. to replace the later homonym of R. pycnanthus Genev. Additionally, lectotypification of three names, R. davidianus Kuntze, R. malifolius Focke and R. viburnifolius Franch., are designated here after examination of previous works

    Tyrosine 23 Phosphorylation-Dependent Cell-Surface Localization of Annexin A2 Is Required for Invasion and Metastases of Pancreatic Cancer

    Get PDF
    The aggressiveness of pancreatic ductal adenocarcinoma (PDA) is characterized by its high metastatic potential and lack of effective therapies, which is the result of a lack of understanding of the mechanisms involved in promoting PDA metastases. We identified Annexin A2 (ANXA2), a member of the Annexin family of calcium-dependent phospholipid binding proteins, as a new molecule that promotes PDA invasion and metastases. We found ANXA2 to be a PDA-associated antigen recognized by post-treatment sera of patients who demonstrated prolonged survival following treatment with a PDA-specific vaccine. Cell surface ANXA2 increases with PDA development and progression. Knockdown of ANXA2 expression by RNA interference or blocking with anti-ANXA2 antibodies inhibits in vitro invasion of PDA cells. In addition, post-vaccination patient sera inhibits in vitro invasion of PDA cells, suggesting that therapeutic anti-ANXA2 antibodies are induced by the vaccine. Furthermore, cell-surface localization of ANXA2 is tyrosine 23 phosphorylation-dependent; and tyrosine 23 phosphorylation is required for PDA invasion. We demonstrated that tyrosine 23 phosphorylation resulting in surface expression of ANXA2 is required for TGFβ-induced, Rho-mediated epithelial-mesenchymal transition (EMT), linking the cellular function of ANXA2 which was previously shown to be associated with small GTPase-regulated cytoskeletal rearrangements, to the EMT process in PDA. Finally, using mouse PDA models, we showed that shRNA knock-down of ANXA2, a mutation at tyrosine 23, or anti-ANXA2 antibodies, inhibit PDA metastases and prolong mouse survival. Thus, ANXA2 is part of a novel molecular pathway underlying PDA metastases and a new target for development of PDA therapeutics

    Polarimetric Behavior for the Derivation of Sea Ice Topographic Height From TanDEM-X Interferometric SAR Data

    No full text
    Single-pass interferometric synthetic aperture radar (InSAR) is an effective technique for sea ice topographic retrieval despite the inherent dynamics of sea ice. However, the penetration of microwaves into snow-covered thick ice and the achievable height sensitivity for tens-of-centimeters thin ice are two major issues, which limit the accuracy of InSAR-derived sea ice topography. Polarimetry provides scattering information concerning the sea ice properties and has the potential, in combination with interferometry, to achieve an accurate reconstruction of a sea ice digital elevation model (DEM). This article studies the relation between polarimetric signatures and sea ice topography, and explores the possibility to compensate the penetration bias by merging copolar coherence into InSAR processing. The newly generated topographic map has a root-mean-square error under 0.3 m. For thin ice below 1 m, a positive relation between copolar phase phi(coPol) and surface height is observed, suggesting that fcoPol can effectively characterize thin sea ice topography. For thick ice with ridges, the maximum polarimetric phase difference Delta phi(maxPol) reveals a particular shape of the coherence region, which can be interpreted as oriented volume scattering. It suggests that the model-based approach using polarimetric SAR interferometry assuming an oriented volume scattering model is promising in measuring the scattering centers in thick and deformed sea ice. The study of polarimetric behavior for the InSAR DEM is, therefore, a step forward toward accurate modeling of sea ice topography from polarimetric single-pass InSAR data.ISSN:1939-1404ISSN:2151-153
    • …
    corecore