56 research outputs found
Provably Improved Context-Based Offline Meta-RL with Attention and Contrastive Learning
Meta-learning for offline reinforcement learning (OMRL) is an understudied
problem with tremendous potential impact by enabling RL algorithms in many
real-world applications. A popular solution to the problem is to infer task
identity as augmented state using a context-based encoder, for which efficient
learning of robust task representations remains an open challenge. In this
work, we provably improve upon one of the SOTA OMRL algorithms, FOCAL, by
incorporating intra-task attention mechanism and inter-task contrastive
learning objectives, to robustify task representation learning against sparse
reward and distribution shift. Theoretical analysis and experiments are
presented to demonstrate the superior performance and robustness of our
end-to-end and model-free framework compared to prior algorithms across
multiple meta-RL benchmarks.Comment: 21 pages, 7 figure
Antarctic snow-covered sea ice topography derivation from TanDEM-X using polarimetric SAR interferometry
Single-pass interferometric synthetic aperture radar (InSAR) enables the possibility for sea ice topographic retrieval despite the inherent dynamics of sea ice. InSAR digital elevation models (DEMs) are measuring the radar scattering center height. The height bias induced by the penetration of electromagnetic waves into snow and ice leads to inaccuracies of the InSAR DEM, especially for thick and deformed sea ice with snow cover. In this study, an elevation difference between the satellite-measured InSAR DEM and the airborne-measured optical DEM is observed from a coordinated campaign over the western Weddell Sea in Antarctica. The objective is to correct the penetration bias and generate a precise sea ice topographic map from the single-pass InSAR data. With the potential of retrieving sea ice geophysical information by the polarimetric-interferometry (Pol-InSAR) technique, a two-layer-plus-volume model is proposed to represent the sea ice vertical structure and its scattering mechanisms. Furthermore, a simplified version of the model is derived, to allow its inversion with limited a priori knowledge, which is then applied to a topographic retrieval scheme. The experiments are performed across four polarizations: HH, VV, Pauli 1 (HH + VV), and Pauli 2 (HH − VV). The model-retrieved performance is validated with the optically derived DEM of the sea ice topography, showing an excellent performance with root-mean-square error as low as 0.26 m in Pauli-1 (HH + VV) polarization
ShadowDiffusion: When Degradation Prior Meets Diffusion Model for Shadow Removal
Recent deep learning methods have achieved promising results in image shadow
removal. However, their restored images still suffer from unsatisfactory
boundary artifacts, due to the lack of degradation prior embedding and the
deficiency in modeling capacity. Our work addresses these issues by proposing a
unified diffusion framework that integrates both the image and degradation
priors for highly effective shadow removal. In detail, we first propose a
shadow degradation model, which inspires us to build a novel unrolling
diffusion model, dubbed ShandowDiffusion. It remarkably improves the model's
capacity in shadow removal via progressively refining the desired output with
both degradation prior and diffusive generative prior, which by nature can
serve as a new strong baseline for image restoration. Furthermore,
ShadowDiffusion progressively refines the estimated shadow mask as an auxiliary
task of the diffusion generator, which leads to more accurate and robust
shadow-free image generation. We conduct extensive experiments on three popular
public datasets, including ISTD, ISTD+, and SRD, to validate our method's
effectiveness. Compared to the state-of-the-art methods, our model achieves a
significant improvement in terms of PSNR, increasing from 31.69dB to 34.73dB
over SRD dataset
Boosting Visual-Language Models by Exploiting Hard Samples
Contrastive Language-Image Pre-training (CLIP) has become the standard for
learning cross-modal representations between images and text. Efforts to
improve its capabilities typically demand the collection of additional data and
retraining with new loss functions. While effective, the added requirements
limit their practical use due to the increased resource and time investments
needed. In this work, we present HELIP, a cost-effective strategy tailored to
enhance the performance of existing CLIP models without the need for training a
model from scratch or collecting additional data. Our method allows for
effortless integration with existing models' training pipelines, providing an
instant boost by training them with selected challenging text-image pairs from
their original training datasets. HELIP treats each text-image pair as a single
point in the joint vision-language space, identifying those in close proximity
as hard pairs. By incorporating the challenging data, pre-trained CLIP models
are refined using both the traditional contrastive loss and the newly
introduced hard negative margin loss, ensuring the challenging data is fully
utilized. On comprehensive benchmarks, HELIP consistently boosts existing
models to achieve leading performance. In particular, it improves the zero-shot
classification accuracy on ImageNet for SLIP models pre-trained on CC3M, CC12M
and YFCC15M datasets. The improvements are 3.05%, 4.47%, and 10.1%
respectively, achieved within two epochs of training. In addition, across
fine-grained classification datasets, HELIP improves the zero-shot performance
of pre-trained CLIP and SLIP by an average of 8.4% and 18.6%, and their linear
probe performance by an average of 9.5% and 3.0%.Comment: The code is publicly available at https://github.com/haonan3/HELI
Make a Cheap Scaling: A Self-Cascade Diffusion Model for Higher-Resolution Adaptation
Diffusion models have proven to be highly effective in image and video
generation; however, they still face composition challenges when generating
images of varying sizes due to single-scale training data. Adapting large
pre-trained diffusion models for higher resolution demands substantial
computational and optimization resources, yet achieving a generation capability
comparable to low-resolution models remains elusive. This paper proposes a
novel self-cascade diffusion model that leverages the rich knowledge gained
from a well-trained low-resolution model for rapid adaptation to
higher-resolution image and video generation, employing either tuning-free or
cheap upsampler tuning paradigms. Integrating a sequence of multi-scale
upsampler modules, the self-cascade diffusion model can efficiently adapt to a
higher resolution, preserving the original composition and generation
capabilities. We further propose a pivot-guided noise re-schedule strategy to
speed up the inference process and improve local structural details. Compared
to full fine-tuning, our approach achieves a 5X training speed-up and requires
only an additional 0.002M tuning parameters. Extensive experiments demonstrate
that our approach can quickly adapt to higher resolution image and video
synthesis by fine-tuning for just 10k steps, with virtually no additional
inference time.Comment: Project Page: https://guolanqing.github.io/Self-Cascade
Nomenclatural and taxonomic notes on Rubus davidianus Kuntze and R. viburnifolius Franch
Critical examinations of specimens, with literature reviews, have shown that Rubus davidianus is conspecific with R. lambertianus. Therefore, we treat R. davidianus as a new synonym within Rubus. We propose a new name, Rubus loirensis Ti R. Huang nom. nov. to replace the later homonym of R. pycnanthus Genev. Additionally, lectotypification of three names, R. davidianus Kuntze, R. malifolius Focke and R. viburnifolius Franch., are designated here after examination of previous works
Tyrosine 23 Phosphorylation-Dependent Cell-Surface Localization of Annexin A2 Is Required for Invasion and Metastases of Pancreatic Cancer
The aggressiveness of pancreatic ductal adenocarcinoma (PDA) is characterized by
its high metastatic potential and lack of effective therapies, which is the
result of a lack of understanding of the mechanisms involved in promoting PDA
metastases. We identified Annexin A2 (ANXA2), a member of the Annexin family of
calcium-dependent phospholipid binding proteins, as a new molecule that promotes
PDA invasion and metastases. We found ANXA2 to be a PDA-associated antigen
recognized by post-treatment sera of patients who demonstrated prolonged
survival following treatment with a PDA-specific vaccine. Cell surface ANXA2
increases with PDA development and progression. Knockdown of ANXA2 expression by
RNA interference or blocking with anti-ANXA2 antibodies inhibits in
vitro invasion of PDA cells. In addition, post-vaccination patient
sera inhibits in vitro invasion of PDA cells, suggesting that
therapeutic anti-ANXA2 antibodies are induced by the vaccine. Furthermore,
cell-surface localization of ANXA2 is tyrosine 23 phosphorylation-dependent; and
tyrosine 23 phosphorylation is required for PDA invasion. We demonstrated that
tyrosine 23 phosphorylation resulting in surface expression of ANXA2 is required
for TGFβ-induced, Rho-mediated epithelial-mesenchymal transition (EMT),
linking the cellular function of ANXA2 which was previously shown to be
associated with small GTPase-regulated cytoskeletal rearrangements, to the EMT
process in PDA. Finally, using mouse PDA models, we showed that shRNA knock-down
of ANXA2, a mutation at tyrosine 23, or anti-ANXA2 antibodies,
inhibit PDA metastases and prolong mouse survival. Thus, ANXA2 is part of a
novel molecular pathway underlying PDA metastases and a new target for
development of PDA therapeutics
Polarimetric Behavior for the Derivation of Sea Ice Topographic Height From TanDEM-X Interferometric SAR Data
Single-pass interferometric synthetic aperture radar (InSAR) is an effective technique for sea ice topographic retrieval despite the inherent dynamics of sea ice. However, the penetration of microwaves into snow-covered thick ice and the achievable height sensitivity for tens-of-centimeters thin ice are two major issues, which limit the accuracy of InSAR-derived sea ice topography. Polarimetry provides scattering information concerning the sea ice properties and has the potential, in combination with interferometry, to achieve an accurate reconstruction of a sea ice digital elevation model (DEM). This article studies the relation between polarimetric signatures and sea ice topography, and explores the possibility to compensate the penetration bias by merging copolar coherence into InSAR processing. The newly generated topographic map has a root-mean-square error under 0.3 m. For thin ice below 1 m, a positive relation between copolar phase phi(coPol) and surface height is observed, suggesting that fcoPol can effectively characterize thin sea ice topography. For thick ice with ridges, the maximum polarimetric phase difference Delta phi(maxPol) reveals a particular shape of the coherence region, which can be interpreted as oriented volume scattering. It suggests that the model-based approach using polarimetric SAR interferometry assuming an oriented volume scattering model is promising in measuring the scattering centers in thick and deformed sea ice. The study of polarimetric behavior for the InSAR DEM is, therefore, a step forward toward accurate modeling of sea ice topography from polarimetric single-pass InSAR data.ISSN:1939-1404ISSN:2151-153
- …