126 research outputs found
Learning Image Deraining Transformer Network with Dynamic Dual Self-Attention
Recently, Transformer-based architecture has been introduced into single
image deraining task due to its advantage in modeling non-local information.
However, existing approaches tend to integrate global features based on a dense
self-attention strategy since it tend to uses all similarities of the tokens
between the queries and keys. In fact, this strategy leads to ignoring the most
relevant information and inducing blurry effect by the irrelevant
representations during the feature aggregation. To this end, this paper
proposes an effective image deraining Transformer with dynamic dual
self-attention (DDSA), which combines both dense and sparse attention
strategies to better facilitate clear image reconstruction. Specifically, we
only select the most useful similarity values based on top-k approximate
calculation to achieve sparse attention. In addition, we also develop a novel
spatial-enhanced feed-forward network (SEFN) to further obtain a more accurate
representation for achieving high-quality derained results. Extensive
experiments on benchmark datasets demonstrate the effectiveness of our proposed
method.Comment: 6 pages, 5 figure
EC-Conf: An Ultra-fast Diffusion Model for Molecular Conformation Generation with Equivariant Consistency
Despite recent advancement in 3D molecule conformation generation driven by
diffusion models, its high computational cost in iterative diffusion/denoising
process limits its application. In this paper, an equivariant consistency model
(EC-Conf) was proposed as a fast diffusion method for low-energy conformation
generation. In EC-Conf, a modified SE (3)-equivariant transformer model was
directly used to encode the Cartesian molecular conformations and a highly
efficient consistency diffusion process was carried out to generate molecular
conformations. It was demonstrated that, with only one sampling step, it can
already achieve comparable quality to other diffusion-based models running with
thousands denoising steps. Its performance can be further improved with a few
more sampling iterations. The performance of EC-Conf is evaluated on both
GEOM-QM9 and GEOM-Drugs sets. Our results demonstrate that the efficiency of
EC-Conf for learning the distribution of low energy molecular conformation is
at least two magnitudes higher than current SOTA diffusion models and could
potentially become a useful tool for conformation generation and sampling.Comment: 10 pages, 3 figure
- …