A Novel Representation for Two-dimensional Image Structures
This paper presents a novel approach to modeling two-dimensional (2D) image structures. To obtain more degrees of freedom, a 2D image signal is embedded into a certain geometric algebra. By coupling methods from differential geometry, tensor algebra, the monogenic signal, and quadrature filters, we design a general model for 2D structures as the monogenic extension of a curvature tensor. Based on it, a local representation for the intrinsically two-dimensional (i2D) structure is derived as the monogenic curvature signal. From it, independent features of local amplitude, phase and orientation are simultaneously extracted. In addition, a monogenic curvature scale-space can be built by applying a Poisson kernel to the monogenic curvature signal. Compared with related work, the main advantage of our approach lies in the rotationally invariant phase evaluation of 2D structures in a multi-scale framework, which gives access to phase-based processing in many computer vision tasks.
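To make the vocabulary of the abstract above concrete, here is a minimal sketch of the classical monogenic signal with Poisson scale-space smoothing, computed with NumPy FFTs. It is not the paper's monogenic curvature signal; it only illustrates how local amplitude, phase and orientation come out of a Riesz-transform pair. The function name `monogenic_signal`, the `scale` parameter (in pixels) and the sign convention of the Riesz multipliers are assumptions made for this illustration.

```python
import numpy as np

def monogenic_signal(image, scale=0.0):
    """Classical monogenic signal of a 2D image (illustrative sketch).

    Returns local amplitude, phase and orientation. `scale` is the
    Poisson scale-space parameter in pixels (an assumed convention).
    """
    f = np.asarray(image, dtype=float)
    h, w = f.shape
    # Frequency grids in cycles per pixel; u varies along columns, v along rows.
    u, v = np.meshgrid(np.fft.fftfreq(w), np.fft.fftfreq(h))
    radius = np.hypot(u, v)
    # Avoid division by zero at the DC component (the multiplier is 0 there anyway).
    safe_radius = np.where(radius == 0, 1.0, radius)

    # Poisson kernel in the frequency domain: exp(-2*pi*|u|*s).
    spectrum = np.fft.fft2(f) * np.exp(-2.0 * np.pi * radius * scale)

    smoothed = np.fft.ifft2(spectrum).real
    # Riesz transform pair (sign convention is an assumption).
    r1 = np.fft.ifft2(-1j * u / safe_radius * spectrum).real
    r2 = np.fft.ifft2(-1j * v / safe_radius * spectrum).real

    amplitude = np.sqrt(smoothed**2 + r1**2 + r2**2)
    phase = np.arctan2(np.hypot(r1, r2), smoothed)
    orientation = np.arctan2(r2, r1)
    return amplitude, phase, orientation

# Usage: a horizontal cosine grating with 4 cycles across 64 pixels.
n, k = 64, 4
grating = np.tile(np.cos(2.0 * np.pi * k * np.arange(n) / n), (n, 1))
amplitude, phase, orientation = monogenic_signal(grating)
```

For a pure grating such as the one above, the local amplitude is constant while the phase tracks the position within each cycle, which is the brightness-independent behavior that phase-based methods rely on.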
Signal Modeling for Two-Dimensional Image Structures and Scale-Space Based Image Analysis
Model-based image representation plays an important role in many computer vision tasks. Consequently, it is highly significant to model image structures with more powerful representation capabilities. A large body of research in the literature addresses intensity-based modeling; however, most of these methods suffer under illumination variation. Phase information, on the other hand, carries most of the essential structural information of the original signal and has the advantage of being invariant to brightness changes. Therefore, phase-based image analysis is advantageous compared to purely intensity-based approaches. This thesis proposes novel image representations for 2D image structures, from which local features useful for phase-based image analysis can be extracted. The first approach presents a 2D rotationally invariant quadrature filter. This model is able to handle superimposed intrinsically two-dimensional (i2D) patterns with flexible angles of intersection. Hence, it can be regarded as an extension of the structure multivector. The second approach is the monogenic curvature tensor. By coupling methods from differential geometry, tensor algebra, the monogenic signal, and quadrature filters, we design a general model for 2D structures as the monogenic extension of a curvature tensor. Based on it, local representations for the intrinsically one-dimensional (i1D) and i2D structures are derived as the monogenic signal and the generalized monogenic curvature signal, respectively. From them, independent features of local amplitude, phase and orientation are simultaneously extracted. In addition, a generalized monogenic curvature scale-space can be built by applying a Poisson kernel to the monogenic curvature tensor.
Compared with other related work, the main advantage of our approach lies in the rotationally invariant phase evaluation of 2D structures in a multi-scale framework, which gives access to phase-based processing in many computer vision tasks. To demonstrate the efficiency and power of the theoretical framework, several computer vision applications are presented, including phase-based image reconstruction, detection of i2D image structures using local phase, and optical flow estimation using the monogenic curvature tensor.
One for All, All for One: Learning and Transferring User Embeddings for Cross-Domain Recommendation
Cross-domain recommendation is an important method to improve recommender
system performance, especially when observations in target domains are sparse.
However, most existing techniques focus on single-target or dual-target
cross-domain recommendation (CDR) and are hard to generalize to CDR with
multiple target domains. In addition, the negative transfer problem is
prevalent in CDR, where the recommendation performance in a target domain may
not always be enhanced by knowledge learned from a source domain, especially
when the source domain has sparse data. In this study, we propose CAT-ART, a
multi-target CDR method that learns to improve recommendations in all
participating domains through representation learning and embedding transfer.
Our method consists of two parts: a self-supervised Contrastive AuToencoder
(CAT) framework to generate global user embeddings based on information from
all participating domains, and an Attention-based Representation Transfer (ART)
framework which transfers domain-specific user embeddings from other domains to
assist with target domain recommendation. CAT-ART boosts the recommendation
performance in any target domain through the combined use of the learned global
user representation and knowledge transferred from other domains, in addition
to the original user embedding in the target domain. We conducted extensive
experiments on a collected real-world CDR dataset spanning 5 domains and
involving a million users. Experimental results demonstrate the superiority of
the proposed method over a range of prior arts. We further conducted ablation
studies to verify the effectiveness of the proposed components. Our collected
dataset will be open-sourced to facilitate future research in the field of
multi-domain recommender systems and user modeling. Comment: 9 pages, accepted by WSDM 202
Align Yourself: Self-supervised Pre-training for Fine-grained Recognition via Saliency Alignment
Self-supervised contrastive learning has demonstrated great potential in
learning visual representations. Despite its success on various downstream
tasks such as image classification and object detection, self-supervised
pre-training for fine-grained scenarios is not fully explored. In this paper,
we first point out that current contrastive methods are prone to memorizing
background/foreground texture and therefore have a limitation in localizing the
foreground object. Analysis suggests that learning to extract discriminative
texture information and localization are equally crucial for self-supervised
pre-training in fine-grained scenarios. Based on our findings, we introduce
cross-view saliency alignment (CVSA), a contrastive learning framework that
first crops and swaps saliency regions of images as a novel view generation and
then guides the model to localize on the foreground object via a cross-view
alignment loss. Extensive experiments on four popular fine-grained
classification benchmarks show that CVSA significantly improves the learned
representation. Comment: The second version of CVSA. 10 pages, 4 figure
Architecture-Agnostic Masked Image Modeling -- From ViT back to CNN
Masked image modeling (MIM), an emerging self-supervised pre-training method,
has shown impressive success across numerous downstream vision tasks with
Vision transformers (ViTs). Its underlying idea is simple: a portion of the
input image is randomly masked out and then reconstructed via the pre-text
task. However, the working principle behind MIM is not well explained, and
previous studies insist that MIM primarily works for the Transformer family but
is incompatible with CNNs. In this paper, we first study interactions among
patches to understand what knowledge is learned and how it is acquired via the
MIM task. We observe that MIM essentially teaches the model to learn better
middle-order interactions among patches and extract more generalized features.
Based on this fact, we propose an Architecture-Agnostic Masked Image Modeling
framework (AMIM), which is compatible with both Transformers and CNNs in a
unified way. Extensive experiments on popular benchmarks show that our AMIM
learns better representations without explicit design and endows the backbone
model with a stronger capability to transfer to various downstream tasks for
both Transformers and CNNs. Comment: Preprint under review (update reversion). The source code will be
released in https://github.com/Westlake-AI/openmixu
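The masking step described in the abstract above ("a portion of the input image is randomly masked out and then reconstructed") can be sketched in a few lines of NumPy. This is a generic illustration of random patch masking and a masked reconstruction loss, not the AMIM method itself; the function names `random_patch_mask` and `masked_mse`, the patch size, and the mask ratio are assumptions chosen for the example.

```python
import numpy as np

def random_patch_mask(image, patch=4, mask_ratio=0.5, seed=0):
    """Zero out a random subset of non-overlapping patch x patch blocks.

    Returns the masked image and a boolean pixel mask (True = masked).
    Assumes the image height and width are multiples of `patch`.
    """
    h, w = image.shape[:2]
    assert h % patch == 0 and w % patch == 0, "image must tile evenly"
    grid_h, grid_w = h // patch, w // patch

    rng = np.random.default_rng(seed)
    n_masked = int(round(mask_ratio * grid_h * grid_w))
    chosen = rng.permutation(grid_h * grid_w)[:n_masked]

    # Boolean mask on the patch grid, then expanded to pixel resolution.
    grid = np.zeros(grid_h * grid_w, dtype=bool)
    grid[chosen] = True
    grid = grid.reshape(grid_h, grid_w)
    pixel_mask = np.repeat(np.repeat(grid, patch, axis=0), patch, axis=1)

    masked = image.copy()
    masked[pixel_mask] = 0
    return masked, pixel_mask

def masked_mse(prediction, target, pixel_mask):
    """Mean squared error computed only over the masked pixels."""
    diff = prediction[pixel_mask] - target[pixel_mask]
    return float(np.mean(diff**2))

# Usage: mask half of the 4x4 patches of a 16x16 image.
image = np.arange(1.0, 257.0).reshape(16, 16)
masked, pixel_mask = random_patch_mask(image, patch=4, mask_ratio=0.5)
loss_if_perfect = masked_mse(image, image, pixel_mask)
```

In a real pre-training loop, `prediction` would be the output of an encoder-decoder network fed with `masked`; restricting the loss to masked pixels is one common design choice, assumed here rather than taken from the abstract.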
Shape evolution and bubble formation of acoustically levitated drops
In this study, we investigated the shape evolution and bubble formation of acoustically levitated drops upon increasing the sound intensity. Here, a levitated liquid drop evolves progressively from an oblate spheroidal shape to a flattened film to a thin bowl-shaped film, eventually forming a closed bubble. Through systematic experiments, numerical simulation, and scaling analysis, we demonstrate that the buckled geometry of the liquid film can drastically enhance the suction effect of acoustic radiation pressure at its rim, forming a significant pressure gradient inside the film which causes an abrupt area expansion and bubble formation. Our results provide the mechanical origin responsible for the shape evolution and bubble formation of acoustically levitated drops, and highlight the role of buckled geometry in the levitation and manipulation of liquid films in an ultrasound field.
Charge Measurement of Cosmic Ray Nuclei with the Plastic Scintillator Detector of DAMPE
One of the main purposes of the DArk Matter Particle Explorer (DAMPE) is to
measure the cosmic ray nuclei up to several tens of TeV or beyond, whose origin
and propagation remain a hot topic in astrophysics. The Plastic Scintillator
Detector (PSD) on top of DAMPE is designed to measure the charges of cosmic ray
nuclei from H to Fe and serves as a veto detector for discriminating gamma-rays
from charged particles. We propose in this paper a charge reconstruction
procedure to optimize the PSD performance in charge measurement. The essentials of
our approach, including track finding, alignment of PSD, light attenuation
correction, quenching and equalization correction, are described in detail in
this paper after a brief description of the structure and operational principle
of the PSD. Our results show that the PSD works very well and almost all the
elements in cosmic rays from H to Fe are clearly identified in the charge
spectrum. Comment: 20 pages, 4 figure
Giant thermal transport tuning at a metal/ferroelectric interface
Interfacial thermal transport plays a prominent role in the thermal management of nanoscale objects and is of fundamental importance for basic research and nanodevices. At metal/insulator interfaces, a configuration commonly found in electronic devices, heat transport strongly depends upon the effective energy transfer from thermalized electrons in the metal to the phonons in the insulator. However, the mechanism of interfacial electron–phonon coupling and thermal transport at metal/insulator interfaces is not well understood. Here, the observation of a substantial enhancement of the interfacial thermal resistance and the important role of surface charges at the metal/ferroelectric interface in an Al/BiFeO3 membrane are reported. By applying uniaxial strain, the interfacial thermal resistance can be varied substantially (up to an order of magnitude), which is attributed to the renormalized interfacial electron–phonon coupling caused by the charge redistribution at the interface due to the polarization rotation. These results imply that surface charges at a metal/insulator interface can substantially enhance the interfacial electron–phonon-mediated thermal coupling, providing a new route to optimize the thermal transport performance in next-generation nanodevices, power electronics, and thermal logic devices. Peer Reviewed. Postprint (author's final draft
An In Vivo Screen Identifies PYGO2 as a Driver for Metastatic Prostate Cancer
Advanced prostate cancer displays conspicuous chromosomal instability and rampant copy number aberrations, yet the identity of functional drivers resident in many amplicons remains elusive. Here, we implemented a functional genomics approach to identify new oncogenes involved in prostate cancer progression. Through integrated analyses of focal amplicons in large prostate cancer genomic and transcriptomic datasets as well as genes upregulated in metastasis, 276 putative oncogenes were enlisted into an in vivo gain-of-function tumorigenesis screen. Among the top positive hits, we conducted an in-depth functional analysis on Pygopus family PHD finger 2 (PYGO2), located in the amplicon at 1q21.3. PYGO2 overexpression enhances primary tumor growth and local invasion to draining lymph nodes. Conversely, PYGO2 depletion inhibits prostate cancer cell invasion in vitro and progression of primary tumor and metastasis in vivo. In clinical samples, PYGO2 upregulation associated with higher Gleason score and metastasis to lymph nodes and bone. Silencing PYGO2 expression in patient-derived xenograft models impairs tumor progression. Finally, PYGO2 is necessary to enhance the transcriptional activation in response to ligand-induced Wnt/β-catenin signaling. Together, our results indicate that PYGO2 functions as a driver oncogene in the 1q21.3 amplicon and may serve as a potential prognostic biomarker and therapeutic target for metastatic prostate cancer. Significance: Amplification/overexpression of PYGO2 may serve as a biomarker for prostate cancer progression and metastasis. Cancer Res; 78(14); 3823-33. ©2018 AACR
Compressing the two-particle Green's function using wavelets: Theory and application to the Hubbard atom
Precise algorithms capable of providing controlled solutions in the presence
of strong interactions are transforming the landscape of quantum many-body
physics. Particularly exciting breakthroughs are enabling the computation of
non-zero temperature correlation functions. However, computational challenges
arise due to constraints in resources and memory limitations, especially in
scenarios involving complex Green's functions and lattice effects. Leveraging
the principles of signal processing and data compression, this paper explores
the wavelet decomposition as a versatile and efficient method for obtaining
compact and resource-efficient representations of the many-body theory of
interacting systems. The effectiveness of the wavelet decomposition is
illustrated through its application to the representation of generalized
susceptibilities and self-energies in a prototypical interacting fermionic
system, namely the Hubbard model at half-filling in its atomic limit. These
results are the first proof-of-principle application of the wavelet compression
within the realm of many-body physics and demonstrate the potential of this
wavelet-based compression scheme for understanding the physics of correlated
electron systems. Comment: 25 pages, 16 figures, 2 table
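As a rough illustration of the compression idea in the abstract above, the following sketch applies an orthonormal 1D Haar wavelet transform to a signal, keeps only the largest coefficients, and reconstructs. It is a generic toy example on a plain array, not the paper's treatment of two-particle Green's functions; the choice of the Haar basis, the function names `haar_forward`, `haar_inverse` and `compress`, and the `keep` parameter are all assumptions.

```python
import numpy as np

def haar_forward(x):
    """Orthonormal Haar transform of a 1D array whose length is a power of two."""
    out = np.asarray(x, dtype=float).copy()
    n = out.size
    assert n > 0 and n & (n - 1) == 0, "length must be a power of two"
    length = n
    while length > 1:
        half = length // 2
        averages = (out[0:length:2] + out[1:length:2]) / np.sqrt(2.0)
        details = (out[0:length:2] - out[1:length:2]) / np.sqrt(2.0)
        out[:half] = averages
        out[half:length] = details
        length = half
    return out

def haar_inverse(coeffs):
    """Inverse of haar_forward."""
    out = np.asarray(coeffs, dtype=float).copy()
    n = out.size
    length = 2
    while length <= n:
        half = length // 2
        averages = out[:half].copy()
        details = out[half:length].copy()
        out[0:length:2] = (averages + details) / np.sqrt(2.0)
        out[1:length:2] = (averages - details) / np.sqrt(2.0)
        length *= 2
    return out

def compress(x, keep):
    """Keep the `keep` largest-magnitude Haar coefficients, zero the rest."""
    coeffs = haar_forward(x)
    assert 1 <= keep <= coeffs.size
    smallest = np.argsort(np.abs(coeffs))[:-keep]
    coeffs[smallest] = 0.0
    return coeffs

# Usage: a step-like signal is captured by very few Haar coefficients.
signal = np.concatenate([np.ones(32), 3.0 * np.ones(32)])
sparse_coeffs = compress(signal, keep=2)
reconstruction = haar_inverse(sparse_coeffs)
```

The point of the toy example is only that functions with localized sharp features can need far fewer wavelet coefficients than grid points; which wavelet family and thresholding rule suit a given physical quantity is a separate question that this sketch does not address.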