2,366 research outputs found

    A Novel Representation for Two-dimensional Image Structures

    This paper presents a novel approach to modeling two-dimensional (2D) image structures. To obtain more degrees of freedom, a 2D image signal is embedded into a certain geometric algebra. By coupling methods from differential geometry, tensor algebra, the monogenic signal and quadrature filters, we design a general model for 2D structures as the monogenic extension of a curvature tensor. Based on it, a local representation for the intrinsically two-dimensional (i2D) structure is derived as the monogenic curvature signal. From it, independent features of local amplitude, phase and orientation are simultaneously extracted. In addition, a monogenic curvature scale-space can be built by applying a Poisson kernel to the monogenic curvature signal. Compared with related work, the main advantage of our approach lies in the rotationally invariant phase evaluation of 2D structures in a multi-scale framework, which gives access to phase-based processing in many computer vision tasks.

    Signal Modeling for Two-Dimensional Image Structures and Scale-Space Based Image Analysis

    Model-based image representation plays an important role in many computer vision tasks. Consequently, it is highly significant to model image structures with more powerful representation capabilities. The literature contains a large body of research on intensity-based modeling. However, most of these approaches suffer from illumination variation. Phase information, on the other hand, carries the most essential structural information of the original signal and has the advantage of being invariant to brightness changes. Therefore, phase-based image analysis is advantageous compared with purely intensity-based approaches. This thesis proposes novel representations for 2D image structures, from which local features useful for phase-based image analysis can be extracted. The first approach presents a 2D rotationally invariant quadrature filter. This model is able to handle superimposed intrinsically two-dimensional (i2D) patterns with flexible angles of intersection. Hence, it can be regarded as an extension of the structure multivector. The second approach is the monogenic curvature tensor. By coupling methods from differential geometry, tensor algebra, the monogenic signal and quadrature filters, we design a general model for 2D structures as the monogenic extension of a curvature tensor. Based on it, local representations for the intrinsically one-dimensional (i1D) and i2D structures are derived as the monogenic signal and the generalized monogenic curvature signal, respectively. From them, independent features of local amplitude, phase and orientation are simultaneously extracted. In addition, a generalized monogenic curvature scale-space can be built by applying a Poisson kernel to the monogenic curvature tensor. Compared with related work, the main advantage of our approach lies in the rotationally invariant phase evaluation of 2D structures in a multi-scale framework, which gives access to phase-based processing in many computer vision tasks. To demonstrate the efficiency and power of the theoretical framework, several computer vision applications are presented: phase-based image reconstruction, detection of i2D image structures using local phase, and optical flow estimation using the monogenic curvature tensor.

    One for All, All for One: Learning and Transferring User Embeddings for Cross-Domain Recommendation

    Cross-domain recommendation is an important method to improve recommender system performance, especially when observations in target domains are sparse. However, most existing techniques focus on single-target or dual-target cross-domain recommendation (CDR) and are hard to generalize to CDR with multiple target domains. In addition, the negative transfer problem is prevalent in CDR: the recommendation performance in a target domain may not always be enhanced by knowledge learned from a source domain, especially when the source domain has sparse data. In this study, we propose CAT-ART, a multi-target CDR method that learns to improve recommendations in all participating domains through representation learning and embedding transfer. Our method consists of two parts: a self-supervised Contrastive AuToencoder (CAT) framework that generates global user embeddings based on information from all participating domains, and an Attention-based Representation Transfer (ART) framework that transfers domain-specific user embeddings from other domains to assist with target domain recommendation. CAT-ART boosts the recommendation performance in any target domain through the combined use of the learned global user representation and knowledge transferred from other domains, in addition to the original user embedding in the target domain. We conducted extensive experiments on a collected real-world CDR dataset spanning 5 domains and involving a million users. Experimental results demonstrate the superiority of the proposed method over a range of prior art. We further conducted ablation studies to verify the effectiveness of the proposed components. Our collected dataset will be open-sourced to facilitate future research in the field of multi-domain recommender systems and user modeling. Comment: 9 pages, accepted by WSDM 202

    Align Yourself: Self-supervised Pre-training for Fine-grained Recognition via Saliency Alignment

    Self-supervised contrastive learning has demonstrated great potential in learning visual representations. Despite its success on various downstream tasks such as image classification and object detection, self-supervised pre-training for fine-grained scenarios is not fully explored. In this paper, we first point out that current contrastive methods are prone to memorizing background/foreground texture and are therefore limited in localizing the foreground object. Analysis suggests that learning to extract discriminative texture information and learning to localize are equally crucial for self-supervised pre-training in fine-grained scenarios. Based on our findings, we introduce cross-view saliency alignment (CVSA), a contrastive learning framework that first crops and swaps saliency regions of images as a novel form of view generation and then guides the model to localize the foreground object via a cross-view alignment loss. Extensive experiments on four popular fine-grained classification benchmarks show that CVSA significantly improves the learned representation. Comment: The second version of CVSA. 10 pages, 4 figures

    Architecture-Agnostic Masked Image Modeling -- From ViT back to CNN

    Masked image modeling (MIM), an emerging self-supervised pre-training method, has shown impressive success across numerous downstream vision tasks with Vision Transformers (ViTs). Its underlying idea is simple: a portion of the input image is randomly masked out and then reconstructed via the pretext task. However, the working principle behind MIM is not well explained, and previous studies insist that MIM primarily works for the Transformer family but is incompatible with CNNs. In this paper, we first study interactions among patches to understand what knowledge is learned and how it is acquired via the MIM task. We observe that MIM essentially teaches the model to learn better middle-order interactions among patches and extract more generalized features. Based on this observation, we propose an Architecture-Agnostic Masked Image Modeling framework (A²MIM), which is compatible with both Transformers and CNNs in a unified way. Extensive experiments on popular benchmarks show that A²MIM learns better representations without explicit design and endows the backbone model with stronger capability to transfer to various downstream tasks for both Transformers and CNNs. Comment: Preprint under review (update reversion). The source code will be released in https://github.com/Westlake-AI/openmixu

    Shape evolution and bubble formation of acoustically levitated drops

    In this study, we investigated the shape evolution and bubble formation of acoustically levitated drops upon increasing the sound intensity. A levitated liquid drop evolves progressively from an oblate spheroidal shape to a flattened film to a thin bowl-shaped film, eventually forming a closed bubble. Through systematic experiments, numerical simulation, and scaling analysis, we demonstrate that the buckled geometry of the liquid film can drastically enhance the suction effect of acoustic radiation pressure at its rim, forming a significant pressure gradient inside the film which causes an abrupt area expansion and bubble formation. Our results provide the mechanical origin responsible for the shape evolution and bubble formation of acoustically levitated drops, and highlight the role of buckled geometry in the levitation and manipulation of liquid films in an ultrasound field.

    Charge Measurement of Cosmic Ray Nuclei with the Plastic Scintillator Detector of DAMPE

    One of the main purposes of the DArk Matter Particle Explorer (DAMPE) is to measure cosmic ray nuclei up to several tens of TeV or beyond, whose origin and propagation remain a hot topic in astrophysics. The Plastic Scintillator Detector (PSD) on top of DAMPE is designed to measure the charges of cosmic ray nuclei from H to Fe and serves as a veto detector for discriminating gamma-rays from charged particles. In this paper we propose a charge reconstruction procedure to optimize the PSD performance in charge measurement. After a brief description of the structure and operational principle of the PSD, the essentials of our approach are described in detail, including track finding, alignment of the PSD, light attenuation correction, and quenching and equalization correction. Our results show that the PSD works very well and almost all the elements in cosmic rays from H to Fe are clearly identified in the charge spectrum. Comment: 20 pages, 4 figures

    Giant thermal transport tuning at a metal/ferroelectric interface

    Interfacial thermal transport plays a prominent role in the thermal management of nanoscale objects and is of fundamental importance for basic research and nanodevices. At metal/insulator interfaces, a configuration commonly found in electronic devices, heat transport strongly depends upon the effective energy transfer from thermalized electrons in the metal to the phonons in the insulator. However, the mechanism of interfacial electron–phonon coupling and thermal transport at metal/insulator interfaces is not well understood. Here, the observation of a substantial enhancement of the interfacial thermal resistance and the important role of surface charges at the metal/ferroelectric interface in an Al/BiFeO3 membrane are reported. By applying uniaxial strain, the interfacial thermal resistance can be varied substantially (up to an order of magnitude), which is attributed to the renormalized interfacial electron–phonon coupling caused by the charge redistribution at the interface due to the polarization rotation. These results imply that surface charges at a metal/insulator interface can substantially enhance the interfacial electron–phonon-mediated thermal coupling, providing a new route to optimize the thermal transport performance in next-generation nanodevices, power electronics, and thermal logic devices. Peer Reviewed. Postprint (author's final draft)

    An In Vivo Screen Identifies PYGO2 as a Driver for Metastatic Prostate Cancer

    Advanced prostate cancer displays conspicuous chromosomal instability and rampant copy number aberrations, yet the identity of functional drivers resident in many amplicons remains elusive. Here, we implemented a functional genomics approach to identify new oncogenes involved in prostate cancer progression. Through integrated analyses of focal amplicons in large prostate cancer genomic and transcriptomic datasets as well as genes upregulated in metastasis, 276 putative oncogenes were enlisted into an in vivo gain-of-function tumorigenesis screen. Among the top positive hits, we conducted an in-depth functional analysis on Pygopus family PHD finger 2 (PYGO2), located in the amplicon at 1q21.3. PYGO2 overexpression enhances primary tumor growth and local invasion to draining lymph nodes. Conversely, PYGO2 depletion inhibits prostate cancer cell invasion in vitro and progression of primary tumor and metastasis in vivo. In clinical samples, PYGO2 upregulation was associated with higher Gleason score and metastasis to lymph nodes and bone. Silencing PYGO2 expression in patient-derived xenograft models impairs tumor progression. Finally, PYGO2 is necessary to enhance the transcriptional activation in response to ligand-induced Wnt/β-catenin signaling. Together, our results indicate that PYGO2 functions as a driver oncogene in the 1q21.3 amplicon and may serve as a potential prognostic biomarker and therapeutic target for metastatic prostate cancer. Significance: Amplification/overexpression of PYGO2 may serve as a biomarker for prostate cancer progression and metastasis. Cancer Res; 78(14); 3823-33. ©2018 AACR.

    Compressing the two-particle Green's function using wavelets: Theory and application to the Hubbard atom

    Precise algorithms capable of providing controlled solutions in the presence of strong interactions are transforming the landscape of quantum many-body physics. Particularly exciting breakthroughs are enabling the computation of non-zero temperature correlation functions. However, computational challenges arise from limited computing resources and memory, especially in scenarios involving complex Green's functions and lattice effects. Leveraging the principles of signal processing and data compression, this paper explores the wavelet decomposition as a versatile and efficient method for obtaining compact and resource-efficient representations of the many-body theory of interacting systems. The effectiveness of the wavelet decomposition is illustrated through its application to the representation of generalized susceptibilities and self-energies in a prototypical interacting fermionic system, namely the Hubbard model at half-filling in its atomic limit. These results are the first proof-of-principle application of wavelet compression within the realm of many-body physics and demonstrate the potential of this wavelet-based compression scheme for understanding the physics of correlated electron systems. Comment: 25 pages, 16 figures, 2 tables