371 research outputs found

    LLMs Meet VLMs: Boost Open Vocabulary Object Detection with Fine-grained Descriptors

    Full text link
    Inspired by the outstanding zero-shot capability of vision language models (VLMs) in image classification tasks, open-vocabulary object detection has attracted increasing interest by distilling the broad VLM knowledge into detector training. However, most existing open-vocabulary detectors learn by aligning region embeddings with categorical labels (e.g., bicycle) only, disregarding the capability of VLMs on aligning visual embeddings with fine-grained text description of object parts (e.g., pedals and bells). This paper presents DVDet, a Descriptor-Enhanced Open Vocabulary Detector that introduces conditional context prompts and hierarchical textual descriptors that enable precise region-text alignment as well as open-vocabulary detection training in general. Specifically, the conditional context prompt transforms regional embeddings into image-like representations that can be directly integrated into general open vocabulary detection training. In addition, we introduce large language models as an interactive and implicit knowledge repository which enables iterative mining and refining visually oriented textual descriptors for precise region-text alignment. Extensive experiments over multiple large-scale benchmarks show that DVDet outperforms the state-of-the-art consistently by large margins

    Domain Generalization via Balancing Training Difficulty and Model Capability

    Full text link
    Domain generalization (DG) aims to learn domain-generalizable models from one or multiple source domains that can perform well in unseen target domains. Despite its recent progress, most existing work suffers from the misalignment between the difficulty level of training samples and the capability of contemporarily trained models, leading to over-fitting or under-fitting in the trained generalization model. We design MoDify, a Momentum Difficulty framework that tackles the misalignment by balancing the seesaw between the model's capability and the samples' difficulties along the training process. MoDify consists of two novel designs that collaborate to fight against the misalignment while learning domain-generalizable models. The first is MoDify-based Data Augmentation which exploits an RGB Shuffle technique to generate difficulty-aware training samples on the fly. The second is MoDify-based Network Optimization which dynamically schedules the training samples for balanced and smooth learning with appropriate difficulty. Without bells and whistles, a simple implementation of MoDify achieves superior performance across multiple benchmarks. In addition, MoDify can complement existing methods as a plug-in, and it is generic and can work for different visual recognition tasks.Comment: 11 pages, 6 figures, Accepted by ICCV 202

    Halo: Estimation and Reduction of Hallucinations in Open-Source Weak Large Language Models

    Full text link
    Large Language Models (LLMs) have revolutionized Natural Language Processing (NLP). Although convenient for research and practical applications, open-source LLMs with fewer parameters often suffer from severe hallucinations compared to their larger counterparts. This paper focuses on measuring and reducing hallucinations in BLOOM 7B, a representative of such weaker open-source LLMs that are publicly available for research and commercial applications. We introduce HaloCheck, a lightweight BlackBox knowledge-free framework designed to quantify the severity of hallucinations in LLMs. Additionally, we explore techniques like knowledge injection and teacher-student approaches to alleviate hallucinations in low-parameter LLMs. Our experiments effectively demonstrate the reduction of hallucinations in challenging domains for these LLMs

    Rician Noise Removal via a Learned Dictionary

    Get PDF
    This paper proposes a new effective model for denoising images with Rician noise. The sparse representations of images have been shown to be efficient approaches for image processing. Inspired by this, we learn a dictionary from the noisy image and then combine the MAP model with it for Rician noise removal. For solving the proposed model, the primal-dual algorithm is applied and its convergence is studied. The computational results show that the proposed method is promising in restoring images with Rician noise

    Enhanced Interfacial Dzyaloshinskii-Moriya Interaction in annealed Pt/Co/MgO structures

    Get PDF
    The interfacial Dzyaloshinskii-Moriya interaction (iDMI) is attracting great interests for spintronics. An iDMI constant larger than 3 mJ/m^2 is expected to minimize the size of skyrmions and to optimize the DW dynamics. In this study, we experimentally demonstrate an enhanced iDMI in Pt/Co/X/MgO ultra-thin film structures with perpendicular magnetization. The iDMI constants were measured using a field-driven creep regime domain expansion method. The enhancement of iDMI with an atomically thin insertion of Ta and Mg is comprehensively understood with the help of ab-initio calculations. Thermal annealing has been used to crystallize the MgO thin layer for improving tunneling magneto-resistance (TMR), but interestingly it also provides a further increase of the iDMI constant. An increase of the iDMI constant up to 3.3 mJ/m^2 is shown, which could be promising for the scaling down of skyrmion electronics

    Detection of TMPRSS2 : ERG fusion gene in circulating prostate cancer cells

    Get PDF
    Creative Commons Attribution-NonCommercial-Share Alike 3.0 license (CC BY-NC SA)Aim: To investigate the existence of TMPRSS2:ERG fusion gene in circulating tumor cells (CTC) from prostate cancer patients and its potential in monitoring tumor metastasis. Methods: We analyzed the frequency of TMPRSS2: ERG and TMPRSS2:ETV1 transcripts in 27 prostate cancer biopsies from prostatectomies, and TMPRSS2:ERG transcripts in CTC isolated from 15 patients with advanced androgen independent disease using reverse transcription polymerase chain reaction (RT-PCR). Fluorescence in situ hybridization (FISH) was applied to analyze the genomic truncation of ERG, which is the result of TMPRSS2:ERG fusion in 10 of the 15 CTC samples. Results: TMPRSS2: ERG transcripts were found in 44% of our samples, but we did not detect expression of TMPRSS2:ETV1. Using FISH analysis we detected chromosomal rearrangements affecting the ERG gene in 6 of 10 CTC samples, including 1 case with associated TMPRSS2:ERG fusion at the primary site. However, TMPRSS2:ERG transcripts were not detected in any of the 15 CTC samples, including the 10 cases analyzed by FISH. Conclusion: Although further study is required to address the association between TMPRSS2:ERG fusion and prostate cancer metastasis, detection of genomic truncation of the ERG gene by FISH analysis could be useful for monitoring the appearance of CTC and the potential for prostate cancer metastasis.Peer reviewedFinal Published versio

    Increased Variation in Body Weight and Food Intake Is Related to Increased Dietary Fat but Not Increased Carbohydrate or Protein in Mice

    Get PDF
    Funding This study was funded by the National Key R&D Program of China (2019YFA0801900) to JS and the Postdoctoral Innovation Fund (2021) to YW. The original diet exposure experiment was funded by the Chinese Academy of Sciences Strategic Program (XDB13030100). JS was also supported during this work by a PIFI professorial fellowship from CAS and a Wolfson merit award from the UK Royal Society. CORRECTION article Front. Nutr., 21 October 2022 Sec. Nutrition and Metabolism https://doi.org/10.3389/fnut.2022.1049766 Corrigendum: Increased variation in body weight and food intake is related to increased dietary fat but not increased carbohydrate or protein in micePeer reviewedPublisher PD

    Review of Associations between Built Environment Characteristics and Severe Acute Respiratory Syndrome Coronavirus 2 Infection Risk.

    Get PDF
    The coronavirus disease 2019 pandemic has stimulated intensive research interest in its transmission pathways and infection factors, e.g., socioeconomic and demographic characteristics, climatology, baseline health conditions or pre-existing diseases, and government policies. Meanwhile, some empirical studies suggested that built environment attributes may be associated with the transmission mechanism and infection risk of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). However, no review has been conducted to explore the effect of built environment characteristics on the infection risk. This research gap prevents government officials and urban planners from creating effective urban design guidelines to contain SARS-CoV-2 infections and face future pandemic challenges. This review summarizes evidence from 25 empirical studies and provides an overview of the effect of built environment on SARS-CoV-2 infection risk. Virus infection risk was positively associated with the density of commercial facilities, roads, and schools and with public transit accessibility, whereas it was negatively associated with the availability of green spaces. This review recommends several directions for future studies, namely using longitudinal research design and individual-level data, considering multilevel factors and extending to diversified geographic areas
    corecore