2,747 research outputs found
Exploring the Potential of Large Foundation Models for Open-Vocabulary HOI Detection
Open-vocabulary human-object interaction (HOI) detection, which is concerned
with the problem of detecting novel HOIs guided by natural language, is crucial
for understanding human-centric scenes. However, prior zero-shot HOI detectors
often employ the same levels of feature maps to model HOIs with varying
distances, leading to suboptimal performance in scenes containing human-object
pairs with a wide range of distances. In addition, these detectors primarily
rely on category names and overlook the rich contextual information that
language can provide, which is essential for capturing open vocabulary concepts
that are typically rare and not well-represented by category names alone. In
this paper, we introduce a novel end-to-end open vocabulary HOI detection
framework with conditional multi-level decoding and fine-grained semantic
enhancement (CMD-SE), harnessing the potential of Visual-Language Models
(VLMs). Specifically, we propose to model human-object pairs with different
distances with different levels of feature maps by incorporating a soft
constraint during the bipartite matching process. Furthermore, by leveraging
large language models (LLMs) such as GPT models, we exploit their extensive
world knowledge to generate descriptions of human body part states for various
interactions. Then we integrate the generalizable and fine-grained semantics of
human body parts to improve interaction recognition. Experimental results on
two datasets, SWIG-HOI and HICO-DET, demonstrate that our proposed method
achieves state-of-the-art results in open vocabulary HOI detection. The code
and models are available at https://github.com/ltttpku/CMD-SE-release
Recommended from our members
Water-Soluble 3D Covalent Organic Framework that Displays an Enhanced Enrichment Effect of Photosensitizers and Catalysts for the Reduction of Protons to H2.
Covalent organic frameworks (COFs) are emerging porous polymers that have 2D or 3D long-range ordering. Currently available COFs are typically insoluble or decompose upon dissolution, which remarkably restricts their practical implementations. For 3D COFs, the achievement of noninterpenetration, which maximizes their porosity-derived applications, also remains a challenge synthetically. Here, we report the synthesis of the first highly water-soluble 3D COF (sCOF-101) from irreversible polymerization of a preorganized supramolecular organic framework through cucurbit[8]uril (CB[8])-controlled [2 + 2] photodimerization. Synchrotron X-ray scattering and diffraction analyses confirm that sCOF-101 exhibits porosity periodicity, with a channel diameter of 2.3 nm, in both water and the solid state and retains the periodicity under both strongly acidic and basic conditions. As an ordered 3D polymer, sCOF-101 can enrich [Ru(bpy)3]2+ photosensitizers and redox-active polyoxometalates in water, which leads to remarkable increase of their photocatalytic activity for proton reduction to produce H2
The role of EGFR mutation as a prognostic factor in survival after diagnosis of brain metastasis in non-small cell lung cancer: A systematic review and meta-analysis
Abstract Background The brain is a common site for metastasis in non-small-cell lung cancer (NSCLC). This study was designed to evaluate the relationship between the mutational of the epidermal growth factor receptor (EGFR) and overall survival (OS) in NSCLC patients with brain metastases. Methods Searches were performed in PubMed, EmBase, and the Cochrane Library to identify studies evaluating the association of EGFR mutation with OS in NSCLC patients through September 2017. Results 4373 NSCLC patients with brain metastases in 18 studies were involved. Mutated EGFR associated with significantly improved OS compared with wild type. Subgroup analyses suggested that this relationship persisted in studies conducted in Eastern, with retrospective design, with sample size ≥500, mean age of patients ≥65.0 years, percentage male < 50.0%, percentage of patients receiving tyrosine kinase inhibitor ≥30.0%. Finally, although significant publication bias was observed using the Egger test, the results were not changed after adjustment using the trim and fill method. Conclusions This meta-analysis suggests that EGFR mutation is an important predictive factor linked to improved OS for NSCLC patients with brain metastases. It can serve as a useful index in the prognostic assessment of NSCLC patients with brain metastases
- …