6 research outputs found

    Semi-Supervised Domain Generalization for Object Detection via Language-Guided Feature Alignment

    Full text link
    Existing domain adaptation (DA) and generalization (DG) methods in object detection enforce feature alignment in the visual space but face challenges like object appearance variability and scene complexity, which make it difficult to distinguish between objects and achieve accurate detection. In this paper, we are the first to address the problem of semi-supervised domain generalization by exploring vision-language pre-training and enforcing feature alignment through the language space. We employ a novel Cross-Domain Descriptive Multi-Scale Learning (CDDMSL) aiming to maximize the agreement between descriptions of an image presented with different domain-specific characteristics in the embedding space. CDDMSL significantly outperforms existing methods, achieving 11.7% and 7.5% improvement in DG and DA settings, respectively. Comprehensive analysis and ablation studies confirm the effectiveness of our method, positioning CDDMSL as a promising approach for domain generalization in object detection tasks.Comment: Accepted at BMVC 202

    Incorporating Geo-Diverse Knowledge into Prompting for Increased Geographical Robustness in Object Recognition

    Full text link
    Existing object recognition models have been shown to lack robustness in diverse geographical scenarios due to domain shifts in design and context. Class representations need to be adapted to more accurately reflect an object concept under these shifts. In the absence of training data from target geographies, we hypothesize that geographically diverse descriptive knowledge of categories can enhance robustness. For this purpose, we explore the feasibility of probing a large language model for geography-based object knowledge, and we examine the effects of integrating knowledge into zero-shot and learnable soft prompting with CLIP. Within this exploration, we propose geography knowledge regularization to ensure that soft prompts trained on a source set of geographies generalize to an unseen target set. Accuracy gains over prompting baselines on DollarStreet while training only on Europe data are up to +2.8/1.2/1.6 on target data from Africa/Asia/Americas, and +4.6 overall on the hardest classes. Competitive performance is shown vs. few-shot target training, and analysis is provided to direct future study of geographical robustness.Comment: To appear in IEEE/CVF Computer Vision and Pattern Recognition Conference (CVPR), 202

    Prevalence of Dementia Among Older Patients: A Hospital-Based Study in Iran

    Get PDF
    Background: Dementia constitutes a public health hazard in developing countries. The aim of this study was to evaluate the prevalence of dementia and its associated factors in older hospitalized patients. Methods: The participants of this cross-sectional study consisted of older patients admitted to medical wards in Rasoul-e Akram hospital in Tehran, Iran. Mini-Mental State Examination, Mini-Cog test, Geriatric Depression Scale, Activities of Daily Living-Instrumental Activities of Daily Living (ADL-IADL) scale, and socioeconomic questionnaires were used. Results: A total of 205 elderly inpatients were included. The mean age was 71.33 ± 7.35 years; 63.4 of the participants had normal cognitive function, while 36.6 had some degree of cognitive impairment. There was a statistically significant relationship between gender, age, number of children, and occupation and the prevalence of dementia. Conclusion: Appropriate cognitive screening of older patients upon admission to hospitals could help identify potential adverse events and enhance the quality of care for patients with comorbid dementia. © The Author(s) 2019
    corecore