159 research outputs found

    Self-training solutions for the ICCV 2023 GeoNet Challenge

    Full text link
    GeoNet is a recently proposed domain adaptation benchmark consisting of three challenges (i.e., GeoUniDA, GeoImNet, and GeoPlaces). Each challenge contains images collected from the USA and Asia where there are huge geographical gaps. Our solution adopts a two-stage source-free domain adaptation framework with a Swin Transformer backbone to achieve knowledge transfer from the USA (source) domain to Asia (target) domain. In the first stage, we train a source model using labeled source data with a re-sampling strategy and two types of cross-entropy loss. In the second stage, we generate pseudo labels for unlabeled target data to fine-tune the model. Our method achieves an H-score of 74.56% and ultimately ranks 1st in the GeoUniDA challenge. In GeoImNet and GeoPlaces challenges, our solution also reaches a top-3 accuracy of 64.46% and 51.23%, respectively.Comment: technical report; 1st in the ICCV-2023 GeoUniDA challeng

    Modern International Park City and Ecological Civilization Education Practice: Taking Chengdu Tianfu Greenway as the Core

    Get PDF
    In the comprehensive development of modern park city and ecological civilization education, Chengdu has gradually explored its own path with international characteristics. The positioning of Chengdu as a “modern and beautiful park city” is an innovation proposed by Mr. Xi Jinping in recent years. This concept not only combines the geographical location of Chengdu with the regional characteristics of the western environment, but also highly respects the historical laws of the development of Chengdu’s Bashu civilization for thousands of years. At the same time, it also absorbs Howard’s “pastoral city” dream and Mountford, the theme of Gedde’s “Organic City Theory” has corrected the shortcomings of Le Corbusier’s mechanized functional space view and presented distinctive Chinese characteristics in theoretical innovation and the development of ecological civilization education. The outstanding performance is the organic integration of the development of Tianfu ecological civilization, the cultural education of Bashu and the aesthetic design practice, the creation of the international brand image of Chengdu’s “three cities and three capitals” and the innovative practical experience results of the dream of “beautiful and livable park city”

    Self-Aligned Concave Curve: Illumination Enhancement for Unsupervised Adaptation

    Full text link
    Low light conditions not only degrade human visual experience, but also reduce the performance of downstream machine analytics. Although many works have been designed for low-light enhancement or domain adaptive machine analytics, the former considers less on high-level vision, while the latter neglects the potential of image-level signal adjustment. How to restore underexposed images/videos from the perspective of machine vision has long been overlooked. In this paper, we are the first to propose a learnable illumination enhancement model for high-level vision. Inspired by real camera response functions, we assume that the illumination enhancement function should be a concave curve, and propose to satisfy this concavity through discrete integral. With the intention of adapting illumination from the perspective of machine vision without task-specific annotated data, we design an asymmetric cross-domain self-supervised training strategy. Our model architecture and training designs mutually benefit each other, forming a powerful unsupervised normal-to-low light adaptation framework. Comprehensive experiments demonstrate that our method surpasses existing low-light enhancement and adaptation methods and shows superior generalization on various low-light vision tasks, including classification, detection, action recognition, and optical flow estimation. Project website: https://daooshee.github.io/SACC-Website/Comment: This paper has been accepted by ACM Multimedia 202

    Improving Zero-Shot Generalization for CLIP with Synthesized Prompts

    Full text link
    With the growing interest in pretrained vision-language models like CLIP, recent research has focused on adapting these models to downstream tasks. Despite achieving promising results, most existing methods require labeled data for all classes, which may not hold in real-world applications due to the long tail and Zipf's law. For example, some classes may lack labeled data entirely, such as emerging concepts. To address this problem, we propose a plug-and-play generative approach called \textbf{S}ynt\textbf{H}es\textbf{I}zed \textbf{P}rompts~(\textbf{SHIP}) to improve existing fine-tuning methods. Specifically, we follow variational autoencoders to introduce a generator that reconstructs the visual features by inputting the synthesized prompts and the corresponding class names to the textual encoder of CLIP. In this manner, we easily obtain the synthesized features for the remaining label-only classes. Thereafter, we fine-tune CLIP with off-the-shelf methods by combining labeled and synthesized features. Extensive experiments on base-to-new generalization, cross-dataset transfer learning, and generalized zero-shot learning demonstrate the superiority of our approach. The code is available at \url{https://github.com/mrflogs/SHIP}.Comment: Accepted by ICCV 202

    Towards Realistic Unsupervised Fine-tuning with CLIP

    Full text link
    The emergence of vision-language models (VLMs), such as CLIP, has spurred a significant research effort towards their application for downstream supervised learning tasks. Although some previous studies have explored the unsupervised fine-tuning of CLIP, they often rely on prior knowledge in the form of class names associated with ground truth labels. In this paper, we delve into a realistic unsupervised fine-tuning scenario by assuming that the unlabeled data might contain out-of-distribution samples from unknown classes. Furthermore, we emphasize the importance of simultaneously enhancing out-of-distribution detection capabilities alongside the recognition of instances associated with predefined class labels. To tackle this problem, we present a simple, efficient, and effective fine-tuning approach called Universal Entropy Optimization (UEO). UEO leverages sample-level confidence to approximately minimize the conditional entropy of confident instances and maximize the marginal entropy of less confident instances. Apart from optimizing the textual prompts, UEO also incorporates optimization of channel-wise affine transformations within the visual branch of CLIP. Through extensive experiments conducted across 15 domains and 4 different types of prior knowledge, we demonstrate that UEO surpasses baseline methods in terms of both generalization and out-of-distribution detection

    Effects of slow and regular breathing exercise on cardiopulmonary coupling and blood pressure

    Get PDF
    Investigation of the interaction between cardiovascular variables and respiration provides a quantitative and noninvasive approach to assess the autonomic control of cardiovascular function. The aim of this paper is to investigate the changes of cardiopulmonary coupling (CPC), blood pressure (BP) and pulse transit time (PTT) during a stepwise-paced breathing (SPB) procedure (spontaneous breathing followed by paced breathing at 14, 12.5, 11, 9.5, 8 and 7 breaths per minute, 3 min each) and gain insights into the characteristics of slow breathing exercises. RR interval, respiration, BP and PTT are collected during the SPB procedure (48 healthy subjects, 27 ± 6 years). CPC is assessed through investigating both the phase and amplitude dynamics between the respiration-induced components from RR interval and respiration by the approach of ensemble empirical mode decomposition. It was found that even though the phase synchronization and amplitude oscillation of CPC were both enhanced by the SPB procedure, phase coupling does not increase monotonically along with the amplitude oscillation during the whole procedure. Meanwhile, BP was reduced significantly by the SPB procedure (SBP: from 122.0 ± 13.4 to 114.2 ± 14.9 mmHg, p < 0.001, DBP: from 82.2 ± 8.6 to 77.0 ± 9.8 mmHg, p < 0.001, PTT: from 172.8 ± 20.1 to 176.8 ± 19.2 ms, p < 0.001). Our results demonstrate that the SPB procedure can reduce BP and lengthen PTT significantly. Compared with amplitude dynamics, phase dynamics is a different marker for CPC analysis in reflecting cardiorespiratory coherence during slow breathing exercise. Our study provides a methodology to practice slow breathing exercise, including the setting of target breathing rate, change of CPC and the importance of regular breathing. The applications and usability of the study results have also been discussed.National Natural Science Foundation (China) (Grant Number: 61471398)Beijing Natural Science Foundation (Grant Number: 3122034)General Logistics Science Foundation (Grant Number: CWS11C108)National Key Technology Research and Development Program (Grant Numbers: 2013BAI03B04, 2013BAI03B05

    A Hard-to-Beat Baseline for Training-free CLIP-based Adaptation

    Full text link
    Contrastive Language-Image Pretraining (CLIP) has gained popularity for its remarkable zero-shot capacity. Recent research has focused on developing efficient fine-tuning methods, such as prompt learning and adapter, to enhance CLIP's performance in downstream tasks. However, these methods still require additional training time and computational resources, which is undesirable for devices with limited resources. In this paper, we revisit a classical algorithm, Gaussian Discriminant Analysis (GDA), and apply it to the downstream classification of CLIP. Typically, GDA assumes that features of each class follow Gaussian distributions with identical covariance. By leveraging Bayes' formula, the classifier can be expressed in terms of the class means and covariance, which can be estimated from the data without the need for training. To integrate knowledge from both visual and textual modalities, we ensemble it with the original zero-shot classifier within CLIP. Extensive results on 17 datasets validate that our method surpasses or achieves comparable results with state-of-the-art methods on few-shot classification, imbalanced learning, and out-of-distribution generalization. In addition, we extend our method to base-to-new generalization and unsupervised learning, once again demonstrating its superiority over competing approaches. Our code is publicly available at \url{https://github.com/mrflogs/ICLR24}.Comment: Accepted by ICLR 202

    Adaptive motion artefact reduction in respiration and ECG signals for wearable healthcare monitoring systems

    Get PDF
    Wearable healthcare monitoring systems (WHMSs) have received significant interest from both academia and industry with the advantage of non-intrusive and ambulatory monitoring. The aim of this paper is to investigate the use of an adaptive filter to reduce motion artefact (MA) in physiological signals acquired by WHMSs. In our study, a WHMS is used to acquire ECG, respiration and triaxial accelerometer (ACC) signals during incremental treadmill and cycle ergometry exercises. With these signals, performances of adaptive MA cancellation are evaluated in both respiration and ECG signals. To achieve effective and robust MA cancellation, three axial outputs of the ACC are employed to estimate the MA by a bank of gradient adaptive Laguerre lattice (GALL) filter, and the outputs of the GALL filters are further combined with time-varying weights determined by a Kalman filter. The results show that for the respiratory signals, MA component can be reduced and signal quality can be improved effectively (the power ratio between the MA-corrupted respiratory signal and the adaptive filtered signal was 1.31 in running condition, and the corresponding signal quality was improved from 0.77 to 0.96). Combination of the GALL and Kalman filters can achieve robust MA cancellation without supervised selection of the reference axis from the ACC. For ECG, the MA component can also be reduced by adaptive filtering. The signal quality, however, could not be improved substantially just by the adaptive filter with the ACC outputs as the reference signals.Municipal Science & Technology Commission. Beijing Natural Science Foundation (Grants 3102028 and 3122034)General Logistics Science Foundation (Grant CWS11C108)National Institutes of Health (U.S.) (National Institute of General Medical Sciences (U.S.). Grant R01- EB001659)National Institutes of Health (U.S.) (National Institute for Biomedical Imaging and Bioengineering (U.S.) Cooperative Agreement U01- EB-008577

    Research Hotspot and Trend Analysis of China’s Elderlyoriented Smart Products

    Get PDF
    The Chinese government attaches great importance to the current situation of population aging, so it has introduced relevant aging policies. The combination of new technologies has played a positive role in the development of Elderly-oriented smart products of enterprises. Based on the research literature on Elderly-oriented smart products collected in CNKI database in recent ten years (2012-2022), this paper makes a quantitative analysis on the research results of Elderly-oriented smart products in China with the help of CiteSpace visual analysis software. Through research hotspots and evolution trends, it is found that the theme can be extended: the upgrading and construction of Elderly-oriented smart products will be a hot research topic in the academic community in the future
    • …
    corecore