159 research outputs found
Self-training solutions for the ICCV 2023 GeoNet Challenge
GeoNet is a recently proposed domain adaptation benchmark consisting of three
challenges (i.e., GeoUniDA, GeoImNet, and GeoPlaces). Each challenge contains
images collected from the USA and Asia where there are huge geographical gaps.
Our solution adopts a two-stage source-free domain adaptation framework with a
Swin Transformer backbone to achieve knowledge transfer from the USA (source)
domain to Asia (target) domain. In the first stage, we train a source model
using labeled source data with a re-sampling strategy and two types of
cross-entropy loss. In the second stage, we generate pseudo labels for
unlabeled target data to fine-tune the model. Our method achieves an H-score of
74.56% and ultimately ranks 1st in the GeoUniDA challenge. In GeoImNet and
GeoPlaces challenges, our solution also reaches a top-3 accuracy of 64.46% and
51.23%, respectively.Comment: technical report; 1st in the ICCV-2023 GeoUniDA challeng
Modern International Park City and Ecological Civilization Education Practice: Taking Chengdu Tianfu Greenway as the Core
In the comprehensive development of modern park city and ecological civilization education, Chengdu has gradually explored its own path with international characteristics. The positioning of Chengdu as a “modern and beautiful park city” is an innovation proposed by Mr. Xi Jinping in recent years. This concept not only combines the geographical location of Chengdu with the regional characteristics of the western environment, but also highly respects the historical laws of the development of Chengdu’s Bashu civilization for thousands of years. At the same time, it also absorbs Howard’s “pastoral city” dream and Mountford, the theme of Gedde’s “Organic City Theory” has corrected the shortcomings of Le Corbusier’s mechanized functional space view and presented distinctive Chinese characteristics in theoretical innovation and the development of ecological civilization education. The outstanding performance is the organic integration of the development of Tianfu ecological civilization, the cultural education of Bashu and the aesthetic design practice, the creation of the international brand image of Chengdu’s “three cities and three capitals” and the innovative practical experience results of the dream of “beautiful and livable park city”
Self-Aligned Concave Curve: Illumination Enhancement for Unsupervised Adaptation
Low light conditions not only degrade human visual experience, but also
reduce the performance of downstream machine analytics. Although many works
have been designed for low-light enhancement or domain adaptive machine
analytics, the former considers less on high-level vision, while the latter
neglects the potential of image-level signal adjustment. How to restore
underexposed images/videos from the perspective of machine vision has long been
overlooked. In this paper, we are the first to propose a learnable illumination
enhancement model for high-level vision. Inspired by real camera response
functions, we assume that the illumination enhancement function should be a
concave curve, and propose to satisfy this concavity through discrete integral.
With the intention of adapting illumination from the perspective of machine
vision without task-specific annotated data, we design an asymmetric
cross-domain self-supervised training strategy. Our model architecture and
training designs mutually benefit each other, forming a powerful unsupervised
normal-to-low light adaptation framework. Comprehensive experiments demonstrate
that our method surpasses existing low-light enhancement and adaptation methods
and shows superior generalization on various low-light vision tasks, including
classification, detection, action recognition, and optical flow estimation.
Project website: https://daooshee.github.io/SACC-Website/Comment: This paper has been accepted by ACM Multimedia 202
Improving Zero-Shot Generalization for CLIP with Synthesized Prompts
With the growing interest in pretrained vision-language models like CLIP,
recent research has focused on adapting these models to downstream tasks.
Despite achieving promising results, most existing methods require labeled data
for all classes, which may not hold in real-world applications due to the long
tail and Zipf's law. For example, some classes may lack labeled data entirely,
such as emerging concepts. To address this problem, we propose a plug-and-play
generative approach called \textbf{S}ynt\textbf{H}es\textbf{I}zed
\textbf{P}rompts~(\textbf{SHIP}) to improve existing fine-tuning methods.
Specifically, we follow variational autoencoders to introduce a generator that
reconstructs the visual features by inputting the synthesized prompts and the
corresponding class names to the textual encoder of CLIP. In this manner, we
easily obtain the synthesized features for the remaining label-only classes.
Thereafter, we fine-tune CLIP with off-the-shelf methods by combining labeled
and synthesized features. Extensive experiments on base-to-new generalization,
cross-dataset transfer learning, and generalized zero-shot learning demonstrate
the superiority of our approach. The code is available at
\url{https://github.com/mrflogs/SHIP}.Comment: Accepted by ICCV 202
Towards Realistic Unsupervised Fine-tuning with CLIP
The emergence of vision-language models (VLMs), such as CLIP, has spurred a
significant research effort towards their application for downstream supervised
learning tasks. Although some previous studies have explored the unsupervised
fine-tuning of CLIP, they often rely on prior knowledge in the form of class
names associated with ground truth labels. In this paper, we delve into a
realistic unsupervised fine-tuning scenario by assuming that the unlabeled data
might contain out-of-distribution samples from unknown classes. Furthermore, we
emphasize the importance of simultaneously enhancing out-of-distribution
detection capabilities alongside the recognition of instances associated with
predefined class labels.
To tackle this problem, we present a simple, efficient, and effective
fine-tuning approach called Universal Entropy Optimization (UEO). UEO leverages
sample-level confidence to approximately minimize the conditional entropy of
confident instances and maximize the marginal entropy of less confident
instances. Apart from optimizing the textual prompts, UEO also incorporates
optimization of channel-wise affine transformations within the visual branch of
CLIP. Through extensive experiments conducted across 15 domains and 4 different
types of prior knowledge, we demonstrate that UEO surpasses baseline methods in
terms of both generalization and out-of-distribution detection
Effects of slow and regular breathing exercise on cardiopulmonary coupling and blood pressure
Investigation of the interaction between cardiovascular variables and respiration provides a quantitative and noninvasive approach to assess the autonomic control of cardiovascular function. The aim of this paper is to investigate the changes of cardiopulmonary coupling (CPC), blood pressure (BP) and pulse transit time (PTT) during a stepwise-paced breathing (SPB) procedure (spontaneous breathing followed by paced breathing at 14, 12.5, 11, 9.5, 8 and 7 breaths per minute, 3 min each) and gain insights into the characteristics of slow breathing exercises. RR interval, respiration, BP and PTT are collected during the SPB procedure (48 healthy subjects, 27 ± 6 years). CPC is assessed through investigating both the phase and amplitude dynamics between the respiration-induced components from RR interval and respiration by the approach of ensemble empirical mode decomposition. It was found that even though the phase synchronization and amplitude oscillation of CPC were both enhanced by the SPB procedure, phase coupling does not increase monotonically along with the amplitude oscillation during the whole procedure. Meanwhile, BP was reduced significantly by the SPB procedure (SBP: from 122.0 ± 13.4 to 114.2 ± 14.9 mmHg, p < 0.001, DBP: from 82.2 ± 8.6 to 77.0 ± 9.8 mmHg, p < 0.001, PTT: from 172.8 ± 20.1 to 176.8 ± 19.2 ms, p < 0.001). Our results demonstrate that the SPB procedure can reduce BP and lengthen PTT significantly. Compared with amplitude dynamics, phase dynamics is a different marker for CPC analysis in reflecting cardiorespiratory coherence during slow breathing exercise. Our study provides a methodology to practice slow breathing exercise, including the setting of target breathing rate, change of CPC and the importance of regular breathing. The applications and usability of the study results have also been discussed.National Natural Science Foundation (China) (Grant Number: 61471398)Beijing Natural Science Foundation (Grant Number: 3122034)General Logistics Science Foundation (Grant Number: CWS11C108)National Key Technology Research and Development Program (Grant Numbers: 2013BAI03B04, 2013BAI03B05
A Hard-to-Beat Baseline for Training-free CLIP-based Adaptation
Contrastive Language-Image Pretraining (CLIP) has gained popularity for its
remarkable zero-shot capacity. Recent research has focused on developing
efficient fine-tuning methods, such as prompt learning and adapter, to enhance
CLIP's performance in downstream tasks. However, these methods still require
additional training time and computational resources, which is undesirable for
devices with limited resources. In this paper, we revisit a classical
algorithm, Gaussian Discriminant Analysis (GDA), and apply it to the downstream
classification of CLIP. Typically, GDA assumes that features of each class
follow Gaussian distributions with identical covariance. By leveraging Bayes'
formula, the classifier can be expressed in terms of the class means and
covariance, which can be estimated from the data without the need for training.
To integrate knowledge from both visual and textual modalities, we ensemble it
with the original zero-shot classifier within CLIP. Extensive results on 17
datasets validate that our method surpasses or achieves comparable results with
state-of-the-art methods on few-shot classification, imbalanced learning, and
out-of-distribution generalization. In addition, we extend our method to
base-to-new generalization and unsupervised learning, once again demonstrating
its superiority over competing approaches. Our code is publicly available at
\url{https://github.com/mrflogs/ICLR24}.Comment: Accepted by ICLR 202
Adaptive motion artefact reduction in respiration and ECG signals for wearable healthcare monitoring systems
Wearable healthcare monitoring systems (WHMSs) have received significant interest from both academia and industry with the advantage of non-intrusive and ambulatory monitoring. The aim of this paper is to investigate the use of an adaptive filter to reduce motion artefact (MA) in physiological signals acquired by WHMSs. In our study, a WHMS is used to acquire ECG, respiration and triaxial accelerometer (ACC) signals during incremental treadmill and cycle ergometry exercises. With these signals, performances of adaptive MA cancellation are evaluated in both respiration and ECG signals. To achieve effective and robust MA cancellation, three axial outputs of the ACC are employed to estimate the MA by a bank of gradient adaptive Laguerre lattice (GALL) filter, and the outputs of the GALL filters are further combined with time-varying weights determined by a Kalman filter. The results show that for the respiratory signals, MA component can be reduced and signal quality can be improved effectively (the power ratio between the MA-corrupted respiratory signal and the adaptive filtered signal was 1.31 in running condition, and the corresponding signal quality was improved from 0.77 to 0.96). Combination of the GALL and Kalman filters can achieve robust MA cancellation without supervised selection of the reference axis from the ACC. For ECG, the MA component can also be reduced by adaptive filtering. The signal quality, however, could not be improved substantially just by the adaptive filter with the ACC outputs as the reference signals.Municipal Science & Technology Commission. Beijing Natural Science Foundation (Grants 3102028 and 3122034)General Logistics Science Foundation (Grant CWS11C108)National Institutes of Health (U.S.) (National Institute of General Medical Sciences (U.S.). Grant R01- EB001659)National Institutes of Health (U.S.) (National Institute for Biomedical Imaging and Bioengineering (U.S.) Cooperative Agreement U01- EB-008577
Research Hotspot and Trend Analysis of China’s Elderlyoriented Smart Products
The Chinese government attaches great importance to the current situation of population aging, so it has introduced relevant aging policies. The combination of new technologies has played a positive role in the development of Elderly-oriented smart products of enterprises. Based on the research literature on Elderly-oriented smart products collected in CNKI database in recent ten years (2012-2022), this paper makes a quantitative analysis on the research results of Elderly-oriented smart products in China with the help of CiteSpace visual analysis software. Through research hotspots and evolution trends, it is found that the theme can be extended: the upgrading and construction of Elderly-oriented smart products will be a hot research topic in the academic community in the future
- …