58 research outputs found
Adaptive Test-Time Personalization for Federated Learning
Personalized federated learning algorithms have shown promising results in
adapting models to various distribution shifts. However, most of these methods
require labeled data on testing clients for personalization, which is usually
unavailable in real-world scenarios. In this paper, we introduce a novel
setting called test-time personalized federated learning (TTPFL), where clients
locally adapt a global model in an unsupervised way without relying on any
labeled data during test-time. While traditional test-time adaptation (TTA) can
be used in this scenario, most of them inherently assume training data come
from a single domain, while they come from multiple clients (source domains)
with different distributions. Overlooking these domain interrelationships can
result in suboptimal generalization. Moreover, most TTA algorithms are designed
for a specific kind of distribution shift and lack the flexibility to handle
multiple kinds of distribution shifts in FL. In this paper, we find that this
lack of flexibility partially results from their pre-defining which modules to
adapt in the model. To tackle this challenge, we propose a novel algorithm
called ATP to adaptively learns the adaptation rates for each module in the
model from distribution shifts among source domains. Theoretical analysis
proves the strong generalization of ATP. Extensive experiments demonstrate its
superiority in handling various distribution shifts including label shift,
image corruptions, and domain shift, outperforming existing TTA methods across
multiple datasets and model architectures. Our code is available at
https://github.com/baowenxuan/ATP .Comment: Accepted by NeurIPS 202
Privacy-Preserving Graph Machine Learning from Data to Computation: A Survey
In graph machine learning, data collection, sharing, and analysis often
involve multiple parties, each of which may require varying levels of data
security and privacy. To this end, preserving privacy is of great importance in
protecting sensitive information. In the era of big data, the relationships
among data entities have become unprecedentedly complex, and more applications
utilize advanced data structures (i.e., graphs) that can support network
structures and relevant attribute information. To date, many graph-based AI
models have been proposed (e.g., graph neural networks) for various domain
tasks, like computer vision and natural language processing. In this paper, we
focus on reviewing privacy-preserving techniques of graph machine learning. We
systematically review related works from the data to the computational aspects.
We first review methods for generating privacy-preserving graph data. Then we
describe methods for transmitting privacy-preserved information (e.g., graph
model parameters) to realize the optimization-based computation when data
sharing among multiple parties is risky or impossible. In addition to
discussing relevant theoretical methodology and software tools, we also discuss
current challenges and highlight several possible future research opportunities
for privacy-preserving graph machine learning. Finally, we envision a unified
and comprehensive secure graph machine learning system.Comment: Accepted by SIGKDD Explorations 2023, Volume 25, Issue
DP-Mix: Mixup-based Data Augmentation for Differentially Private Learning
Data augmentation techniques, such as simple image transformations and
combinations, are highly effective at improving the generalization of computer
vision models, especially when training data is limited. However, such
techniques are fundamentally incompatible with differentially private learning
approaches, due to the latter's built-in assumption that each training image's
contribution to the learned model is bounded. In this paper, we investigate why
naive applications of multi-sample data augmentation techniques, such as mixup,
fail to achieve good performance and propose two novel data augmentation
techniques specifically designed for the constraints of differentially private
learning. Our first technique, DP-Mix_Self, achieves SoTA classification
performance across a range of datasets and settings by performing mixup on
self-augmented data. Our second technique, DP-Mix_Diff, further improves
performance by incorporating synthetic data from a pre-trained diffusion model
into the mixup process. We open-source the code at
https://github.com/wenxuan-Bao/DP-Mix.Comment: 17 pages, 2 figures, to be published in Neural Information Processing
Systems 202
FedSelect: Personalized Federated Learning with Customized Selection of Parameters for Fine-Tuning
Standard federated learning approaches suffer when client data distributions
have sufficient heterogeneity. Recent methods addressed the client data
heterogeneity issue via personalized federated learning (PFL) - a class of FL
algorithms aiming to personalize learned global knowledge to better suit the
clients' local data distributions. Existing PFL methods usually decouple global
updates in deep neural networks by performing personalization on particular
layers (i.e. classifier heads) and global aggregation for the rest of the
network. However, preselecting network layers for personalization may result in
suboptimal storage of global knowledge. In this work, we propose FedSelect, a
novel PFL algorithm inspired by the iterative subnetwork discovery procedure
used for the Lottery Ticket Hypothesis. FedSelect incrementally expands
subnetworks to personalize client parameters, concurrently conducting global
aggregations on the remaining parameters. This approach enables the
personalization of both client parameters and subnetwork structure during the
training process. Finally, we show that FedSelect outperforms recent
state-of-the-art PFL algorithms under challenging client data heterogeneity
settings and demonstrates robustness to various real-world distributional
shifts. Our code is available at https://github.com/lapisrocks/fedselect.Comment: Published in CVPR 202
Seizing the window of opportunity to mitigate the impact of climate change on the health of Chinese residents
The health threats posed by climate change in China are increasing rapidly. Each province faces different health risks. Without a timely and adequate response, climate change will impact lives and livelihoods at an accelerated rate and even prevent the achievement of the Healthy and Beautiful China initiatives. The 2021 China Report of the Lancet Countdown on Health and Climate Change is the first annual update of China’s Report of the Lancet Countdown. It comprehensively assesses the impact of climate change on the health of Chinese households and the measures China has taken. Invited by the Lancet committee, Tsinghua University led the writing of the report and cooperated with 25 relevant institutions in and outside of China. The report includes 25 indicators within five major areas (climate change impacts, exposures, and vulnerability; adaptation, planning, and resilience for health; mitigation actions and health co-benefits; economics and finance; and public and political engagement) and a policy brief. This 2021 China policy brief contains the most urgent and relevant indicators focusing on provincial data: The increasing health risks of climate change in China; mixed progress in responding to climate change. In 2020, the heatwave exposures per person in China increased by 4.51 d compared with the 1986–2005 average, resulting in an estimated 92% increase in heatwave-related deaths. The resulting economic cost of the estimated 14500 heatwave-related deaths in 2020 is US$176 million. Increased temperatures also caused a potential 31.5 billion h in lost work time in 2020, which is equivalent to 1.3% of the work hours of the total national workforce, with resulting economic losses estimated at 1.4% of China’s annual gross domestic product. For adaptation efforts, there has been steady progress in local adaptation planning and assessment in 2020, urban green space growth in 2020, and health emergency management in 2019. 12 of 30 provinces reported that they have completed, or were developing, provincial health adaptation plans. Urban green space, which is an important heat adaptation measure, has increased in 18 of 31 provinces in the past decade, and the capacity of China’s health emergency management increased in almost all provinces from 2018 to 2019. As a result of China’s persistent efforts to clean its energy structure and control air pollution, the premature deaths due to exposure to ambient particulate matter of 2.5 μm or less (PM2.5) and the resulting costs continue to decline. However, 98% of China’s cities still have annual average PM2.5 concentrations that are more than the WHO guideline standard of 10 μg/m3. It provides policymakers and the public with up-to-date information on China’s response to climate change and improvements in health outcomes and makes the following policy recommendations. (1) Promote systematic thinking in the related departments and strengthen multi-departmental cooperation. Sectors related to climate and development in China should incorporate health perspectives into their policymaking and actions, demonstrating WHO’s and President Xi Jinping’s so-called health-in-all-policies principle. (2) Include clear goals and timelines for climate-related health impact assessments and health adaptation plans at both the national and the regional levels in the National Climate Change Adaptation Strategy for 2035. (3) Strengthen China’s climate mitigation actions and ensure that health is included in China’s pathway to carbon neutrality. By promoting investments in zero-carbon technologies and reducing fossil fuel subsidies, the current rebounding trend in carbon emissions will be reversed and lead to a healthy, low-carbon future. (4) Increase awareness of the linkages between climate change and health at all levels. Health professionals, the academic community, and traditional and new media should raise the awareness of the public and policymakers on the important linkages between climate change and health.</p
Open X-Embodiment:Robotic learning datasets and RT-X models
Large, high-capacity models trained on diverse datasets have shown remarkable successes on efficiently tackling downstream applications. In domains from NLP to Computer Vision, this has led to a consolidation of pretrained models, with general pretrained backbones serving as a starting point for many applications. Can such a consolidation happen in robotics? Conventionally, robotic learning methods train a separate model for every application, every robot, and even every environment. Can we instead train "generalist" X-robot policy that can be adapted efficiently to new robots, tasks, and environments? In this paper, we provide datasets in standardized data formats and models to make it possible to explore this possibility in the context of robotic manipulation, alongside experimental results that provide an example of effective X-robot policies. We assemble a dataset from 22 different robots collected through a collaboration between 21 institutions, demonstrating 527 skills (160266 tasks). We show that a high-capacity model trained on this data, which we call RT-X, exhibits positive transfer and improves the capabilities of multiple robots by leveraging experience from other platforms. The project website is robotics-transformer-x.github.io
On the Importance of Architecture and Feature Selection in Differentially Private Machine Learning
We study a pitfall in the typical workflow for differentially private machine
learning. The use of differentially private learning algorithms in a "drop-in"
fashion -- without accounting for the impact of differential privacy (DP) noise
when choosing what feature engineering operations to use, what features to
select, or what neural network architecture to use -- yields overly complex and
poorly performing models. In other words, by anticipating the impact of DP
noise, a simpler and more accurate alternative model could have been trained
for the same privacy guarantee. We systematically study this phenomenon through
theory and experiments. On the theory front, we provide an explanatory
framework and prove that the phenomenon arises naturally from the addition of
noise to satisfy differential privacy. On the experimental front, we
demonstrate how the phenomenon manifests in practice using various datasets,
types of models, tasks, and neural network architectures. We also analyze the
factors that contribute to the problem and distill our experimental insights
into concrete takeaways that practitioners can follow when training models with
differential privacy. Finally, we propose privacy-aware algorithms for feature
selection and neural network architecture search. We analyze their differential
privacy properties and evaluate them empirically
An Investigation of the Experiences of Working with Multilingual International Students among Local Students and Faculty Members in Chinese Universities
In recent years, as a response to the internationalization of higher education worldwide, China has begun to enroll international students to study at the tertiary level on an increasingly large scale. While the majority of the programs and courses are open to international students via Chinese as Chinese-medium instruction (CMI), there are also an increasing number of programs and courses delivered through English-medium instruction (EMI). In order to understand higher education multilingual contexts, this qualitative study examines how local students and faculty members make sense of their engagement with international students in three Chinese universities. In the study, we conducted in-depth interviews with 11 academics who worked with international students as project supervisors and 25 Chinese university students regarding their experiences of working with international students. The findings that emerged from the thematic analysis revealed that international students’ learning engagement was profoundly mediated by language barriers, cultural assumptions and the academic conventions in host institutions. The study revealed that Chinese academics are concerned about international students’ learning attitudes, their academic progress and a lack of participation due to their language ability. Local Chinese students also reported a lack of satisfaction in working with international students. Some of the local students felt that some international students may have been enabled to enroll in the academic programs as a result of national and university policies, which has led to a ‘dumbing down’ of the curriculum offered in English. The findings indicate that more needs to be done to promote mutual exchanges and better understanding among international students, Chinese faculty members and local students
- …