14 research outputs found

    Efficient Personalized Federated Learning via Sparse Model-Adaptation

    Full text link
    Federated Learning (FL) aims to train machine learning models for multiple clients without sharing their own private data. Due to the heterogeneity of clients' local data distribution, recent studies explore the personalized FL that learns and deploys distinct local models with the help of auxiliary global models. However, the clients can be heterogeneous in terms of not only local data distribution, but also their computation and communication resources. The capacity and efficiency of personalized models are restricted by the lowest-resource clients, leading to sub-optimal performance and limited practicality of personalized FL. To overcome these challenges, we propose a novel approach named pFedGate for efficient personalized FL by adaptively and efficiently learning sparse local models. With a lightweight trainable gating layer, pFedGate enables clients to reach their full potential in model capacity by generating different sparse models accounting for both the heterogeneous data distributions and resource constraints. Meanwhile, the computation and communication efficiency are both improved thanks to the adaptability between the model sparsity and clients' resources. Further, we theoretically show that the proposed pFedGate has superior complexity with guaranteed convergence and generalization error. Extensive experiments show that pFedGate achieves superior global accuracy, individual accuracy and efficiency simultaneously over state-of-the-art methods. We also demonstrate that pFedGate performs better than competitors in the novel clients participation and partial clients participation scenarios, and can learn meaningful sparse local models adapted to different data distributions.Comment: Accepted to ICML 202

    Revisiting Personalized Federated Learning: Robustness Against Backdoor Attacks

    Full text link
    In this work, besides improving prediction accuracy, we study whether personalization could bring robustness benefits to backdoor attacks. We conduct the first study of backdoor attacks in the pFL framework, testing 4 widely used backdoor attacks against 6 pFL methods on benchmark datasets FEMNIST and CIFAR-10, a total of 600 experiments. The study shows that pFL methods with partial model-sharing can significantly boost robustness against backdoor attacks. In contrast, pFL methods with full model-sharing do not show robustness. To analyze the reasons for varying robustness performances, we provide comprehensive ablation studies on different pFL methods. Based on our findings, we further propose a lightweight defense method, Simple-Tuning, which empirically improves defense performance against backdoor attacks. We believe that our work could provide both guidance for pFL application in terms of its robustness and offer valuable insights to design more robust FL methods in the future. We open-source our code to establish the first benchmark for black-box backdoor attacks in pFL: https://github.com/alibaba/FederatedScope/tree/backdoor-bench.Comment: KDD 202

    Matching in Selective and Balanced Representation Space for Treatment Effects Estimation

    Full text link
    The dramatically growing availability of observational data is being witnessed in various domains of science and technology, which facilitates the study of causal inference. However, estimating treatment effects from observational data is faced with two major challenges, missing counterfactual outcomes and treatment selection bias. Matching methods are among the most widely used and fundamental approaches to estimating treatment effects, but existing matching methods have poor performance when facing data with high dimensional and complicated variables. We propose a feature selection representation matching (FSRM) method based on deep representation learning and matching, which maps the original covariate space into a selective, nonlinear, and balanced representation space, and then conducts matching in the learned representation space. FSRM adopts deep feature selection to minimize the influence of irrelevant variables for estimating treatment effects and incorporates a regularizer based on the Wasserstein distance to learn balanced representations. We evaluate the performance of our FSRM method on three datasets, and the results demonstrate superiority over the state-of-the-art methods.Comment: Proceedings of the 29th ACM International Conference on Information and Knowledge Management (CIKM '20

    DA-FSOD: A Novel Data Augmentation Scheme for Few-Shot Object Detection

    No full text
    Deep learning techniques continue to be used in various applications in recent years. However, when it is difficult to obtain adequate training samples, the performance of the depth model will degrade. Although few-shot learning and data enhancement techniques can relieve this dilemma, the diversity of real data is too large to simulate. To tackle this challenge, we study a novel method, Data Augmentation Scheme For Few-Shot Object Detection (DA-FSOD), to improve the efficiency of model training on visual tasks. Specifically, to expand data augmentation space, we build a data augmentation operation pool (DAOP) based on several common-applied image process operations. Then we propose a novel data augmentation scheme, the series and parallel connection scheme, which superimposes the effects of different operations to generate diverse variants. To further explore and utilize the deep feature information, we leverage the semantic information of input image in model and propose imposed semantic data augmentation which augments training set semantically via deep features of augmented variants. The proposed method successfully enhanced the model performance. We validated our approach using extensive experiments on the domain of few-shot object detection. The results showed remarkable gains compared to state-of-the-art methods

    FederatedScope-GNN: Towards a Unified, Comprehensive and Efficient Package for Federated Graph Learning

    Full text link
    The incredible development of federated learning (FL) has benefited various tasks in the domains of computer vision and natural language processing, and the existing frameworks such as TFF and FATE has made the deployment easy in real-world applications. However, federated graph learning (FGL), even though graph data are prevalent, has not been well supported due to its unique characteristics and requirements. The lack of FGL-related framework increases the efforts for accomplishing reproducible research and deploying in real-world applications. Motivated by such strong demand, in this paper, we first discuss the challenges in creating an easy-to-use FGL package and accordingly present our implemented package FederatedScope-GNN (FS-G), which provides (1) a unified view for modularizing and expressing FGL algorithms; (2) comprehensive DataZoo and ModelZoo for out-of-the-box FGL capability; (3) an efficient model auto-tuning component; and (4) off-the-shelf privacy attack and defense abilities. We validate the effectiveness of FS-G by conducting extensive experiments, which simultaneously gains many valuable insights about FGL for the community. Moreover, we employ FS-G to serve the FGL application in real-world E-commerce scenarios, where the attained improvements indicate great potential business benefits. We publicly release FS-G, as submodules of FederatedScope, at https://github.com/alibaba/FederatedScope to promote FGL's research and enable broad applications that would otherwise be infeasible due to the lack of a dedicated package.Comment: Accpeted by KDD'2022; We have released FederatedScope for users on https://github.com/alibaba/FederatedScop

    Analysis of Air Purification Methods in Operating Rooms of Chinese Hospitals

    No full text
    This research demonstrates the current use of air purification methods in the operating rooms (ORs) in China. 154 hospitals from 6 provinces were included in this survey to reflect the air purification methods of ORs in 2017. Air cleaning technology (ACT) is used in 124 (80.52%) hospitals. We find that the rates of using grade I, III, or IV clean operating room (COR) in tertiary hospitals are all higher than in lower level hospitals; the rate of using ACT in the ORs is higher, too. In addition, general hospitals have higher rate in using ACT in the ORs than specialized hospitals. The highest rate of using ACT in the ORs is in the eastern region of China. The number of hospitals using ACT, ultraviolet light disinfection, and air sterilizers (such as circulating air UV sterilizer) increased yearly. All grades of CORs can be maintained as required by more than 90% hospitals except grade II COR. In this research, we found air purification methods, especially the ACT, are widely used in hospitals’ ORs. However, finding the way to select and use different air purification methods correctly is an urgent problem to be solved next

    Characteristics and potential exposure risks of environmentally persistent free radicals in PM2.5 in the three gorges reservoir area, Southwestern China

    No full text
    Environmentally persistent free radicals (EPFRs) are a novel class of hazardous substances that can exist stably in airborne particles for a period ranging from days to weeks and are potentially toxic to human health. Electron paramagnetic resonance spectroscopy (EPR) was used to characterize particulate EPFRs in Wanzhou in the Three Gorges Reservoir area in 2017. During the whole of 2017, the average concentration of particulate EPFRs was 7.0 x 10(13) +/- 1.7 x 10(13) spins/m(3). The seasonal concentration of EPFRs in PM2.5 showed a trend of autumn > winter > spring > summer. The maxima and minima of EPFRs occurred in spring with concentrations of 2.1 x 10(14) spins/m(3 )and 9.4 x 10(12) spins/m(3)respectively. The EPFRs in PM2.5 were mainly carbon-centered radicals with adjacent oxygen atoms. Significant positive correlations were found between EPFRs and SO42-, NO3- and NH4+ (r> 0.55, n = 111), indicating that EPFRs are associated with secondary sources. The atmospheric processing of particles from coal combustion, traffic, and agriculture were important sources of EPFRs. They were also particularly well correlated with K+ and Cl- in winter, suggesting that EPFRs may also be derived from wintertime biomass burning emissions. The amount of inhalable EPFRs in Wanzhou was equivalent to the range of 2.3-6.8 cigarettes per capita per day. This study provides evidence of the potential health risks of EPFRs in PM2.5, and references for air pollution control in the Three Gorges Reservoir area. (C) 2020 Elsevier Ltd. All rights reserved
    corecore