Search CORE

15 research outputs found

Robust and IP-Protecting Vertical Federated Learning against Unexpected Quitting of Parties

Author: Amirshahi Alireza
Atienza David
Baghersalimi Saleh
Chen Yiran
Dai Anna
Du Zhixu
Sun Jingwei
Publication venue
Publication date: 28/03/2023
Field of study

Vertical federated learning (VFL) enables a service provider (i.e., active party) who owns labeled features to collaborate with passive parties who possess auxiliary features to improve model performance. Existing VFL approaches, however, have two major vulnerabilities when passive parties unexpectedly quit in the deployment phase of VFL - severe performance degradation and intellectual property (IP) leakage of the active party's labels. In this paper, we propose \textbf{Party-wise Dropout} to improve the VFL model's robustness against the unexpected exit of passive parties and a defense method called \textbf{DIMIP} to protect the active party's IP in the deployment phase. We evaluate our proposed methods on multiple datasets against different inference attacks. The results show that Party-wise Dropout effectively maintains model performance after the passive party quits, and DIMIP successfully disguises label information from the passive party's feature extractor, thereby mitigating IP leakage

arXiv.org e-Print Archive

Rethinking Normalization Methods in Federated Learning

Author: Chen Pin-Yu
Chen Yiran
Du Zhixu
Li Ang
Li Hai "Helen"
Sun Jingwei
Zhang Jianyi
Publication venue
Publication date: 06/10/2022
Field of study

Federated learning (FL) is a popular distributed learning framework that can reduce privacy risks by not explicitly sharing private data. In this work, we explicitly uncover external covariate shift problem in FL, which is caused by the independent local training processes on different devices. We demonstrate that external covariate shifts will lead to the obliteration of some devices' contributions to the global model. Further, we show that normalization layers are indispensable in FL since their inherited properties can alleviate the problem of obliterating some devices' contributions. However, recent works have shown that batch normalization, which is one of the standard components in many deep neural networks, will incur accuracy drop of the global model in FL. The essential reason for the failure of batch normalization in FL is poorly studied. We unveil that external covariate shift is the key reason why batch normalization is ineffective in FL. We also show that layer normalization is a better choice in FL which can mitigate the external covariate shift and improve the performance of the global model. We conduct experiments on CIFAR10 under non-IID settings. The results demonstrate that models with layer normalization converge fastest and achieve the best or comparable accuracy for three different model architectures.Comment: Submitted to DistributedML'22 worksho

arXiv.org e-Print Archive

SiDA: Sparsity-Inspired Data-Aware Serving for Efficient and Scalable Large Mixture-of-Experts Models

Author: Chen Yiran
Du Zhixu
Jiang Xiangyu
Li Ang
Li Hai "Helen"
Li Shiyu
Sun Jingwei
Wu Yongkai
Wu Yuhao
Zheng Qilin
Publication venue
Publication date: 28/10/2023
Field of study

Mixture-of-Experts (MoE) has emerged as a favorable architecture in the era of large models due to its inherent advantage, i.e., enlarging model capacity without incurring notable computational overhead. Yet, the realization of such benefits often results in ineffective GPU memory utilization, as large portions of the model parameters remain dormant during inference. Moreover, the memory demands of large models consistently outpace the memory capacity of contemporary GPUs. Addressing this, we introduce SiDA (Sparsity-inspired Data-Aware), an efficient inference approach tailored for large MoE models. SiDA judiciously exploits both the system's main memory, which is now abundant and readily scalable, and GPU memory by capitalizing on the inherent sparsity on expert activation in MoE models. By adopting a data-aware perspective, SiDA achieves enhanced model efficiency with a neglectable performance drop. Specifically, SiDA attains a remarkable speedup in MoE inference with up to 3.93X throughput increasing, up to 75% latency reduction, and up to 80% GPU memory saving with down to 1% performance drop. This work paves the way for scalable and efficient deployment of large MoE models, even in memory-constrained systems

arXiv.org e-Print Archive

CoRE: A context-aware relation extraction method for relation completion

Author: Du Xiaoyong
Li Zhixu
Sharaf Mohamed
Sitbon Laurianne
Zhou Xiaofang
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 27/08/2013
Field of study

We identify relation completion (RC) as one recurring problem that is central to the success of novel big data applications such as Entity Reconstruction and Data Enrichment. Given a semantic relation, RC attempts at linking entity pairs between two entity lists under the relation. To accomplish the RC goals, we propose to formulate search queries for each query entity Î± based on some auxiliary information, so that to detect its target entity Î² from the set of retrieved documents. For instance, a pattern-based method (PaRE) uses extracted patterns as the auxiliary information in formulating search queries. However, high-quality patterns may decrease the probability of finding suitable target entities. As an alternative, we propose CoRE method that uses context terms learned surrounding the expression of a relation as the auxiliary information in formulating queries. The experimental results based on several real-world web data collections demonstrate that CoRE reaches a much higher accuracy than PaRE for the purpose of RC

Crossref

Queensland University of Technology ePrints Archive

University of Queensland eSpace

IRank: A term-based innovation ranking system for conferences and scholars

Author: Du Xiaoyong
He Jun
Li Zhixu
Liu Hongyan
Zhou Xiaofang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2009
Field of study

Since the proposition of Journal Impact Factor [1] in 1963, the classical citation-based ranking scheme has been a standard criterion to rank journals and conferences. However, the reference of a paper cannot list all relevant publications and the citation relationships are not always available especially when related to copyright problem. Besides, we cannot evaluate a newly published paper before it is cited by others. Therefore, we propose an alternative method, term-based evaluation scheme which can evaluate publications by terms they use. Then we can rank conferences, journals and scholars accordingly. We think this term-based ranking scheme can be used to evaluate innovation quality for conferences and scholars. To evaluate our scheme and to facilitate its application, we develop an innovation ranking system called IRank to rank conferences and authors in the field of Database Systems. The performance of IRank demonstrates the effectiveness of our scheme

University of Queensland eSpace

Approximate membership localization (AML) for web-based join

Author: Du Xiaoyong
Li Zhixu
Sitbon Laurianne
Wang Liwei
Zhou Xiaofang
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2010
Field of study

In this paper, we propose a search-based approach to join two tables in the absence of clean join attributes. Non-structured documents from the web are used to express the correlations between a given query and a reference list. To implement this approach, a major challenge we meet is how to efficiently determine the number of times and the locations of each clean reference from the reference list that is approximately mentioned in the retrieved documents. We formalize the Approximate Membership Localization (AML) problem and propose an efficient partial pruning algorithm to solve it. A study using real-word data sets demonstrates the effectiveness of our search-based approach, and the efficiency of our AML algorithm

Crossref

Queensland University of Technology ePrints Archive

AML: efficient Approximate Membership Localization within a web-based join framework

Author: Du Xiaoyong
Li Zhixu
Sitbon Laurianne
Wang Liwei
Zhou Xiaofang
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2013
Field of study

In this paper, we propose a new type of Dictionary-based Entity Recognition Problem, named Approximate Membership Localization (AML). The popular Approximate Membership Extraction (AME) provides a full coverage to the true matched substrings from a given document, but many redundancies cause a low efficiency of the AME process and deteriorate the performance of real-world applications using the extracted substrings. The AML problem targets at locating nonoverlapped substrings which is a better approximation to the true matched substrings without generating overlapped redundancies. In order to perform AML efficiently, we propose the optimized algorithm P-Prune that prunes a large part of overlapped redundant matched substrings before generating them. Our study using several real-word data sets demonstrates the efficiency of P-Prune over a baseline method. We also study the AML in application to a proposed web-based join framework scenario which is a search-based approach joining two tables using dictionary-based entity recognition from web documents. The results not only prove the advantage of AML over AME, but also demonstrate the effectiveness of our search-based approach

Crossref

Queensland University of Technology ePrints Archive

University of Queensland eSpace

AML: Efficient approximate membership localization within a web-based join framework

Author: Du Xiaoyong
Li Zhixu
Sitbon Laurianne
Wang Liwei
Zhou Xiaofang
Publication venue: IEEE
Publication date
Field of study

Synthesis, Crystal Structure, Absolute Configuration and Antitumor Activity of the Enantiomers of 5-Bromo-2-chloro-N-(1-phenylethyl)pyridine-3-sulfonamide

Author: Changshan Sun
Lei Du
Linwei Li
Ning Yan
Tiemin Sun
Zhixu Zhou
Publication venue: MDPI AG
Publication date: 01/11/2015
Field of study

Pyridinesulfonamide is an important fragment which has a wide range of applications in novel drugs. R- and S-isomers of 5-bromo-2-chloro-N-(1-phenylethyl)pyridine-3-sulfonamide have been synthesized, and the stereostructures have been researched. Single crystals of both compounds were obtained for X-ray analysis, and the absolute configurations (ACs) have been further confirmed by electronic circular dichroism (ECD), optical rotation (OR) and quantum chemical calculations. The crystal structures and calculated geometries were extremely similar, which permitted a comparison of the relative reliabilities of ACs obtained by ECD analyses and theoretical simulation. In addition, the effect of stereochemistry on the PI3Kα kinase and anticancer activity were investigated. Compounds 10a and 10b inhibit the activity of PI3Kα kinase with IC50 values of 1.08 and 2.69 μM, respectively. Furthermore, molecular docking was performed to analyze the binding modes of R- and S-isomers

Multidisciplinary Digital Publishing Institute

Directory of Open Access Journals

Correlation between Aspartate Aminotransferase/Alanine Aminotransferase and Prognosis of Hemophagocytic Lymphohistiocytosis in Children

Author: SHI Xiaoqi LUO Nandu, HUANG Jiaojiao, DU Zuochen, HUANG Pei, CAO Xiuli, CHEN Yan, HE Zhixu
Publication venue: Chinese General Practice Publishing House Co., Ltd
Publication date: 01/10/2023
Field of study

Background Aspartate aminotransferase (AST) /alanine aminotransferase (ALT) is a novel indicator to evaluate the prognosis of acute critical illness in recent years. At present, AST/ALT has only been reported to evaluate the prognosis of hemophagocytic lymphohistiocytosis (HLH) in adults, while HLH in children has not been studied. Objective To explore the relationship between AST/ALT and clinical characteristics and its prognostic significance in children with HLH, so as to provide a theoretical basis for early clinical recognition and diagnosis of HLH in children. Methods A total of 128 hospitalized children diagnosed with HLH in the Affiliated Hospital of Zunyi Medical University from January 2013 to May 2022 were selected as the research objects, and the baseline data of children were collected through the electronic medical record system. The children were divided into the T1 group (AST/ALT≤1.57, n=43), T2 group (1.57<AST/ALT<3.22, n=42), and T3 group (AST/ALT≥3.22, n=43) according to the AST/ALT quantiles, and followed up by outpatient review and telephone follow-up once every 6 months from the time of discharge to 2022-06-01, with the termination event of death or loss of follow-up. Spearman rank correlation analysis was used to explore the correlation between AST/ALT and laboratory test results. The receiver operating characteristic (ROC) curve of laboratory indicators for predicting death in children with HLHwas plotted, the area under ROC curve (AUC) and optimal cut-off value were calculated. Kaplan-Meier method was used to plot survival curves to analyze the effect of different AST/ALT groupings on overall survival, and Log-rank test was used for comparison. Cox proportional risk model was used to explore the influencing factors of death in children with HLH. Results There were statistically significant differences in gender, PICU admission, treatment methods, incidence of respiratory failure and shock among the 3 groups (P<0.05). Lactate dehydrogenase, creatine kinase isoenzyme, serum ferritin and activated partial thromboplastin time in the T3 group were higher than those in the T1 and T2 groups, while the levels of albumin and fibrinogen in the T3 group were lower than those in the T1 and T2 groups (P<0.05). Na+ level in the T2 and T3 groups was lower than that in the T1 group, while C-reactive protein level was higher than that in the T1 group (P<0.05). Correlation analysis showed that AST/ALT was positively correlated with absolute neutrophil count (rs=0.182, P=0.040), C-reactive protein (rs=0.419, P<0.001), total bilirubin (rs=0.182, P=0.040), creatine kinase isoenzyme (rs=0.310, P<0.001), lactate dehydrogenase (rs=0.474, P<0.001), activated partial thromboplastin time (rs=0.316, P<0.001), serum ferritin (rs=0.311, P<0.001), and negatively correlated with albumin (rs=-0.352, P<0.001), fibrinogen (rs=-0.179, P=0.043), Ca2+ (rs=-0.259, P=0.003), Na+ (rs=-0.244, P=0.006). ROC curve results showed that the AUCs of C-reactive protein, lactate dehydrogenase, activated partial thromboplastin time, serum ferritin and fibrinogen were 0.560〔95%CI (0.451, 0.669) 〕, 0.666〔95%CI (0.560, 0.772) 〕, 0.605〔95%CI (0.499, 0.710) 〕, 0.724〔95%CI (0.626, 0.822) 〕, 0.648〔95%CI (0.551, 0.745) 〕 and 0.715〔95%CI (0.624, 0.807) 〕, respectively, with the optimal cutoff values of 82.08 mg/L, 40.5 U/L, 927.5 U/L, 53.95 s, 1 897 μg/L, and 1.45 g/L, respectively. The mortality rate in the T1, T2 and T3 groups was 14.0% (6/43), 33.3% (14/42) and 44.2% (19/43), respectively, with statistically significant differences (χ2=9.518, P=0.009). Multivariate Cox proportional hazard regression analysis showed that shock〔HR=4.24, 95%CI (2.09, 8.61), P<0.001〕, activated partial thromboplastin time ≥53.95 s〔HR=2.44, 95%CI (1.24, 4.81), P=0.010〕and serum ferritin ≥1 897 μg/L〔HR=3.05, 95%CI (1.02, 9.09), P=0.046〕were the risk factors for death in children. Conclusion HLH patients in children with higher AST/ALT have higher incidence of poor prognosis, shorter overall survival, and worse prognosis

Directory of Open Access Journals