Search CORE

60 research outputs found

A Homogenization Approach for Gradient-Dominated Stochastic Optimization

Author: Deng Qi
Ge Dongdong
Tan Jiyuan
Xue Chenyu
Ye Yinyu
Zhang Chuwen
Publication venue
Publication date: 21/08/2023
Field of study

Gradient dominance property is a condition weaker than strong convexity, yet it sufficiently ensures global convergence for first-order methods even in non-convex optimization. This property finds application in various machine learning domains, including matrix decomposition, linear neural networks, and policy-based reinforcement learning (RL). In this paper, we study the stochastic homogeneous second-order descent method (SHSODM) for gradient-dominated optimization with

\alpha \in [1, 2]

based on a recently proposed homogenization approach. Theoretically, we show that SHSODM achieves a sample complexity of

O(\epsilon^{-7/(2 \alpha) +1})

for

\alpha \in [1, 3/2)

and

\tilde{O}(\epsilon^{-2/\alpha})

for

\alpha \in [3/2, 2]

. We further provide a SHSODM with a variance reduction technique enjoying an improved sample complexity of

O( \epsilon ^{-( 7-3\alpha ) /( 2\alpha )})

for

\alpha \in [1,3/2)

. Our results match the state-of-the-art sample complexity bounds for stochastic gradient-dominated optimization without \emph{cubic regularization}. Since the homogenization approach only relies on solving extremal eigenvector problems instead of Newton-type systems, our methods gain the advantage of cheaper iterations and robustness in ill-conditioned problems. Numerical experiments on several RL tasks demonstrate the efficiency of SHSODM compared to other off-the-shelf methods

arXiv.org e-Print Archive

Towards Efficient and Certified Recovery from Poisoning Attacks in Federated Learning

Author: Jiang Yu
Lam Kwok-Yan
Liu Ziyao
Shen Jiyuan
Tan Chee Wei
Publication venue
Publication date: 19/01/2024
Field of study

Federated learning (FL) is vulnerable to poisoning attacks, where malicious clients manipulate their updates to affect the global model. Although various methods exist for detecting those clients in FL, identifying malicious clients requires sufficient model updates, and hence by the time malicious clients are detected, FL models have been already poisoned. Thus, a method is needed to recover an accurate global model after malicious clients are identified. Current recovery methods rely on (i) all historical information from participating FL clients and (ii) the initial model unaffected by the malicious clients, leading to a high demand for storage and computational resources. In this paper, we show that highly effective recovery can still be achieved based on (i) selective historical information rather than all historical information and (ii) a historical model that has not been significantly affected by malicious clients rather than the initial model. In this scenario, while maintaining comparable recovery performance, we can accelerate the recovery speed and decrease memory consumption. Following this concept, we introduce Crab, an efficient and certified recovery method, which relies on selective information storage and adaptive model rollback. Theoretically, we demonstrate that the difference between the global model recovered by Crab and the one recovered by train-from-scratch can be bounded under certain assumptions. Our empirical evaluation, conducted across three datasets over multiple machine learning models, and a variety of untargeted and targeted poisoning attacks reveals that Crab is both accurate and efficient, and consistently outperforms previous approaches in terms of both recovery speed and memory consumption

arXiv.org e-Print Archive

ASP: Automatic Selection of Proxy dataset for efficient AutoML

Author: Chen Bin
Jia Jiyuan
Liao Chao
Song Chengru
Tan Jianchao
Yao Peng
Zhang Di
Publication venue
Publication date: 17/10/2023
Field of study

Deep neural networks have gained great success due to the increasing amounts of data, and diverse effective neural network designs. However, it also brings a heavy computing burden as the amount of training data is proportional to the training time. In addition, a well-behaved model requires repeated trials of different structure designs and hyper-parameters, which may take a large amount of time even with state-of-the-art (SOTA) hyper-parameter optimization (HPO) algorithms and neural architecture search (NAS) algorithms. In this paper, we propose an Automatic Selection of Proxy dataset framework (ASP) aimed to dynamically find the informative proxy subsets of training data at each epoch, reducing the training data size as well as saving the AutoML processing time. We verify the effectiveness and generalization of ASP on CIFAR10, CIFAR100, ImageNet16-120, and ImageNet-1k, across various public model benchmarks. The experiment results show that ASP can obtain better results than other data selection methods at all selection ratios. ASP can also enable much more efficient AutoML processing with a speedup of 2x-20x while obtaining better architectures and better hyper-parameters compared to utilizing the entire dataset.Comment: This paper was actually finished in 202

arXiv.org e-Print Archive

Continuous and Discrete-Time Optimal Controls for an Isolated Signalized Intersection

Author: Haiyang Yu
Jiyuan Tan
Kaidi Yang
Li Wang
Na Xie
Xiangyun Shi
Zhengxi Li
Zhiheng Li
Publication venue: 'Hindawi Limited'
Publication date: 01/01/2017
Field of study

A classical control problem for an isolated oversaturated intersection is revisited with a focus on the optimal control policy to minimize total delay. The difference and connection between existing continuous-time planning models and recently proposed discrete-time planning models are studied. A gradient descent algorithm is proposed to convert the optimal control plan of the continuous-time model to the plan of the discrete-time model in many cases. Analytic proof and numerical tests for the algorithm are also presented. The findings shed light on the links between two kinds of models

Repository for Publications and Research Data

Crossref

Directory of Open Access Journals

Crystal structure of rhodopsin bound to arrestin by femtosecond X-ray laser.

Author: Barty Anton
Basu Shibom
Boutet Sébastien
Caffrey Martin
Caro Lydia N
Carragher Bridget
Chapman Henry N
Cherezov Vadim
Coe Jesse
Conrad Chelsie E
de Waal Parker W
Diederichs Kay
Dong Yuhui
Ernst Oliver P
Fromme Petra
Fromme Raimund
Gao Xiang
Gati Cornelius
Griffin Patrick R
Grotjohann Ingo
Gu Xin
Gurevich Vsevolod V
Han Gye Won
He Yuanzheng
Howe Nicole
Hubbell Wayne L
Ishchenko Andrii
James Daniel
Jiang Hualiang
Jiang Yi
Kang Yanyong
Katritch Vsevolod
Ke Jiyuan
Kupitz Christopher
Lee Regina J
Li Dianfan
Li Jun
Lisova Stella
Liu Haiguang
Liu Wei
Ma Jinming
Melcher Karsten
Messerschmidt Marc
Moeller Arne
Pal Kuntal
Pascal Bruce D
Potter Clinton S
Roy-Chowdhury Shatabdi
Spence John CH
Standfuss Jörg
Stevens Raymond C
Suino-Powell Kelly M
Tan MH Eileen
Tan Minjia
Van Eps Ned
Vishnivetskiy Sergey A
Wang Dingjie
Wang Meitian
Weierstall Uwe
West Graham M
White Thomas A
Williams Garth J
Xu H Eric
Xu Qingping
Yang Huaiyu
Yefanov Oleksandr
Zatsepin Nadia A
Zhang Chenghai
Zhao Yingming
Zheng Zhong
Zhi Xiaoyong
Zhou X Edward
Publication venue: eScholarship, University of California
Publication date: 01/01/2015
Field of study

G-protein-coupled receptors (GPCRs) signal primarily through G proteins or arrestins. Arrestin binding to GPCRs blocks G protein interaction and redirects signalling to numerous G-protein-independent pathways. Here we report the crystal structure of a constitutively active form of human rhodopsin bound to a pre-activated form of the mouse visual arrestin, determined by serial femtosecond X-ray laser crystallography. Together with extensive biochemical and mutagenesis data, the structure reveals an overall architecture of the rhodopsin-arrestin assembly in which rhodopsin uses distinct structural elements, including transmembrane helix 7 and helix 8, to recruit arrestin. Correspondingly, arrestin adopts the pre-activated conformation, with a ∼20° rotation between the amino and carboxy domains, which opens up a cleft in arrestin to accommodate a short helix formed by the second intracellular loop of rhodopsin. This structure provides a basis for understanding GPCR-mediated arrestin-biased signalling and demonstrates the power of X-ray lasers for advancing the frontiers of structural biology

DESY Publication Database

PubMed Central

eScholarship - University of California

DESY

The rapid and massive urban and industrial land expansions in China between 1990 and 2010: A CLUD-based analysis of their trajectories, patterns, and drivers

Author: Bierwagen
Budd
Chi Zhang
Deng
Deng
Fang
Grimm
Gu
Huang
Jiang
Jinwei Dong
Jiyuan Liu
Kaza
Kuang
Kuang
Kuang
Kuang
Kuang
Kuang
Li
Li
Liu
Liu
Liu
Liu
Liu
Liu
Liu
Lu
Matus
McDonald
Mitchell
Montgomery
National Bureau of Statistics of China
National Bureau of Statistics of China
National Bureau of Statistics of China
National Bureau of Statistics of China
National Bureau of Statistics of China
Normile
Parrish
Pickett
Schneider
Seto
Seto
Seto
Seto
Song
Tan
Taubenböck
Tian
Tian
United Nations
Wang
Wenfeng Chi
Weng
Weng
Wenhui Kuang
Wu
Wu
Xiang
Xu
Zeng
Zhang
Zhang
Zhang
Zhang
Zhou
Publication venue: 'Elsevier BV'
Publication date
Field of study

Crossref

Distinction of Scrambled Linear Block Codes Based on Extraction of Correlation Features

Author: Jiyuan Tan
Limin Zhang
Zhaogen Zhong
Publication venue: MDPI AG
Publication date: 01/11/2022
Field of study

Aiming to solve the problem of the distinction of scrambled linear block codes, a method for identifying the scrambling types of linear block codes by combining correlation features and convolution long short-term memory neural networks is proposed in this paper. First, the cross-correlation characteristics of the scrambling sequence symbols are deduced, the partial autocorrelation function is constructed, the superiority of the partial autocorrelation function is determined by derivation, and the two are combined as the input correlation characteristics. A shallow network combining a convolutional neural network and LSTM is constructed; finally, the linear block code scrambled dataset is input into the network model, and the training and recognition test of the network is completed. The simulation results show that, compared with the traditional algorithm based on a multi-fractal spectrum, the proposed method can identify a synchronous scrambler, and the recognition accuracy is higher under a high bit error rate. Moreover, the method is suitable for classification under noise. The proposed method lays a foundation for future improvements in scrambler parameter identification

Directory of Open Access Journals

A systematic literature review of the impact of gamification instruction on students’ problem-solving skills

Author: Feng Jiyuan
Tan Wee Hoe
Yu Bin
Publication venue: Institute of Advanced Engineering and Science
Publication date: 01/12/2024
Field of study

Gamification is a popular approach to teaching and learning in education in recent years, it provides the flexible integration of various game elements into educational activities to achieve instructional objectives. This approach has the potential to enhance student skills through interactive communication and motivation and has, sparked interest and discussion in many fields, including education. This review assesses the impact of gamification on student skills in terms of teaching methods, learning outcomes, and challenges faced. The study analyzed 21 research articles from three databases, Web of Science (WoS), Scopus, and ERIC, up to November 2023. The findings were grouped into three main themes: teaching implementation and methods (8 articles), learning outcomes and skill development (8 articles), and challenges and improvement strategies (5 articles). The findings suggest that although gamification has gained more results in the field of education, especially in terms of improving students’ skills and teaching effectiveness, there are shortcomings in integrating it with professional education. This review mainly highlights the impact of gamification in the development of students’ skills while considering gamification in conjunction with other aspects of education to illustrate the need for ongoing research and further exploration in the field of education

International Journal of Evaluation and Research in Education (IJERE)

Research on the Construction of a Knowledge Graph and Knowledge Reasoning Model in the Field of Urban Traffic

Author: Jiyuan Tan
Qianqian Qiu
Tingshuai Li
Weiwei Guo
Publication venue: 'MDPI AG'
Publication date: 15/03/2021
Field of study

The integration of multi-source transportation data is complex and insufficient in most of the big cities, which made it difficult for researchers to conduct in-depth data mining to improve the policy or the management. In order to solve this problem, a top-down approach is used to construct a knowledge graph of urban traffic system in this paper. First, the model layer of the knowledge graph was used to realize the reuse and sharing of knowledge. Furthermore, the model layer then was stored in the graph database Neo4j. Second, the representation learning based knowledge reasoning model was adopted to implement knowledge completion and improve the knowledge graph. Finally, the proposed method was validated with an urban traffic data set and the results showed that the model could be used to mine the implicit relationship between traffic entities and discover traffic knowledge effectively

Multidisciplinary Digital Publishing Institute