60 research outputs found
A Homogenization Approach for Gradient-Dominated Stochastic Optimization
Gradient dominance property is a condition weaker than strong convexity, yet
it sufficiently ensures global convergence for first-order methods even in
non-convex optimization. This property finds application in various machine
learning domains, including matrix decomposition, linear neural networks, and
policy-based reinforcement learning (RL). In this paper, we study the
stochastic homogeneous second-order descent method (SHSODM) for
gradient-dominated optimization with based on a recently
proposed homogenization approach. Theoretically, we show that SHSODM achieves a
sample complexity of for
and for . We further
provide a SHSODM with a variance reduction technique enjoying an improved
sample complexity of for . Our results match the state-of-the-art sample complexity bounds
for stochastic gradient-dominated optimization without \emph{cubic
regularization}. Since the homogenization approach only relies on solving
extremal eigenvector problems instead of Newton-type systems, our methods gain
the advantage of cheaper iterations and robustness in ill-conditioned problems.
Numerical experiments on several RL tasks demonstrate the efficiency of SHSODM
compared to other off-the-shelf methods
Towards Efficient and Certified Recovery from Poisoning Attacks in Federated Learning
Federated learning (FL) is vulnerable to poisoning attacks, where malicious
clients manipulate their updates to affect the global model. Although various
methods exist for detecting those clients in FL, identifying malicious clients
requires sufficient model updates, and hence by the time malicious clients are
detected, FL models have been already poisoned. Thus, a method is needed to
recover an accurate global model after malicious clients are identified.
Current recovery methods rely on (i) all historical information from
participating FL clients and (ii) the initial model unaffected by the malicious
clients, leading to a high demand for storage and computational resources. In
this paper, we show that highly effective recovery can still be achieved based
on (i) selective historical information rather than all historical information
and (ii) a historical model that has not been significantly affected by
malicious clients rather than the initial model. In this scenario, while
maintaining comparable recovery performance, we can accelerate the recovery
speed and decrease memory consumption. Following this concept, we introduce
Crab, an efficient and certified recovery method, which relies on selective
information storage and adaptive model rollback. Theoretically, we demonstrate
that the difference between the global model recovered by Crab and the one
recovered by train-from-scratch can be bounded under certain assumptions. Our
empirical evaluation, conducted across three datasets over multiple machine
learning models, and a variety of untargeted and targeted poisoning attacks
reveals that Crab is both accurate and efficient, and consistently outperforms
previous approaches in terms of both recovery speed and memory consumption
ASP: Automatic Selection of Proxy dataset for efficient AutoML
Deep neural networks have gained great success due to the increasing amounts
of data, and diverse effective neural network designs. However, it also brings
a heavy computing burden as the amount of training data is proportional to the
training time. In addition, a well-behaved model requires repeated trials of
different structure designs and hyper-parameters, which may take a large amount
of time even with state-of-the-art (SOTA) hyper-parameter optimization (HPO)
algorithms and neural architecture search (NAS) algorithms. In this paper, we
propose an Automatic Selection of Proxy dataset framework (ASP) aimed to
dynamically find the informative proxy subsets of training data at each epoch,
reducing the training data size as well as saving the AutoML processing time.
We verify the effectiveness and generalization of ASP on CIFAR10, CIFAR100,
ImageNet16-120, and ImageNet-1k, across various public model benchmarks. The
experiment results show that ASP can obtain better results than other data
selection methods at all selection ratios. ASP can also enable much more
efficient AutoML processing with a speedup of 2x-20x while obtaining better
architectures and better hyper-parameters compared to utilizing the entire
dataset.Comment: This paper was actually finished in 202
Continuous and Discrete-Time Optimal Controls for an Isolated Signalized Intersection
A classical control problem for an isolated oversaturated intersection is revisited with a focus on the optimal control policy to minimize total delay. The difference and connection between existing continuous-time planning models and recently proposed discrete-time planning models are studied. A gradient descent algorithm is proposed to convert the optimal control plan of the continuous-time model to the plan of the discrete-time model in many cases. Analytic proof and numerical tests for the algorithm are also presented. The findings shed light on the links between two kinds of models
Crystal structure of rhodopsin bound to arrestin by femtosecond X-ray laser.
G-protein-coupled receptors (GPCRs) signal primarily through G proteins or arrestins. Arrestin binding to GPCRs blocks G protein interaction and redirects signalling to numerous G-protein-independent pathways. Here we report the crystal structure of a constitutively active form of human rhodopsin bound to a pre-activated form of the mouse visual arrestin, determined by serial femtosecond X-ray laser crystallography. Together with extensive biochemical and mutagenesis data, the structure reveals an overall architecture of the rhodopsin-arrestin assembly in which rhodopsin uses distinct structural elements, including transmembrane helix 7 and helix 8, to recruit arrestin. Correspondingly, arrestin adopts the pre-activated conformation, with a ∼20° rotation between the amino and carboxy domains, which opens up a cleft in arrestin to accommodate a short helix formed by the second intracellular loop of rhodopsin. This structure provides a basis for understanding GPCR-mediated arrestin-biased signalling and demonstrates the power of X-ray lasers for advancing the frontiers of structural biology
Distinction of Scrambled Linear Block Codes Based on Extraction of Correlation Features
Aiming to solve the problem of the distinction of scrambled linear block codes, a method for identifying the scrambling types of linear block codes by combining correlation features and convolution long short-term memory neural networks is proposed in this paper. First, the cross-correlation characteristics of the scrambling sequence symbols are deduced, the partial autocorrelation function is constructed, the superiority of the partial autocorrelation function is determined by derivation, and the two are combined as the input correlation characteristics. A shallow network combining a convolutional neural network and LSTM is constructed; finally, the linear block code scrambled dataset is input into the network model, and the training and recognition test of the network is completed. The simulation results show that, compared with the traditional algorithm based on a multi-fractal spectrum, the proposed method can identify a synchronous scrambler, and the recognition accuracy is higher under a high bit error rate. Moreover, the method is suitable for classification under noise. The proposed method lays a foundation for future improvements in scrambler parameter identification
A systematic literature review of the impact of gamification instruction on students’ problem-solving skills
Gamification is a popular approach to teaching and learning in education in recent years, it provides the flexible integration of various game elements into educational activities to achieve instructional objectives. This approach has the potential to enhance student skills through interactive communication and motivation and has, sparked interest and discussion in many fields, including education. This review assesses the impact of gamification on student skills in terms of teaching methods, learning outcomes, and challenges faced. The study analyzed 21 research articles from three databases, Web of Science (WoS), Scopus, and ERIC, up to November 2023. The findings were grouped into three main themes: teaching implementation and methods (8 articles), learning outcomes and skill development (8 articles), and challenges and improvement strategies (5 articles). The findings suggest that although gamification has gained more results in the field of education, especially in terms of improving students’ skills and teaching effectiveness, there are shortcomings in integrating it with professional education. This review mainly highlights the impact of gamification in the development of students’ skills while considering gamification in conjunction with other aspects of education to illustrate the need for ongoing research and further exploration in the field of education
Research on the Construction of a Knowledge Graph and Knowledge Reasoning Model in the Field of Urban Traffic
The integration of multi-source transportation data is complex and insufficient in most of the big cities, which made it difficult for researchers to conduct in-depth data mining to improve the policy or the management. In order to solve this problem, a top-down approach is used to construct a knowledge graph of urban traffic system in this paper. First, the model layer of the knowledge graph was used to realize the reuse and sharing of knowledge. Furthermore, the model layer then was stored in the graph database Neo4j. Second, the representation learning based knowledge reasoning model was adopted to implement knowledge completion and improve the knowledge graph. Finally, the proposed method was validated with an urban traffic data set and the results showed that the model could be used to mine the implicit relationship between traffic entities and discover traffic knowledge effectively
- …