Discrete-time weight updates in neural-adaptive control

Author: D Richert
C J B Macnab
K Masaud
Publication date: 11/04/2020

Typical neural-adaptive control approaches update neural-network weights as though they were adaptive parameters in continuous-time adaptive control. However, requiring fast digital rates usually restricts the size of the neural network. In this paper we analyze a delta-rule update for the weights, applied at a relatively slow digital rate. We show that the digital weight update causes the neural network to estimate a discrete-time model of the system, assuming that state feedback is still applied in continuous time. A Lyapunov analysis shows uniformly ultimately bounded signals. Furthermore, slowing the update frequency and using the extra computational time to increase the size and accuracy of the neural network results in better performance. Experimental results achieving link tracking of a two-link flexible-joint robot verify the improved performance.
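
To make the idea of a slow delta-rule weight update concrete, here is a minimal toy sketch. It is not the paper's controller or its update law: the radial-basis network, the gain `beta`, the stand-in state trajectory, the target function `sin(x)`, and the names `rbf_features`, `delta_rule_step`, and `run` are all assumptions made for illustration. It only shows a generic delta rule, w <- w + beta * error * phi(x), applied once every `update_every` ticks of a faster loop.

```python
import math

def rbf_features(x, centers, width=0.5):
    """Gaussian radial-basis activations for a scalar input x (assumed basis)."""
    return [math.exp(-((x - c) / width) ** 2) for c in centers]

def delta_rule_step(weights, features, error, beta=0.2):
    """Generic delta rule: w <- w + beta * error * phi(x)."""
    return [w + beta * error * f for w, f in zip(weights, features)]

def run(update_every=10, steps=5000, dt=0.001):
    """Fast loop ticks every dt; weights change only every `update_every` ticks."""
    centers = [i / 4 for i in range(-8, 9)]  # hypothetical basis layout on [-2, 2]
    weights = [0.0] * len(centers)
    errors = []
    for k in range(steps):
        # Stand-in state trajectory (assumption, not a robot model).
        x = 2.0 * math.sin(2.0 * math.pi * k * dt)
        target = math.sin(x)  # toy unknown function to be estimated
        phi = rbf_features(x, centers)
        estimate = sum(w * f for w, f in zip(weights, phi))
        error = target - estimate
        errors.append(abs(error))
        # Slow digital update: only on every `update_every`-th fast tick.
        if k % update_every == 0:
            weights = delta_rule_step(weights, phi, error)
    return errors

if __name__ == "__main__":
    errs = run()
    print("mean |error|, first 500 ticks:", sum(errs[:500]) / 500)
    print("mean |error|, last 500 ticks:", sum(errs[-500:]) / 500)
```

Because the weights are touched on only a fraction of the ticks, the per-tick cost of the update is spread out, which is the kind of computational budget the abstract suggests spending on a larger network.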

CiteSeerX

A Hierarchical Framework of Cloud Resource Allocation and Power Management Using Deep Reinforcement Learning

Author: Li Zhe
Lin Sheng
Liu Ning
Qiu Qinru
Tang Jian
Wang Yanzhi
Xu Jielong
Xu Zhiyuan
Publication date: 11/08/2017

Automatic decision-making approaches, such as reinforcement learning (RL), have been applied to (partially) solve the resource allocation problem adaptively in cloud computing systems. However, a complete cloud resource allocation framework exhibits high dimensions in its state and action spaces, which limits the usefulness of traditional RL techniques. In addition, high power consumption has become one of the critical concerns in the design and control of cloud computing systems, as it degrades system reliability and increases cooling cost. An effective dynamic power management (DPM) policy should minimize power consumption while keeping performance degradation within an acceptable level. Thus, a joint virtual machine (VM) resource allocation and power management framework is critical to the overall cloud computing system. Moreover, a novel solution framework is necessary to address the even higher dimensions in the state and action spaces. In this paper, we propose a novel hierarchical framework for solving the overall resource allocation and power management problem in cloud computing systems. The proposed hierarchical framework comprises a global tier for VM resource allocation to the servers and a local tier for distributed power management of local servers. The emerging deep reinforcement learning (DRL) technique, which can deal with complicated control problems with a large state space, is adopted to solve the global tier problem. Furthermore, an autoencoder and a novel weight-sharing structure are adopted to handle the high-dimensional state space and accelerate convergence. The local tier of distributed server power management, on the other hand, comprises an LSTM-based workload predictor and a model-free RL-based power manager, operating in a distributed manner.

Comment: accepted by 37th IEEE International Conference on Distributed Computing (ICDCS 2017)

arXiv.org e-Print Archive

Crossref

Reinforcement Learning: A Survey

Author: Kaelbling L. P.
Littman M. L.
Moore A. W.
Publication date: 01/01/1996

This paper surveys the field of reinforcement learning from a computer-science perspective. It is written to be accessible to researchers familiar with machine learning. Both the historical basis of the field and a broad selection of current work are summarized. Reinforcement learning is the problem faced by an agent that learns behavior through trial-and-error interactions with a dynamic environment. The work described here has a resemblance to work in psychology, but differs considerably in the details and in the use of the word "reinforcement." The paper discusses central issues of reinforcement learning, including trading off exploration and exploitation, establishing the foundations of the field via Markov decision theory, learning from delayed reinforcement, constructing empirical models to accelerate learning, making use of generalization and hierarchy, and coping with hidden state. It concludes with a survey of some implemented systems and an assessment of the practical utility of current methods for reinforcement learning.

Comment: See http://www.jair.org/ for any accompanying file

arXiv.org e-Print Archive

CiteSeerX