Search CORE

48,299 research outputs found

Reinforcement Learning: A Survey

Author: Kaelbling L. P.
Littman M. L.
Moore A. W.
Publication venue
Publication date: 01/01/1996
Field of study

This paper surveys the field of reinforcement learning from a computer-science perspective. It is written to be accessible to researchers familiar with machine learning. Both the historical basis of the field and a broad selection of current work are summarized. Reinforcement learning is the problem faced by an agent that learns behavior through trial-and-error interactions with a dynamic environment. The work described here has a resemblance to work in psychology, but differs considerably in the details and in the use of the word ``reinforcement.'' The paper discusses central issues of reinforcement learning, including trading off exploration and exploitation, establishing the foundations of the field via Markov decision theory, learning from delayed reinforcement, constructing empirical models to accelerate learning, making use of generalization and hierarchy, and coping with hidden state. It concludes with a survey of some implemented systems and an assessment of the practical utility of current methods for reinforcement learning.Comment: See http://www.jair.org/ for any accompanying file

arXiv.org e-Print Archive

CiteSeerX

Vibration limiting of rotors by feedback control

Author: Allaire P. E.
Bradley P. L.
Lewis D. W.
Moore J. W.
Publication venue
Publication date: 01/12/1982
Field of study

Experimental findings of a three mass rotor with four channels of feedback control are reported. The channels are independently controllable with force being proportional to the velocity and/or instantaneous displacement from equilibrium of the shaft at the noncontacting probe locations (arranged in the vertical and horizontal attitudes near the support bearings). The findings suggest that automatic feedback control of rotors is feasible for limiting certain vibration levels. Control of one end of a rotor does afford some predictable vibration limiting of the rotor at the other end

NASA Technical Reports Server

Structure of the chromosphere-corona transition region

Author: Fung P. C. W.
Moore R. L.
Publication venue
Publication date
Field of study

Structure and energy distribution of chromosphere-corona transition regio

NASA Technical Reports Server

An evaluation: The potential of discarded tires as a source of fuel

Author: Collins L. W.
Downs W. R.
Gibson E. K.
Moore G. W.
Publication venue
Publication date
Field of study

The destructive distillation of rubber tire samples was studied by thermogravimetry, differential scanning calorimetry, combustion calorimetry, and mass spectroscopy. The decomposition reaction was found to be exothermic and produced a mass loss of 65 percent. The gas evolution curves that were obtained indicate that a variety of organic materials are evolved simultaneously during the decomposition of the rubber polymer

NASA Technical Reports Server

Design study of general aviation collision avoidance system

Author: Bates M. R.
Moore L. D.
Scott W. V.
Publication venue
Publication date
Field of study

The selection and design of a time/frequency collision avoidance system for use in general aviation aircraft is discussed. The modifications to airline transport collision avoidance equipment which were made to produce the simpler general aviation system are described. The threat determination capabilities and operating principles of the general aviation system are illustrated

NASA Technical Reports Server

SPAR demonstration problems

Author: Moore R. A.
Whetstone W. D.
Yen C. L.
Publication venue
Publication date
Field of study

A series of examples are presented to indicate some of the principal functions of the SPAR system and to illustrate SPAR's control card-data card structure. Information in the following categories is given: (1) a description of the problem and, in most cases, comparisons with analytical solutions; (2) a list of the input cards; (3) a printout of the table of contents of the direct access library into which all SPAR output was directed; and (4) a few representative plots

NASA Technical Reports Server