48,299 research outputs found
Reinforcement Learning: A Survey
This paper surveys the field of reinforcement learning from a
computer-science perspective. It is written to be accessible to researchers
familiar with machine learning. Both the historical basis of the field and a
broad selection of current work are summarized. Reinforcement learning is the
problem faced by an agent that learns behavior through trial-and-error
interactions with a dynamic environment. The work described here has a
resemblance to work in psychology, but differs considerably in the details and
in the use of the word ``reinforcement.'' The paper discusses central issues of
reinforcement learning, including trading off exploration and exploitation,
establishing the foundations of the field via Markov decision theory, learning
from delayed reinforcement, constructing empirical models to accelerate
learning, making use of generalization and hierarchy, and coping with hidden
state. It concludes with a survey of some implemented systems and an assessment
of the practical utility of current methods for reinforcement learning.Comment: See http://www.jair.org/ for any accompanying file
Vibration limiting of rotors by feedback control
Experimental findings of a three mass rotor with four channels of feedback control are reported. The channels are independently controllable with force being proportional to the velocity and/or instantaneous displacement from equilibrium of the shaft at the noncontacting probe locations (arranged in the vertical and horizontal attitudes near the support bearings). The findings suggest that automatic feedback control of rotors is feasible for limiting certain vibration levels. Control of one end of a rotor does afford some predictable vibration limiting of the rotor at the other end
Structure of the chromosphere-corona transition region
Structure and energy distribution of chromosphere-corona transition regio
An evaluation: The potential of discarded tires as a source of fuel
The destructive distillation of rubber tire samples was studied by thermogravimetry, differential scanning calorimetry, combustion calorimetry, and mass spectroscopy. The decomposition reaction was found to be exothermic and produced a mass loss of 65 percent. The gas evolution curves that were obtained indicate that a variety of organic materials are evolved simultaneously during the decomposition of the rubber polymer
Design study of general aviation collision avoidance system
The selection and design of a time/frequency collision avoidance system for use in general aviation aircraft is discussed. The modifications to airline transport collision avoidance equipment which were made to produce the simpler general aviation system are described. The threat determination capabilities and operating principles of the general aviation system are illustrated
SPAR demonstration problems
A series of examples are presented to indicate some of the principal functions of the SPAR system and to illustrate SPAR's control card-data card structure. Information in the following categories is given: (1) a description of the problem and, in most cases, comparisons with analytical solutions; (2) a list of the input cards; (3) a printout of the table of contents of the direct access library into which all SPAR output was directed; and (4) a few representative plots
- …