41,046 research outputs found
Reinforcement Learning: A Survey
This paper surveys the field of reinforcement learning from a
computer-science perspective. It is written to be accessible to researchers
familiar with machine learning. Both the historical basis of the field and a
broad selection of current work are summarized. Reinforcement learning is the
problem faced by an agent that learns behavior through trial-and-error
interactions with a dynamic environment. The work described here has a
resemblance to work in psychology, but differs considerably in the details and
in the use of the word ``reinforcement.'' The paper discusses central issues of
reinforcement learning, including trading off exploration and exploitation,
establishing the foundations of the field via Markov decision theory, learning
from delayed reinforcement, constructing empirical models to accelerate
learning, making use of generalization and hierarchy, and coping with hidden
state. It concludes with a survey of some implemented systems and an assessment
of the practical utility of current methods for reinforcement learning.Comment: See http://www.jair.org/ for any accompanying file
Structure of the chromosphere-corona transition region
Structure and energy distribution of chromosphere-corona transition regio
An evaluation: The potential of discarded tires as a source of fuel
The destructive distillation of rubber tire samples was studied by thermogravimetry, differential scanning calorimetry, combustion calorimetry, and mass spectroscopy. The decomposition reaction was found to be exothermic and produced a mass loss of 65 percent. The gas evolution curves that were obtained indicate that a variety of organic materials are evolved simultaneously during the decomposition of the rubber polymer
Design study of general aviation collision avoidance system
The selection and design of a time/frequency collision avoidance system for use in general aviation aircraft is discussed. The modifications to airline transport collision avoidance equipment which were made to produce the simpler general aviation system are described. The threat determination capabilities and operating principles of the general aviation system are illustrated
SPAR demonstration problems
A series of examples are presented to indicate some of the principal functions of the SPAR system and to illustrate SPAR's control card-data card structure. Information in the following categories is given: (1) a description of the problem and, in most cases, comparisons with analytical solutions; (2) a list of the input cards; (3) a printout of the table of contents of the direct access library into which all SPAR output was directed; and (4) a few representative plots
Study of Chromium-Frit-Type Coatings for High-Temperature Protection of Molybdenum
The achievement of more compact and efficient power plants for aircraft is dependent, among other factors, on the perfection of heat-resisting materials that are superior to those in current use. Molybdenum is one of the high-melting metals (melting point, 4750 F). It is fairly abundant and also can be worked into many of the shapes required in modern power plants. To permit its widespread use at elevated temperatures, however, some means must first be found to prevent its rapid oxidation. The application of a protective coating is one method that might be used to achieve this goal. In the present work, a number of chromium-frit-type coatings were studied. These were bonded to molybdenum specimens by firing in controlled atmospheres to temperatures in the range of 2400 to 2700 F
- …