4,755 research outputs found
Recommended from our members
Learning to Act with RVRL Agents
The use of reinforcement learning to guide action selection of cognitive agents has been shown to be a powerful technique for stochastic environments. Standard Reinforcement learning techniques used to provide decision theoretic policies rely, however, on explicit state-based computations of value for each state-action pair. This requires the computation of a number of values exponential to the number of state variables and actions in the system. This research extends existing work with an acquired probabilistic rule representation of an agent environment by developing an algorithm to apply reinforcement learning to values attached to the rules themselves. Structure captured by the rules is then used to learn a policy directly. The resulting value attached to each rule represents the utility of taking an action if the conditions of the rule are present in the agentâs current set of percepts. This has several advantages for planning purposes: generalization over many states and over unseen states; effective decisions can therefore be made with less training data than state based modelling systems (e.g. Dyna Q-Learning); and the problem of computation in an exponential state-action space is alleviated. The results of application of this algorithm to rules in a specific environment are presented, with comparison to standard reinforcement learning policies developed from related work
Recommended from our members
Structure-From-Motion Photogrammetry of Antarctic Historical Aerial Photographs in Conjunction with Ground Control Derived from Satellite Data
A longer temporal scale of Antarctic observations is vital to better understanding glacier dynamics and improving ice sheet model projections. One underutilized data source that expands the temporal scale is aerial photography, specifically imagery collected prior to 1990. However, processing Antarctic historical aerial imagery using modern photogrammetry software is difficult, as it requires precise information about the data collection process and extensive in situ ground control is required. Often, the necessary orientation metadata for older aerial imagery is lost and in situ data collection in regions like Antarctica is extremely difficult to obtain, limiting the use of traditional photogrammetric methods. Here, we test an alternative methodology to generate elevations from historical Antarctic aerial imagery. Instead of relying on pre-existing ground control, we use structure-from-motion photogrammetry techniques to process the imagery with manually derived ground control from high-resolution satellite imagery. This case study is based on vertical aerial image sets collected over Byrd Glacier, East Antarctica in December 1978 and January 1979. Our results are the oldest, highest resolution digital elevation models (DEMs) ever generated for an Antarctic glacier. We use these DEMs to estimate glacier dynamics and show that surface elevation of Byrd Glacier has been constant for the past âŒ40 years
The quality of different types of child care at 10 and 18 months. A comparison between types and factors related to quality.
The quality of care offered in four different types of non-parental child care to 307 infants at 10 months old and 331 infants at 18 months old was compared and factors associated with higher quality were identified. Observed quality was lowest in nurseries at each age point, except that at 18 months they offered more learning activities. There were few differences in the observed quality of care by child-minders, grandparents and nannies, although grandparents had somewhat lower safety and health scores and offered children fewer activities. Cost was largely unrelated to quality of care except in child-minding, where higher cost was associated with higher quality. Observed ratios of children to adults had a significant impact on quality of nursery care; the more infants or toddlers each adult had to care for, the lower the quality of the care she gave them. Mothers' overall satisfaction with their child's care was positively associated with its quality for home-based care but not for nursery settings
State-to-State Differential and Relative Integral Cross Sections for Rotationally Inelastic Scattering of H2O by Hydrogen
State-to-state differential cross sections (DCSs) for rotationally inelastic
scattering of H2O by H2 have been measured at 71.2 meV (574 cm-1) and 44.8 meV
(361 cm-1) collision energy using crossed molecular beams combined with
velocity map imaging. A molecular beam containing variable compositions of the
(J = 0, 1, 2) rotational states of hydrogen collides with a molecular beam of
argon seeded with water vapor that is cooled by supersonic expansion to its
lowest para or ortho rotational levels (JKaKc= 000 and 101, respectively).
Angular speed distributions of fully specified rotationally excited final
states are obtained using velocity map imaging. Relative integral cross
sections are obtained by integrating the DCSs taken with the same experimental
conditions. Experimental state-specific DCSs are compared with predictions from
fully quantum scattering calculations on the most complete H2O-H2 potential
energy surface. Comparison of relative total cross sections and state-specific
DCSs show excellent agreement with theory in almost all detailsComment: 46 page
Two state scattering problem to Multi-channel scattering problem: Analytically solvable model
Starting from few simple examples we have proposed a general method for
finding an exact analytical solution for the two state scattering problem in
presence of a delta function coupling. We have also extended our model to deal
with general one dimensional multi-channel scattering problems
Pulse-driven near-resonant quantum adiabatic dynamics: lifting of quasi-degeneracy
We study the quantum dynamics of a two-level system driven by a pulse that
starts near-resonant for small amplitudes, yielding nonadiabatic evolution, and
induces an adiabatic evolution for larger amplitudes. This problem is analyzed
in terms of lifting of degeneracy for rising amplitudes. It is solved exactly
for the case of linear and exponential rising. Approximate solutions are given
in the case of power law rising. This allows us to determine approximative
formulas for the lineshape of resonant excitation by various forms of pulses
such as truncated trig-pulses. We also analyze and explain the various
superpositions of states that can be obtained by the Half Stark Chirped Rapid
Adiabatic Passage (Half-SCRAP) process.Comment: 21 pages, 12 figure
The Maslov index and nondegenerate singularities of integrable systems
We consider integrable Hamiltonian systems in R^{2n} with integrals of motion
F = (F_1,...,F_n) in involution. Nondegenerate singularities are critical
points of F where rank dF = n-1 and which have definite linear stability. The
set of nondegenerate singularities is a codimension-two symplectic submanifold
invariant under the flow. We show that the Maslov index of a closed curve is a
sum of contributions +/- 2 from the nondegenerate singularities it is encloses,
the sign depending on the local orientation and stability at the singularities.
For one-freedom systems this corresponds to the well-known formula for the
Poincar\'e index of a closed curve as the oriented difference between the
number of elliptic and hyperbolic fixed points enclosed. We also obtain a
formula for the Liapunov exponent of invariant (n-1)-dimensional tori in the
nondegenerate singular set. Examples include rotationally symmetric n-freedom
Hamiltonians, while an application to the periodic Toda chain is described in a
companion paper.Comment: 27 pages, 1 figure; published versio
Maslov Indices and Monodromy
We prove that for a Hamiltonian system on a cotangent bundle that is
Liouville-integrable and has monodromy the vector of Maslov indices is an
eigenvector of the monodromy matrix with eigenvalue 1. As a corollary the
resulting restrictions on the monodromy matrix are derived.Comment: 6 page
Two-dimensional atom trapping in field-induced adiabatic potentials
We show how to create a novel two-dimensional trap for ultracold atoms from a conventional magnetic trap. We achieve this by utilizing rf-induced adiabatic potentials to enhance the trapping potential in one direction. We demonstrate the loading process and discuss the experimental conditions under which it might be possible to prepare a 2D Bose condensate. A scheme for the preparation of coherent matterwave bubbles is also discussed
- âŠ