Hi-Val: Iterative Learning of Hierarchical Value Functions for Policy Generation

Author: D Silver
G Chowdhary
G Konidaris
J Hostetler
Levente Kocsis
M Jun
P Auer
RS Sutton
TG Dietterich
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2019

Task decomposition is effective in manifold applications where the global complexity of a problem makes planning and decision-making too demanding. This is true, for example, in high-dimensional robotics domains, where (1) unpredictabilities and modeling limitations typically prevent the manual specification of robust behaviors, and (2) learning an action policy is challenging due to the curse of dimensionality. In this work, we borrow the concept of Hierarchical Task Networks (HTNs) to decompose the learning procedure, and we exploit Upper Confidence Tree (UCT) search to introduce HOP, a novel iterative algorithm for hierarchical optimistic planning with learned value functions. To obtain better generalization and generate policies, HOP simultaneously learns and uses action values. These are used to formalize constraints within the search space and to reduce the dimensionality of the problem. We evaluate our algorithm both on a fetching task using a simulated 7-DOF KUKA lightweight arm and on a pick-and-delivery task with a Pioneer robot.
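The abstract names UCT search but does not state the selection rule HOP uses. As a minimal sketch, assuming the standard UCB1 rule that UCT is usually built on, action selection at a tree node could look like the following. The function name `uct_select` and the exploration constant `c` are hypothetical and do not come from the paper.

```python
import math


def uct_select(values, visits, c=1.4):
    """Pick an action index with the UCB1 rule (assumed, not taken from the paper).

    values[i] is the mean return observed for action i,
    visits[i] is how many times action i has been tried at this node.
    """
    # Try every untried action once before comparing scores.
    for i, n in enumerate(visits):
        if n == 0:
            return i

    total = sum(visits)
    # Mean value plus an exploration bonus that shrinks as an action is visited more.
    scores = [
        q + c * math.sqrt(math.log(total) / n)
        for q, n in zip(values, visits)
    ]
    return max(range(len(scores)), key=scores.__getitem__)
```

With equal mean values, the less-visited action receives the larger bonus and is chosen; setting `c=0` reduces the rule to greedy selection.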

Crossref

Archivio della ricerca- Università di Roma La Sapienza

Validity and practical utility of accelerometry for the measurement of in-hand physical activity in horses

Author: Carnwath J.
Horsfield E.
Hunter-Blair N.
Morrison R.
Ramsoy C.
Sutton D. G. M.
Yam P. S.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 11/09/2015

Background: Accelerometers are valid, practical and reliable tools for the measurement of habitual physical activity (PA). Quantification of PA in horses is desirable for use in research and clinical settings. The objective of this study was to evaluate a triaxial accelerometer for objective measurement of PA in the horse by assessment of its practical utility and validity. Horses were recruited to establish the optimal site of accelerometer attachment, and a questionnaire was designed to explore owner acceptance. Validity and cut-off values were obtained by assessing PA at various gaits. Validation study: 20 horses wore the accelerometer while being filmed for 10 min each of rest, walking and trotting and 5 min of canter work. Practical utility study: five horses wore accelerometers on polls and withers for 18 h; compliance and relative data losses were quantified. Results: Accelerometry output differed significantly between the four PA levels (P < 0.001) for both withers and poll placement. For withers placement, ROC analyses found optimal sensitivity and specificity at a cut-off of <47 counts per minute (cpm) for rest (sensitivity 99.5 %, specificity 100 %), 967–2424 cpm for trotting (sensitivity 96.7 %, specificity 100 %) and ≥2425 cpm for cantering (sensitivity 96.0 %, specificity 97.0 %). Attachment at the poll resulted in optimal sensitivity and specificity at a cut-off of <707 counts per minute (cpm) for rest (sensitivity 97.5 %, specificity 99.6 %), 1546–2609 cpm for trotting (sensitivity 90.33 %, specificity 79.25 %) and ≥2610 cpm for cantering (sensitivity 100 %, specificity 100 %). In terms of practical utility, accelerometry was well tolerated and owner acceptance was high. Conclusion: Accelerometry data correlated well with varying levels of in-hand equine activity. The use of accelerometers is a valid method for objective measurement of controlled PA in the horse.
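To make the reported withers cut-offs concrete, the sketch below turns them into a simple threshold classifier and shows how sensitivity and specificity are computed for one activity level. The function names are hypothetical. The abstract gives no cut-off for walking, so values that fall outside the three reported bands are returned as "unclassified" rather than guessed.

```python
def classify_withers_cpm(cpm):
    """Map counts per minute (withers placement) to an activity label.

    Thresholds are the ones quoted in the abstract; no walking band is
    reported there, so everything else is left unclassified.
    """
    if cpm < 47:
        return "rest"
    if 967 <= cpm <= 2424:
        return "trot"
    if cpm >= 2425:
        return "canter"
    return "unclassified"


def sensitivity_specificity(true_labels, predicted_labels, positive):
    """Return (sensitivity, specificity) for one label treated as positive.

    Assumes at least one positive and one negative example are present.
    """
    tp = fn = tn = fp = 0
    for truth, pred in zip(true_labels, predicted_labels):
        if truth == positive:
            if pred == positive:
                tp += 1
            else:
                fn += 1
        else:
            if pred == positive:
                fp += 1
            else:
                tn += 1
    return tp / (tp + fn), tn / (tn + fp)
```

For example, predictions can be produced with `[classify_withers_cpm(c) for c in counts]` and scored against filmed ground-truth labels with `sensitivity_specificity(truth, predictions, "rest")`.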

Crossref

Springer - Publisher Connector

PubMed Central

Enlighten

A new constant-pressure molecular dynamics method for finite system

Author: Allen M P
Angilella G G N
Birch F
D Y Sun
Landau L D
Reich S
Sun D Y
Sutton A P
Tolbert S H
X G Gong
Publication venue: 'IOP Publishing'
Publication date: 10/02/2001

In this letter, by writing the volume as a function of the atomic coordinates, we present a new parameter-free constant-pressure molecular dynamics method. This method is especially appropriate for finite systems, in which periodic boundary conditions do not exist. Simulations on a carbon nanotube and a Ni nanoparticle clearly demonstrate the validity of the method. Using this method, one can easily obtain the equation of state for a finite system under external pressure. Comment: RevTex, 5 pages, 3 figures, submitted to Phys. Rev. Let

arXiv.org e-Print Archive

Crossref

Variable cavity volume tooling for high-performance resin infusion moulding

Author: Achim V.
Armstrong D. L.
Chowdhury F. H.
G J Gibbons
Gibbons G. J.
Grande J.
J J Segui-Garza
McConnell V. P.
R G Hansell
Schwartz M. M.
Sutton G.
Toi Y.
Publication venue: 'SAGE Publications'
Publication date: 01/04/2010

This article describes the research carried out by Warwick under the BAE Systems/EPSRC programme ‘Flapless Aerial Vehicles Integrated Interdisciplinary Research – FLAVIIR’. Warwick's aim in FLAVIIR was to develop low-cost innovative tooling technologies to enable the affordable manufacture of complex composite aerospace structures and to help realize the aim of the Grand Challenge of maintenance-free, low-cost unmanned aerial vehicle manufacture. This article focuses on the evaluation of a novel tooling process (variable cavity tooling) to enable the complete infusion of resin throughout non-crimp fabric within a mould cavity under low (0.1 MPa) injection pressure. The contribution of the primary processing parameters to the mechanical properties of a carbon composite component (bulk-head lug section), and the interactions between parameters, was determined. The initial mould gap (di) was identified as having the most significant effect on all measured mechanical properties, but complex interactions between di, n (number of fabric layers), and vc (mould closure rate) were observed. The process capability was low due to the manual processing, but was improved through process optimization, and delivered properties comparable to high-pressure resin transfer moulding

Crossref

Warwick Research Archives Portal Repository

A simple environment-dependent overlap potential and Cauchy violation in solid argon

Author: Aoki M
Born M
Finnis M W
Haas H
Kittel C
Masato Aoki
Nguyen-Manh D
Perdew J P
Pettifor D G
Rosciszewski K
Skinner A J
Sutton A P
Tatsuya Kurokawa
Publication venue: 'IOP Publishing'
Publication date: 09/01/2007

We develop an analytic, environment-dependent interatomic potential for the overlap repulsion in solid argon, based on an approximate treatment of non-orthogonal tight-binding theory for closed-shell systems. The present model is very simple, yet it reproduces well the observed elastic properties of solid argon, including the Cauchy violation at high pressures. A useful and novel analysis is given to show how the elastic properties are related to the environment dependence incorporated into a generic pairwise potential. The present study has a close link to the broad field of computational materials science, in which the inclusion of environment dependence in the short-ranged repulsive part of a potential model is sometimes crucial in predicting the elastic properties correctly. Comment: 10 pages, 3 figure

arXiv.org e-Print Archive

Crossref

A tight binding model for water

Author: A. T. Paxton
Eisenberg D.
Harrison W. A.
Herzberg G.
J. J. Kohanoff
Lide D. R.
Murrell J. N.
Pettifor D. G.
Schwefel H.-P.
Stone A. J.
Stoneham A. M.
Sutton A. P.
Publication venue: 'AIP Publishing'
Publication date: 30/09/2010

We demonstrate for the first time a tight binding model for water incorporating polarizable anions. A novel aspect is that we adopt a "ground up" approach in that properties of the monomer and dimer only are fitted. Subsequently we make predictions of the structure and properties of hexamer clusters, ice-XI and liquid water. A particular feature, missing in current tight binding and semiempirical hamiltonians, is that we reproduce the almost two-fold increase in molecular dipole moment as clusters are built up towards the limit of bulk liquid. We concentrate on properties of liquid water which are very well rendered in comparison with experiment and published density functional calculations. Finally we comment on the question of the contrasting densities of water and ice which is central to an understanding of the subtleties of the hydrogen bond

arXiv.org e-Print Archive

Queen's University Belfast Research Portal

Crossref

King's Research Portal

Temperature dependence of surface reconstructions of Au on Pd(110)

Author: A. Hairie
A. P. Sutton
B. D. Todd
C. T. Chan
G. Binnig
H. Häkkinen
H. Rafii-Tabar
J. A. Nieminen
J. R. Noonan
K.-M. Ho
M. I. Haftel
P. J. Schmitz
P. Kaukasoina
R. LeSar
S. Rousset
T. Gritsch
W. H. Press
Publication venue: 'American Physical Society (APS)'
Publication date: 30/03/1995

Surface reconstructions of Au film on Pd(110) substrate are studied using a local Einstein approximation to quasiharmonic theory with the Sutton-Chen interatomic potential. Temperature dependent surface free energies for different coverages and surface structures are calculated. Experimentally observed transformations from $(1 \times 1)$ to $(1 \times 2)$ and $(1 \times 3)$ structures can be explained in the framework of this model. Also conditions for Stranski-Krastanov growth mode are found to comply with experiments. The domain of validity of the model neglecting mixing entropy is analyzed. Comment: 7 pages, REVTeX two-column format, 3 postscript figures available on request from [email protected] To appear in Phys. Rev. Letter

arXiv.org e-Print Archive

Crossref

Inelastic quantum transport: the self-consistent Born approximation and correlated electron-ion dynamics

Author: A. B. Migdal
A. P. Sutton
Andrew P. Horsfield
D. Dundas
Daniel Dundas
Eunan J. McEniry
G. D. Mahan
H. Haug
M. Tsutsui
Tchavdar N. Todorov
Thomas Frederiksen
Publication venue: 'American Physical Society (APS)'
Publication date: 28/02/2008

A dynamical method for inelastic transport simulations in nanostructures is compared with a steady-state method based on non-equilibrium Green's functions. A simplified form of the dynamical method produces, in the steady state in the weak-coupling limit, effective self-energies analogous to those in the Born Approximation due to electron-phonon coupling. The two methods are then compared numerically on a resonant system consisting of a linear trimer weakly embedded between metal electrodes. This system exhibits enhanced heating at high biases and long phonon equilibration times. Despite the differences in their formulation, the static and dynamical methods capture local current-induced heating and inelastic corrections to the current with good agreement over a wide range of conditions, except in the limit of very high vibrational excitations, where differences begin to emerge. Comment: 12 pages, 7 figure

arXiv.org e-Print Archive

Queen's University Belfast Research Portal

Crossref

Learning from Monte Carlo Rollouts with Opponent Models for Playing Tron

Author: AL Samuel
CJ Watkins
D Silver
G Tesauro
J Baxter
J Schmidhuber
L Kocsis
M Otterlo van
RS Sutton
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 30/12/2018

This paper describes a novel reinforcement learning system for learning to play the game of Tron. The system combines Q-learning, multi-layer perceptrons, vision grids, opponent modelling, and Monte Carlo rollouts in a novel way. By learning an opponent model, Monte Carlo rollouts can be effectively applied to generate state trajectories for all possible actions, from which improved action estimates can be computed. This extends experience replay by making it possible to update the state-action values of all actions in a given game state simultaneously. The results show that experience replay that updates the Q-values of all actions simultaneously strongly outperforms conventional experience replay, which updates only the Q-value of the performed action. The results also show that using short or long rollout horizons during training leads to similarly good performance against two fixed opponents.
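The all-actions update described above can be illustrated with a small tabular sketch. This is a simplification under stated assumptions: the paper uses multi-layer perceptrons rather than a table, and the abstract does not specify how rollout trajectories are turned into action estimates. Here each rollout is assumed to yield a list of rewards, and its discounted return serves as the target. The names `discounted_return` and `update_all_actions` and the parameters `alpha` and `gamma` are hypothetical.

```python
import numpy as np


def discounted_return(rewards, gamma):
    """Sum of rewards along one rollout, discounted by gamma per step."""
    return sum((gamma ** t) * r for t, r in enumerate(rewards))


def update_all_actions(q, state, rollout_rewards, alpha=0.1, gamma=0.95):
    """Move Q(state, a) toward a rollout-based estimate for every action a.

    q               : array of shape (n_states, n_actions), updated in place
    rollout_rewards : dict mapping each action to the reward sequence of a
                      rollout that started with that action (assumed input)
    """
    for action, rewards in rollout_rewards.items():
        target = discounted_return(rewards, gamma)
        q[state, action] += alpha * (target - q[state, action])
    return q
```

Conventional experience replay would apply the same update only to the single action that was actually performed in `state`.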

Crossref

Proceedings - University of Groningen

University of Groningen

ARTS repository - University of Groningen

Dissertations of the University of Groningen