    Extending Feynman's Formalisms for Modelling Human Joint Action Coordination

    The recently developed Life-Space-Foam approach to goal-directed human action deals with individual actor dynamics. This paper applies the model to characterize the dynamics of co-action by two or more actors. The co-action dynamics is modelled by (i) a two-term joint action (comprising a cognitive/motivational potential and a kinetic energy term), and (ii) its associated adaptive path integral, representing an infinite-dimensional neural network. The feedback adaptation loop is derived from Bernstein's concept of a sensory-corrections loop in human motor control and from Brooks' subsumption architectures in robotics. Potential applications of the proposed model in human-robot interaction research are discussed. Keywords: psycho-physics, human joint action, path integrals.
    Comment: 6 pages, LaTeX
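    As an orientation for readers unfamiliar with the formalism, the following is a minimal sketch of a generic two-term action and its Feynman path-integral transition amplitude; the symbols, the metric g_{ij}, and the form of the potential V(q) are illustrative assumptions rather than the paper's exact definitions.

        % Hedged sketch in standard Feynman notation (assumed forms, not the paper's):
        A[q] \;=\; \int_{t_0}^{t_1} \Big( \tfrac{1}{2}\, g_{ij}\, \dot q^{i} \dot q^{j} \;-\; V(q) \Big)\, dt ,
        \qquad
        \langle q_1, t_1 \,|\, q_0, t_0 \rangle \;=\; \int \mathcal{D}[q] \; e^{\, i A[q]}
        % kinetic term: coordination "motion" of the co-acting agents
        % V(q): cognitive/motivational potential
        % the paper's "adaptive" variant additionally reweights paths through the
        % feedback loop described in the abstract (our paraphrase).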

    Modelling the hepatitis B vaccination programme in prisons

    A vaccination programme offering hepatitis B (HBV) vaccine at reception into prison has been introduced in selected prisons in England and Wales, and it is anticipated that the programme will be extended over the coming years. A model has been developed to assess the potential impact of the programme on the vaccination coverage of prisoners, ex-prisoners, and injecting drug users (IDUs). Under a range of coverage scenarios, the model predicts the change over time in the vaccination status of new entrants to prison, current prisoners, and IDUs in the community. The model predicts that, at baseline, 57% of the IDU population will be vaccinated in 2012, with up to 72% vaccinated depending on the vaccination scenario implemented. These results are sensitive to the size of the IDU population in England and Wales and to the average time served by an IDU during each prison visit. IDUs who do not receive HBV vaccine in the community are at increased risk of HBV infection. The HBV vaccination programme in prisons is an effective way of vaccinating this hard-to-reach population, although vaccination coverage at prison reception must be increased to achieve this.
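    As a rough illustration of the kind of coverage projection such a model produces, the following is a minimal discrete-time sketch in which IDUs cycle through prison and can be vaccinated at reception; the entry rate, uptake, and turnover values are illustrative assumptions, not the published model or its parameters.

        import numpy as np

        # Minimal sketch (illustrative assumptions, not the published model):
        # each year a fraction of IDUs passes through prison reception, a fraction
        # of unvaccinated entrants accepts HBV vaccine, and a fraction of the IDU
        # population is replaced by new, unvaccinated individuals.
        years = np.arange(2001, 2021)
        entry_rate = 0.25              # fraction of IDUs imprisoned per year (assumed)
        uptake_at_reception = 0.6      # vaccine uptake among unvaccinated entrants (assumed)
        turnover = 0.08                # annual replacement of IDUs by new injectors (assumed)

        vaccinated = 0.0               # vaccinated fraction of the IDU population
        for year in years:
            newly_vaccinated = (1.0 - vaccinated) * entry_rate * uptake_at_reception
            vaccinated = (vaccinated + newly_vaccinated) * (1.0 - turnover)
            print(year, f"{vaccinated:.1%}")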

    Prediction with Expert Advice under Discounted Loss

    We study prediction with expert advice in the setting where losses are accumulated with some discounting, so that the impact of old losses may gradually vanish. We generalize the Aggregating Algorithm and the Aggregating Algorithm for Regression to this case, propose a suitable new variant of the exponential weights algorithm, and prove the corresponding loss bounds.
    Comment: 26 pages; expanded (2 remarks -> theorems), some misprints corrected
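    A minimal sketch of an exponential-weights forecaster with discounted losses, in the spirit of the algorithms the abstract generalizes; the geometric discount, the learning rate, and the uniform random losses are illustrative assumptions, and the paper's exact variant and loss bounds differ.

        import numpy as np

        # Exponential weights over N experts where past losses are discounted by a
        # factor gamma in (0, 1], so older losses gradually matter less.
        rng = np.random.default_rng(0)
        N, T = 5, 200
        eta, gamma = 0.5, 0.95                      # learning rate and discount (assumed)

        discounted_loss = np.zeros(N)               # discounted cumulative expert losses
        learner_loss = 0.0
        for t in range(T):
            expert_losses = rng.uniform(0.0, 1.0, size=N)   # losses revealed this round
            weights = np.exp(-eta * discounted_loss)
            weights /= weights.sum()
            learner_loss = gamma * learner_loss + weights @ expert_losses
            discounted_loss = gamma * discounted_loss + expert_losses

        print(f"learner {learner_loss:.2f} vs best expert {discounted_loss.min():.2f}")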

    Toward Interpretable Deep Reinforcement Learning with Linear Model U-Trees

    Deep Reinforcement Learning (DRL) has achieved impressive success in many applications. A key component of many DRL models is a neural network representing a Q function that estimates the expected cumulative reward following a state-action pair. The Q-function network encodes a great deal of implicit knowledge about the RL problem, but this knowledge often remains unexamined and uninterpreted. To our knowledge, this work develops the first mimic learning framework for Q functions in DRL. We introduce Linear Model U-trees (LMUTs) to approximate neural network predictions. An LMUT is learned using a novel on-line algorithm that is well suited to an active play setting, where the mimic learner observes an ongoing interaction between the neural net and the environment. Empirical evaluation shows that an LMUT mimics a Q function substantially better than five baseline methods. The transparent tree structure of an LMUT facilitates understanding of the network's learned knowledge through analysis of feature influence, rule extraction, and highlighting of super-pixels in image inputs.
    Comment: This paper is accepted by ECML-PKDD 201
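    A rough sketch of the mimic-learning setup in the active-play setting follows, with an ordinary regression tree standing in for an LMUT (the paper's learner is an on-line U-tree with linear models at the leaves); the toy Q function, the transition dynamics, and the tree hyperparameters are illustrative assumptions.

        import numpy as np
        from sklearn.tree import DecisionTreeRegressor

        # Mimic learning sketch: watch a "Q network" act in an environment, record
        # (state, action) -> Q(state, action) pairs, then fit an interpretable tree.
        rng = np.random.default_rng(1)
        n_actions = 3

        def q_network(state, action):
            # stand-in for a trained neural Q function (illustrative)
            return -np.sum((state - action) ** 2) + 0.1 * action

        states, actions, q_values = [], [], []
        state = rng.normal(size=4)
        for _ in range(2000):
            qs = [q_network(state, a) for a in range(n_actions)]
            action = int(np.argmax(qs))                         # active play: follow the net
            states.append(state)
            actions.append(action)
            q_values.append(qs[action])
            state = 0.9 * state + rng.normal(scale=0.1, size=4)  # toy transition

        X = np.column_stack([np.array(states), np.array(actions)])
        y = np.array(q_values)
        mimic = DecisionTreeRegressor(max_depth=5).fit(X, y)
        print("mimic R^2:", round(mimic.score(X, y), 3))
        print("feature importances:", np.round(mimic.feature_importances_, 3))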

    Self-Modification of Policy and Utility Function in Rational Agents

    Any agent that is part of the environment it interacts with and has versatile actuators (such as arms and fingers) will in principle have the ability to self-modify, for example by changing its own source code. As we continue to create more and more intelligent agents, the chances increase that they will learn about this ability. The question is: will they want to use it? For example, highly intelligent systems may find ways to change their goals to something more easily achievable, thereby 'escaping' the control of their designers. In an important paper, Omohundro (2008) argued that goal preservation is a fundamental drive of any intelligent system, since a goal is more likely to be achieved if future versions of the agent strive towards the same goal. In this paper, we formalise this argument in general reinforcement learning and explore situations where it fails. Our conclusion is that the possibility of self-modification is harmless if and only if the agent's value function anticipates the consequences of self-modifications and uses the current utility function when evaluating the future.
    Comment: Artificial General Intelligence (AGI) 201
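    The following is a toy numerical illustration of the paper's central distinction, under heavily simplified assumptions of our own: an agent may replace its utility function with one that is trivially maximized, and the modification looks attractive or harmless depending on whether the future is evaluated with the current utility or with the modified one.

        # Toy illustration (our own simplification, not the paper's formal model).
        # Two one-step options:
        #   "work":        keep the current utility and earn 0.7 under it
        #   "self-modify": switch to a trivial utility that gives 1.0 for doing nothing
        u_current = {"work": 0.7, "do_nothing": 0.0}
        u_trivial = {"work": 0.0, "do_nothing": 1.0}

        def value(option, anticipate_with_current_utility):
            if option == "work":
                return u_current["work"]
            # after self-modification the future agent simply does nothing
            future_utility = u_current if anticipate_with_current_utility else u_trivial
            return future_utility["do_nothing"]

        for anticipates in (True, False):
            choice = max(("work", "self-modify"), key=lambda o: value(o, anticipates))
            print(f"evaluate future with current utility = {anticipates}: choose {choice!r}")
        # With the current utility the agent keeps working; otherwise it self-modifies.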

    Information theoretic approach to interactive learning

    The principles of statistical mechanics and information theory play an important role in learning and have inspired both theory and the design of numerous machine learning algorithms. The new aspect in this paper is a focus on integrating feedback from the learner. A quantitative approach to interactive learning and adaptive behavior is proposed, integrating model-making and decision-making into one theoretical framework. The paper follows a simple principle: the observer's world model and action policy should result in maximal predictive power at minimal complexity. Classes of optimal action policies and of optimal models are derived from an objective function that reflects this trade-off between prediction and complexity. The resulting optimal models then summarize, at different levels of abstraction, the process's causal organization in the presence of the learner's actions. A fundamental consequence of the proposed principle is that the learner's optimal action policies balance exploration and control as an emergent property. Interestingly, the explorative component is present even in the absence of policy randomness, i.e. in the optimal deterministic behavior. This is a direct result of requiring maximal predictive power in the presence of feedback.
    Comment: 6 pages
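    One concrete way to read the stated trade-off is as a scalar objective of the information-bottleneck type, predictive power minus a multiple of model complexity, each measured as a mutual information; the toy joint distributions, the trade-off parameter, and this particular form are illustrative assumptions, not the paper's derivation.

        import numpy as np

        def mutual_information(joint):
            """I(X;Y) in bits for a strictly positive joint distribution p(x, y)."""
            joint = joint / joint.sum()
            px = joint.sum(axis=1, keepdims=True)
            py = joint.sum(axis=0, keepdims=True)
            return float(np.sum(joint * np.log2(joint / (px * py))))

        # Toy joints: p(model state, next observation) and p(history, model state).
        p_predictive = np.array([[0.40, 0.10],
                                 [0.10, 0.40]])
        p_complexity = np.array([[0.30, 0.20],
                                 [0.20, 0.30]])

        lam = 0.5    # trade-off parameter (illustrative)
        objective = mutual_information(p_predictive) - lam * mutual_information(p_complexity)
        print(f"predictive power - lambda * complexity = {objective:+.3f} bits")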

    Power Law Scaling for a System of Interacting Units with Complex Internal Structure

    We study the dynamics of a system composed of interacting units, each with a complex internal structure comprising many subunits. We consider the case in which each subunit grows in a multiplicative manner. We propose a model for such systems in which the interaction among units is treated in a mean-field approximation and the interaction among subunits is nonlinear. To test the model, we identify a large database spanning 20 years and find that the model correctly predicts a variety of empirical results.
    Comment: 4 pages with 4 PostScript figures (uses RevTeX 3.1, LaTeX2e, multicol.sty, epsf.sty and rotate.sty). Submitted to PR
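    A minimal simulation sketch of the ingredients described, leaving out the paper's mean-field coupling between units: each unit consists of subunits whose sizes are multiplied by independent random factors, and the spread of the unit-level growth rate is examined as a function of unit size. All parameter values are illustrative assumptions.

        import numpy as np

        # Each unit has n subunits; every subunit's size is multiplied by an
        # independent lognormal factor per step.  We measure how the spread of the
        # unit-level growth rate shrinks as units get larger.
        rng = np.random.default_rng(2)
        for n_subunits in (4, 16, 64, 256):
            sizes = rng.lognormal(mean=0.0, sigma=1.0, size=(10_000, n_subunits))
            grown = sizes * rng.lognormal(mean=0.0, sigma=0.2, size=sizes.shape)
            growth_rates = np.log(grown.sum(axis=1) / sizes.sum(axis=1))
            print(f"{n_subunits:4d} subunits: std of growth rate = {growth_rates.std():.4f}")
        # The spread falls roughly as a power of the number of subunits, i.e.
        # sigma(growth) ~ size^(-beta) for some beta > 0 in this toy setting.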