CORE
🇺🇦Â
 make metadata, not war
Services
Services overview
Explore all CORE services
Access to raw data
API
Dataset
FastSync
Content discovery
Recommender
Discovery
OAI identifiers
OAI Resolver
Managing content
Dashboard
Bespoke contracts
Consultancy services
Support us
Support us
Membership
Sponsorship
Community governance
Advisory Board
Board of supporters
Research network
About
About us
Our mission
Team
Blog
FAQs
Contact us
Filters
1 research outputs found
An incremental off-policy search in a model-free Markov decision process using a single sample path
Author
A Antos
ADMS Barreto
+44Â more
AG Barto
Ajin George Joseph
AW Moore
B Wang
BT Polyak
BW Balleine
C Dann
DP Bertsekas
DP Bertsekas
DP Kroese
E Ertin
E Ikonen
EA Feinberg
G Alon
H Yu
HS Chang
I Menache
J Baxter
J Hu
J Hu
J Xue
JC Spall
JN Tsitsiklis
JP O’Doherty
M Sato
M Sato
M Zlochin
MG Lagoudakis
ML Puterman
P Fracasso
P Kumar
PW Glynn
R Rubinstein
RS Sutton
RS Sutton
RS Varga
RY Rubinstein
RY Rubinstein
S Bhatnagar
Shalabh Bhatnagar
SP Singh
SW Lee
VR Konda
VS Borkar
Publication venue
'Springer Science and Business Media LLC'
Publication date
Field of study
No full text
Crossref