5 research outputs found

    The dark side of the board: advances in chess Kriegspiel

    While imperfect information games are an excellent model of real-world problems and tasks, they are often difficult for computer programs to play at a high level of proficiency, especially if they involve major uncertainty and a very large state space. Kriegspiel, a variant of chess that makes it similar to a wargame, is a perfect example: while the game has been studied for decades from a game-theoretical viewpoint, the first practical algorithms for playing it began to appear only very recently. This thesis presents, documents and tests a multi-sided effort towards making a strong Kriegspiel player, using heuristic search, retrograde analysis and Monte Carlo tree search algorithms to achieve increasingly higher levels of play. The resulting program is currently the strongest computer player in the world and plays at an above-average human level.
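    To illustrate the general idea behind sampling-based play under hidden information that the abstract alludes to, the sketch below shows a drastically simplified, flat Monte Carlo scheme with determinization. It is not the thesis's actual Monte Carlo tree search player, and the Game interface (legal_moves, sample_hidden_state, play_out) is a hypothetical assumption used only for illustration.

    ```python
    def monte_carlo_move(game, observation, samples=50, playouts=20):
        """Pick a move by averaging random playouts over sampled hidden states.

        `game` is a hypothetical interface: legal_moves(obs), sample_hidden_state(obs)
        and play_out(state, move) -> numeric result from the player's point of view.
        """
        moves = game.legal_moves(observation)
        scores = {m: 0.0 for m in moves}
        for _ in range(samples):
            # Determinize: guess a full position consistent with what has been
            # observed so far (in Kriegspiel: referee messages, known captures).
            state = game.sample_hidden_state(observation)
            for move in moves:
                # Average the results of random playouts from the guessed position.
                total = sum(game.play_out(state, move) for _ in range(playouts))
                scores[move] += total / playouts
        # The move with the best average outcome across all sampled positions wins.
        return max(scores, key=scores.get)
    ```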

    Temporal Difference Learning in Complex Domains

    This thesis adapts and improves on the TD(λ) methods (Sutton 1988) that were successfully used for backgammon (Tesauro 1994) and applies them to other complex games that are less amenable to simple pattern-matching approaches. The games investigated are chess and shogi, both of which (unlike backgammon) require significant amounts of computational effort to be expended on search in order to achieve expert play. The improved methods are also tested in a non-game domain. In the chess domain, the adapted TD(λ) method is shown to successfully learn the relative values of the pieces, and matches using these learnt piece values indicate that they perform at least as well as the piece values widely quoted in elementary chess books. The adapted TD(λ) method is also shown to work well in shogi, considered by many researchers to be the next challenge for computer game-playing, and for which there is no standardised set of piece values. An original method to automatically set and adjust the major control parameters used by TD(λ) is presented. The main performance advantage comes from the learning rate adjustment, which is based on a new concept called temporal coherence. Experiments in both chess and a random-walk domain show that the temporal coherence algorithm produces both faster learning and more stable values than either human-chosen parameters or an earlier method for learning rate adjustment. The methods presented in this thesis allow programs to learn with as little input of external knowledge as possible, exploring the domain on their own rather than being taught. Further experiments show that the method is capable of handling many hundreds of weights, and that it is not necessary to perform deep searches during the learning phase in order to learn effective weights.
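    As a rough illustration of the two ingredients named in the abstract, the sketch below combines standard linear TD(λ) with a coherence-style per-weight step size, read as "a weight whose successive updates keep agreeing in sign keeps a large learning rate, an oscillating weight gets a small one". This is a minimal assumption-laden sketch, not the thesis's actual implementation; all names and defaults are illustrative.

    ```python
    import numpy as np

    def td_lambda_episode(features, rewards, w, net, abs_acc, lam=0.7, gamma=1.0):
        """One episode of linear TD(lambda) with coherence-scaled step sizes.

        features : list of feature vectors, one per visited state
        rewards  : rewards[t] is the reward received on leaving state t
        w        : weight vector (updated in place)
        net, abs_acc : per-weight accumulators driving the step-size rule
        """
        trace = np.zeros_like(w)
        for t in range(len(features) - 1):
            x, x_next = features[t], features[t + 1]
            # TD error for a linear value function V(s) = w . x
            delta = rewards[t] + gamma * (w @ x_next) - (w @ x)
            # Accumulating eligibility trace
            trace = gamma * lam * trace + x
            update = delta * trace
            # Coherence-style rule: step size ~ |sum of updates| / sum of |updates|,
            # close to 1 when updates agree in sign, close to 0 when they cancel.
            net += update
            abs_acc += np.abs(update)
            alpha = np.where(abs_acc > 0, np.abs(net) / abs_acc, 1.0)
            w += alpha * update
        return w
    ```

    The per-weight ratio falls toward zero as soon as updates start cancelling each other out, which is one plausible way to obtain the faster and more stable learning the abstract reports.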

    Temporal Difference Learning in Complex Domains

    Submitted to the University of London for the Degree of Doctor of Philosophy in Computer Science.

    Tablebase of Contents of the ICCA Journal
