Search CORE

7,120 research outputs found

A Survey of Monte Carlo Tree Search Methods

Author: Browne Cameron B
Colton Simon
Cowling Peter I
Lucas Simon M
Perez Diego
Powley Edward
Rohlfshagen Philipp
Samothrakis Spyridon
Tavener Stephen
Whitehouse Daniel
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2012
Field of study

Monte Carlo tree search (MCTS) is a recently proposed search method that combines the precision of tree search with the generality of random sampling. It has received considerable interest due to its spectacular success in the difficult problem of computer Go, but has also proved beneficial in a range of other domains. This paper is a survey of the literature to date, intended to provide a snapshot of the state of the art after the first five years of MCTS research. We outline the core algorithm's derivation, impart some structure on the many variations and enhancements that have been proposed, and summarize the results from the key game and nongame domains to which MCTS methods have been applied. A number of open research questions indicate that the field is ripe for future work

University of Essex Research Repository

CiteSeerX

Maastricht University Research Portal

Inflexibility of experts – Reality or myth? Quantifying the Einstellung effect in chess masters

Author: Bilalić M
Gobet F
McLeod P
Publication venue: 'Elsevier BV'
Publication date: 01/01/2007
Field of study

How does the knowledge of experts affect their behaviour in situations that require unusual methods of dealing? One possibility, loosely originating in research on creativity and skill acquisition, is that an increase in expertise can lead to inflexibility of thought due to automation of procedures. Yet another possibility, based on expertise research, is that experts’ knowledge leads to flexibility of thought. We tested these two possibilities in a series of experiments using the Einstellung (set) effect paradigm. Chess players tried to solve problems that had both a familiar but non-optimal solution and a better but less familiar one. The more familiar solution induced the Einstellung (set) effect even in experts, preventing them from finding the optimal solution. The presence of the non-optimal solution reduced experts' problem solving ability was reduced to about that of players three standard deviations lower in skill level by the presence of the non-optimal solution. Inflexibility of thought induced by prior knowledge (i.e., the blocking effect of the familiar solution) was shown by experts but the more expert they were, the less prone they were to the effect. Inflexibility of experts is both reality and myth. But the greater the level of expertise, the more of a myth it becomes

Oxford University Research Archive

Brunel University Research Archive

Information and Design: Book Symposium on Luciano Floridi’s The Logic of Information

Author: Bawden David
Dixon Dominic
Floridi Luciano
Furner Jonathan
Gorichanaz Tim
Herold Ken
Ma Lai
Obelitz Søe Sille
Robinson Liz
Van der Veer Martens Betsy
Publication venue: 'Emerald'
Publication date: 01/01/2020
Field of study

Purpose – To review and discuss Luciano Floridi’s 2019 book The Logic of Information: A Theory of Philosophy as Conceptual Design, the latest instalment in his philosophy of information (PI) tetralogy, particularly with respect to its implications for library and information studies (LIS). Design/methodology/approach – Nine scholars with research interests in philosophy and LIS read and responded to the book, raising critical and heuristic questions in the spirit of scholarly dialogue. Floridi responded to these questions. Findings – Floridi’s PI, including this latest publication, is of interest to LIS scholars, and much insight can be gained by exploring this connection. It seems also that LIS has the potential to contribute to PI’s further development in some respects. Research implications – Floridi’s PI work is technical philosophy for which many LIS scholars do not have the training or patience to engage with, yet doing so is rewarding. This suggests a role for translational work between philosophy and LIS. Originality/value – The book symposium format, not yet seen in LIS, provides forum for sustained, multifaceted and generative dialogue around ideas

PhilPapers

City Research Online

Copenhagen University Research Information System

CGAMES'2009

Author
Publication venue: University of Wolverhampton, School of Computing and Information Technology
Publication date: 01/01/2009
Field of study

Wolverhampton Intellectual Repository and E-theses

2007 Writing Contest Winners

Author: State University of New York at Cortland
Publication venue: Digital Commons @ Cortland
Publication date: 01/01/2007
Field of study

The Writing Contest is sponsored by the College Writing Committee and is presented during Transformations conference (formally titled Scholar\u27s Day) at SUNY Cortland.https://digitalcommons.cortland.edu/transformationsprograms/1032/thumbnail.jp

Digital Commons @ Cortland (SUNY College Cortland)

The Exchange: A Novel

Author: Fontaine Peter A
Publication venue: ScholarWorks @ Georgia State University
Publication date: 05/05/2012
Field of study

The Exchange is a fiction novel Xavier Savvy Kowalski, one of the most promising American chess prodigies and rumored up-and-comer for the international fame as a potential challenger for the world chess crown. After he loses the junior world chess championship in Venice, Italy, he retires to Las Vegas, Nevada, where he hopes to start his life over. Savvy\u27s father and the chess world at large conspire against him and he finds himself returning to competitive chess again after three years away. He assembles a new team to train him for a return to the world championship, and he also falls in love with a young prodigy he met during his retirement. Together they travel the United States and Europe as Savvy attempts to win back his reputation as America\u27s premier chess player while encountering various rivals, including his own father. The story culminates with Savvy\u27s final championship game, and with his dad

ScholarWorks @ Georgia State University

Learning to Search in Reinforcement Learning

Author: Antonoglou Ioannis
Publication venue: UCL (University College London)
Publication date: 28/04/2023
Field of study

In this thesis, we investigate the use of search based algorithms with deep neural networks to tackle a wide range of problems ranging from board games to video games and beyond. Drawing inspiration from AlphaGo, the first computer program to achieve superhuman performance in the game of Go, we developed a new algorithm AlphaZero. AlphaZero is a general reinforcement learning algorithm that combines deep neural networks with a Monte Carlo Tree search for planning and learning. Starting completely from scratch, without any prior human knowledge beyond the basic rules of the game, AlphaZero managed to achieve superhuman performance in Go, chess and shogi. Subsequently, building upon the success of AlphaZero, we investigated ways to extend our methods to problems in which the rules are not known or cannot be hand-coded. This line of work led to the development of MuZero, a model-based reinforcement learning agent that builds a deterministic internal model of the world and uses it to construct plans in its imagination. We applied our method to Go, chess, shogi and the classic Atari suite of video-games, achieving superhuman performance. MuZero is the first RL algorithm to master a variety of both canonical challenges for high performance planning and visually complex problems using the same principles. Finally, we describe Stochastic MuZero, a general agent that extends the applicability of MuZero to highly stochastic environments. We show that our method achieves superhuman performance in stochastic domains such as backgammon and the classic game of 2048 while matching the performance of MuZero in deterministic ones like Go

UCL Discovery