7,120 research outputs found
A Survey of Monte Carlo Tree Search Methods
Monte Carlo tree search (MCTS) is a recently proposed search method that combines the precision of tree search with the generality of random sampling. It has received considerable interest due to its spectacular success in the difficult problem of computer Go, but has also proved beneficial in a range of other domains. This paper is a survey of the literature to date, intended to provide a snapshot of the state of the art after the first five years of MCTS research. We outline the core algorithm's derivation, impart some structure on the many variations and enhancements that have been proposed, and summarize the results from the key game and nongame domains to which MCTS methods have been applied. A number of open research questions indicate that the field is ripe for future work
Inflexibility of experts – Reality or myth? Quantifying the Einstellung effect in chess masters
How does the knowledge of experts affect their behaviour in situations that require unusual
methods of dealing? One possibility, loosely originating in research on creativity and skill
acquisition, is that an increase in expertise can lead to inflexibility of thought due to
automation of procedures. Yet another possibility, based on expertise research, is that
experts’ knowledge leads to flexibility of thought. We tested these two possibilities in a series of experiments using the Einstellung (set) effect paradigm. Chess players tried to solve
problems that had both a familiar but non-optimal solution and a better but less familiar one.
The more familiar solution induced the Einstellung (set) effect even in experts, preventing them from finding the optimal solution. The presence of the non-optimal solution reduced experts' problem solving ability was reduced to about that of players three standard deviations lower in skill level by the presence of the non-optimal solution. Inflexibility of thought induced by prior knowledge (i.e., the blocking effect of the familiar solution) was shown by experts but the more expert they were, the less prone they were to the effect. Inflexibility of experts is both reality and myth. But the greater the level of expertise, the more of a myth it becomes
Information and Design: Book Symposium on Luciano Floridi’s The Logic of Information
Purpose – To review and discuss Luciano Floridi’s 2019 book The Logic of Information: A Theory of Philosophy as Conceptual Design, the latest instalment in his philosophy of information (PI) tetralogy, particularly with respect to its implications for library and information studies (LIS).
Design/methodology/approach – Nine scholars with research interests in philosophy and LIS read and responded to the book, raising critical and heuristic questions in the spirit of scholarly dialogue. Floridi responded to these questions.
Findings – Floridi’s PI, including this latest publication, is of interest to LIS scholars, and much insight can be gained by exploring this connection. It seems also that LIS has the potential to contribute to PI’s further development in some respects.
Research implications – Floridi’s PI work is technical philosophy for which many LIS scholars do not have the training or patience to engage with, yet doing so is rewarding. This suggests a role for translational work between philosophy and LIS.
Originality/value – The book symposium format, not yet seen in LIS, provides forum for sustained, multifaceted and generative dialogue around ideas
2007 Writing Contest Winners
The Writing Contest is sponsored by the College Writing Committee and is presented during Transformations conference (formally titled Scholar\u27s Day) at SUNY Cortland.https://digitalcommons.cortland.edu/transformationsprograms/1032/thumbnail.jp
The Exchange: A Novel
The Exchange is a fiction novel Xavier Savvy Kowalski, one of the most promising American chess prodigies and rumored up-and-comer for the international fame as a potential challenger for the world chess crown. After he loses the junior world chess championship in Venice, Italy, he retires to Las Vegas, Nevada, where he hopes to start his life over. Savvy\u27s father and the chess world at large conspire against him and he finds himself returning to competitive chess again after three years away. He assembles a new team to train him for a return to the world championship, and he also falls in love with a young prodigy he met during his retirement. Together they travel the United States and Europe as Savvy attempts to win back his reputation as America\u27s premier chess player while encountering various rivals, including his own father. The story culminates with Savvy\u27s final championship game, and with his dad
Learning to Search in Reinforcement Learning
In this thesis, we investigate the use of search based algorithms with deep neural
networks to tackle a wide range of problems ranging from board games to video
games and beyond. Drawing inspiration from AlphaGo, the first computer program
to achieve superhuman performance in the game of Go, we developed a new algorithm AlphaZero. AlphaZero is a general reinforcement learning algorithm that
combines deep neural networks with a Monte Carlo Tree search for planning and
learning. Starting completely from scratch, without any prior human knowledge
beyond the basic rules of the game, AlphaZero managed to achieve superhuman
performance in Go, chess and shogi. Subsequently, building upon the success of AlphaZero, we investigated ways to extend our methods to problems in which the rules
are not known or cannot be hand-coded. This line of work led to the development
of MuZero, a model-based reinforcement learning agent that builds a deterministic
internal model of the world and uses it to construct plans in its imagination. We
applied our method to Go, chess, shogi and the classic Atari suite of video-games,
achieving superhuman performance. MuZero is the first RL algorithm to master
a variety of both canonical challenges for high performance planning and visually complex problems using the same principles. Finally, we describe Stochastic
MuZero, a general agent that extends the applicability of MuZero to highly stochastic environments. We show that our method achieves superhuman performance in
stochastic domains such as backgammon and the classic game of 2048 while matching the performance of MuZero in deterministic ones like Go
Spartan Daily, January 9, 1973
Volume 60, Issue 59https://scholarworks.sjsu.edu/spartandaily/5694/thumbnail.jp
Spartan Daily, March 24, 1983
Volume 80, Issue 37https://scholarworks.sjsu.edu/spartandaily/7019/thumbnail.jp
- …