Search CORE

132 research outputs found

Counterfactual Regret Minimization を用いたトレーディングカードゲームの戦略計算

Author: 張昊宇
Publication venue: 情報理工学系研究科電子情報学専攻
Publication date: 23/03/2020
Field of study

学位の種別: 修士University of Tokyo(東京大学

Analysis and Optimization of Deep Counterfactual Value Networks

Author: J Nash
M Bowling
T Kanungo
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 12/10/2018
Field of study

Recently a strong poker-playing algorithm called DeepStack was published, which is able to find an approximate Nash equilibrium during gameplay by using heuristic values of future states predicted by deep neural networks. This paper analyzes new ways of encoding the inputs and outputs of DeepStack's deep counterfactual value networks based on traditional abstraction techniques, as well as an unabstracted encoding, which was able to increase the network's accuracy.Comment: Long version of publication appearing at KI 2018: The 41st German Conference on Artificial Intelligence (http://dx.doi.org/10.1007/978-3-030-00111-7_26). Corrected typo in titl

arXiv.org e-Print Archive

Crossref