This paper briefly discusses learning algorithms for two-player stochastic games. The algorithms considered are policy hill-climbing (PHC) and Win or Learn Fast (WoLF). Both were run on selected matrix games against themselves, against each other, and against a random stationary adversary, and the results are discussed.
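For readers unfamiliar with the two algorithms, the following is a minimal sketch of a PHC learner with an optional WoLF variable step size, playing a matrix game against a stationary adversary. It follows the commonly published form of these update rules and is not taken from the paper's own implementation. The class name `WoLFPHC`, the helper `play_vs_stationary`, all parameter values, and the simplification to a single state (so no discounted next-state term) are assumptions made for illustration.

```python
import random

# Row player's payoffs in matching pennies: +1 on a match, -1 otherwise.
MATCHING_PENNIES = [[1.0, -1.0], [-1.0, 1.0]]


class WoLFPHC:
    """Stateless PHC learner; wolf=True enables the WoLF variable step size.

    All default parameter values are illustrative assumptions.
    """

    def __init__(self, n_actions, alpha=0.1, delta_win=0.01, delta_lose=0.04,
                 epsilon=0.05, wolf=True, rng=None):
        self.n = n_actions
        self.alpha = alpha            # Q-value learning rate
        self.delta_win = delta_win    # policy step when "winning" (also plain PHC step)
        self.delta_lose = delta_lose  # larger policy step when "losing"
        self.epsilon = epsilon        # exploration probability
        self.wolf = wolf
        self.rng = rng or random.Random()
        self.q = [0.0] * n_actions
        self.pi = [1.0 / n_actions] * n_actions      # current mixed policy
        self.avg_pi = [1.0 / n_actions] * n_actions  # running average policy
        self.count = 0

    def act(self):
        # Explore uniformly with probability epsilon, else sample from pi.
        if self.rng.random() < self.epsilon:
            return self.rng.randrange(self.n)
        return self.rng.choices(range(self.n), weights=self.pi)[0]

    def update(self, action, reward):
        # Q-learning update for a single-state game (no next-state term).
        self.q[action] += self.alpha * (reward - self.q[action])

        # Update the running average of the policy.
        self.count += 1
        for a in range(self.n):
            self.avg_pi[a] += (self.pi[a] - self.avg_pi[a]) / self.count

        # WoLF: learn slowly when the current policy does better than the
        # average policy under the current Q-values, quickly otherwise.
        if self.wolf:
            v_pi = sum(p * q for p, q in zip(self.pi, self.q))
            v_avg = sum(p * q for p, q in zip(self.avg_pi, self.q))
            delta = self.delta_win if v_pi > v_avg else self.delta_lose
        else:
            delta = self.delta_win

        # Hill-climb: shift probability mass toward the greedy action,
        # clipping each shift so pi stays a valid distribution.
        best = max(range(self.n), key=lambda a: self.q[a])
        for a in range(self.n):
            if a != best:
                step = min(self.pi[a], delta / (self.n - 1))
                self.pi[a] -= step
                self.pi[best] += step


def play_vs_stationary(agent, payoff, opp_policy, steps, rng):
    """Run the agent as row player against a fixed mixed strategy."""
    for _ in range(steps):
        a = agent.act()
        b = rng.choices(range(len(opp_policy)), weights=opp_policy)[0]
        agent.update(a, payoff[a][b])
    return agent.pi


if __name__ == "__main__":
    rng = random.Random(0)
    agent = WoLFPHC(2, rng=rng)
    print(play_vs_stationary(agent, MATCHING_PENNIES, [0.5, 0.5], 5000, rng))
```

Setting `wolf=False` reduces the learner to plain PHC with a single fixed step size, which makes it easy to compare the two on the same game. Self-play or PHC-versus-WoLF runs would need a second learner in place of the fixed `opp_policy`.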