Search CORE

8 research outputs found

Reaching Pareto Optimality in Prisoner’s Dilemma Using Conditional Joint Action Learning

Author: Dipyaman Banerjee
Sandip Sen
Publication venue
Publication date
Field of study

We consider a repeated Prisoner’s Dilemma game where two independent learning agents play against each other. We assume that the players can observe each others’ action but are oblivious to the payoff received by the other player. Multiagent learning literature has provided mechanisms that allow agents to converge to Nash Equilibrium. In this paper we define a special class of learner called a conditional joint action learner (CJAL) who attempts to learn the conditional probability of an action taken by the other given its own action and uses it to decide its next course of action. We prove that when played against itself, if the payoff structure of Prisoner’s Dilemma game satisfies certain conditions, using a limited exploration technique these agents can actually learn to converge to the Pareto optimal solution that dominates the Nash Equilibrium, while maintaining individual rationality. We analytically derive the conditions for which such a phenomenon can occur and have shown experimental results to support our claim

CiteSeerX

General Terms Experimentation, Performance

Author: Dipyaman Banerjee
Ip Sen
Sabyasachi Saha
Publication venue
Publication date
Field of study

CiteSeerX

MnO doped SnO2 nanocatalysts: Activation of wide band gap semiconducting nanomaterials towards visible light induced photoelectrocatalytic water oxidation

Author: Alexander
Banerjee
Banerjee
Banerjee
Barman
Barman
Biesinger
Chemelewski
Chen
DeKrafft
Dipyaman Mohanta
Garain
Hoang
Horikawa
Koushik Barman
Kudo
Lee
Li
Li
Liew
Liu
Maeda
Marshall
Md. Ahmaruzzaman
Mohanta
Moir
Nesbitt
Park
Patel
Rakhshani
Seaton
Sk. Jasimuddin
Srinivasan
Sudhagar
Turner
Ungar
Wang
Yagi
Yin
Zhang
Zhang
Zhang
Zhao
Zong
Publication venue: 'Elsevier BV'
Publication date
Field of study

Crossref