High efficiency RL agent
Model-free algorithms now achieve state-of-the-art performance on many RL
problems, but their low data efficiency limits their use. We combine
model-based RL, the soft actor-critic (SAC) framework, and curiosity to propose
an agent called RMC, which offers a promising way to achieve good performance
while maintaining data efficiency. We surpass the performance of SAC and
achieve state-of-the-art performance in both efficiency and stability. RMC can
also solve POMDP problems and generalizes well from MDPs to POMDPs.
Comment: arXiv admin note: text overlap with arXiv:1812.05905,
arXiv:1801.01290, arXiv:1509.03044 by other authors. arXiv admin note:
substantial text overlap with arXiv:1812.05905, arXiv:1801.01290 by other
authors; text overlap with arXiv:1507.06527 by other authors