    High efficiency RL agent

    Model-free algorithms now achieve state-of-the-art performance on many RL problems, but their low data efficiency limits their use. We combine model-based RL, the soft actor-critic (SAC) framework, and curiosity, and propose an agent called RMC, offering a promising way to achieve good performance while maintaining data efficiency. We surpass the performance of SAC and achieve state-of-the-art performance in both efficiency and stability. RMC can also solve POMDP problems and generalizes well from MDPs to POMDPs. Comment: arXiv admin note: text overlap with arXiv:1812.05905, arXiv:1801.01290, arXiv:1509.03044 by other authors. arXiv admin note: substantial text overlap with arXiv:1812.05905, arXiv:1801.01290 by other authors; text overlap with arXiv:1507.06527 by other authors
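
The abstract does not say how RMC couples its learned model with curiosity. One common approach, shown here only as a sketch and not as the authors' method, uses the prediction error of a dynamics model as an intrinsic reward added to the environment reward. The function names and the weight `beta` are hypothetical, chosen for illustration.

```python
import numpy as np


def curiosity_bonus(predicted_next_state, actual_next_state):
    """Intrinsic reward: half the squared prediction error of a dynamics model.

    Assumption: both states are flat numeric vectors of equal length.
    """
    predicted = np.asarray(predicted_next_state, dtype=float)
    actual = np.asarray(actual_next_state, dtype=float)
    diff = predicted - actual
    return 0.5 * float(np.sum(diff ** 2))


def shaped_reward(extrinsic_reward, predicted_next_state, actual_next_state, beta=0.1):
    """Environment reward plus a beta-weighted curiosity bonus.

    `beta` is a hypothetical trade-off weight, not a value from the paper.
    """
    bonus = curiosity_bonus(predicted_next_state, actual_next_state)
    return extrinsic_reward + beta * bonus
```

Under this scheme, transitions the model predicts poorly earn a larger bonus, which pushes an actor-critic learner such as SAC toward states where the model is still inaccurate.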