    Exploiting Multiple Secondary Reinforcers in Policy Gradient Reinforcement Learning

    Most formulations of Reinforcement Learning depend on a single reinforcement reward value to guide the search for the optimal policy solution.