Simple systems where the quasimetric and the dynamic programming methods are not equivalent.

Abstract

<p>(A) Example of non deterministic systems where the quasi-distance differs from the value function.Arrows indicate possible actions with their associated transition probabilities and costs. Dotted arrow represents action and dashed arrows action , both allowed in state . (B) Example with a prison state . Starting from to the goal we can choose between two actions. Action in dotted leads to with a low cost but then with the risk to fall from to with a probability . is a risky state. Action in dashed leads to the goal with a probability but with a high cost .</p

    Similar works

    Full text

    thumbnail-image

    Available Versions