3 research outputs found

    Flexible policy construction by information refinement

    No full text
    Decision making under uncertainty addresses the problem of deciding which actions to take in the world, when there is uncertainty about the state of the world, and uncertainty as to the outcome of these actions. A rational approach to making good choices is the principle of maximum expected utility: the decision maker should act so as to maximize the expected benefits of the possible outcomes. The "textbook" approaches to decision analysis typically make the assumption that the computational costs involved are negligible. This assumption is not always appropriate. When computational costs cannot be ignored, a decision maker must be able to choose a trade-off between computational costs and object value. This thesis proposes an approach to decision making called information refinement. It is an iterative, heuristic process which a decision maker can use to build a policy. We present three algorithms which use information refinement to construct policies for decision problems expressed as influence diagrams. The algorithms are intended for situations in which computational costs are not negligible, and are designed to give the decision maker control of the trade-off involved in the decision making process. The first algorithm is an anytime algorithm for single stage decision problems. It constructs a policy by increasing the use of information available to the decision maker. The second algorithm applies the single stage algorithm to multistage decision problems using a fixed allocation of computational resources. The third algorithm is an anytime algorithm for multi-stage decision problems. We show empirically that these algorithms are able to make decisions with high expected value with small computational costs. We provide empirically evidence for our claims, by applying our algorithms to a large number of large decision problems.Science, Faculty ofComputer Science, Department ofGraduat

    Flexible policy construction by information refinement

    No full text
    We report on work towards exible algorithms for solving decision problems represented as in uence diagrams. An algorithm is given to construct a tree structure for each decision node in an in uence diagram. Each tree represents a decision function and is constructed incrementally. The decision maker can balance the cost of computing the next incremental improvement to a tree against the expected value of the improvement. The improvements to the tree converge to the optimal decision function computed by dynamic programming techniques, and the asymptotic behaviour is only a constant factor worse than dynamic programming techniques, counting the number of Bayesian network queries. Empirical results show that utility varies with the size of the tree and the numberofBayesian net calculations
    corecore