48,529 research outputs found

    Consistency and optimality

    Get PDF
    Assume that the problem Qo is not solvable in polynomial time. For theories T containing a sufficiently rich part of true arithmetic we characterize T U {ConT} as the minimal extension of T proving for some algorithm that it decides Qo as fast as any algorithm B with the property that T proves that B decides Qo. Here, ConT claims the consistency of T. Moreover, we characterize problems with an optimal algorithm in terms of arithmetical theories

    Quasi-optimal nonconforming methods for symmetric elliptic problems. I -- Abstract theory

    Get PDF
    We consider nonconforming methods for symmetric elliptic problems and characterize their quasi-optimality in terms of suitable notions of stability and consistency. The quasi-optimality constant is determined and the possible impact of nonconformity on its size is quantified by means of two alternative consistency measures. Identifying the structure of quasi-optimal methods, we show that their construction reduces to the choice of suitable linear operators mapping discrete functions to conforming ones. Such smoothing operators are devised in the forthcoming parts of this work for various finite element spaces

    Increasing the Action Gap: New Operators for Reinforcement Learning

    Full text link
    This paper introduces new optimality-preserving operators on Q-functions. We first describe an operator for tabular representations, the consistent Bellman operator, which incorporates a notion of local policy consistency. We show that this local consistency leads to an increase in the action gap at each state; increasing this gap, we argue, mitigates the undesirable effects of approximation and estimation errors on the induced greedy policies. This operator can also be applied to discretized continuous space and time problems, and we provide empirical results evidencing superior performance in this context. Extending the idea of a locally consistent operator, we then derive sufficient conditions for an operator to preserve optimality, leading to a family of operators which includes our consistent Bellman operator. As corollaries we provide a proof of optimality for Baird's advantage learning algorithm and derive other gap-increasing operators with interesting properties. We conclude with an empirical study on 60 Atari 2600 games illustrating the strong potential of these new operators

    A fully discrete framework for the adaptive solution of inverse problems

    Get PDF
    We investigate and contrast the differences between the discretize-then-differentiate and differentiate-then-discretize approaches to the numerical solution of parameter estimation problems. The former approach is attractive in practice due to the use of automatic differentiation for the generation of the dual and optimality equations in the first-order KKT system. The latter strategy is more versatile, in that it allows one to formulate efficient mesh-independent algorithms over suitably chosen function spaces. However, it is significantly more difficult to implement, since automatic code generation is no longer an option. The starting point is a classical elliptic inverse problem. An a priori error analysis for the discrete optimality equation shows consistency and stability are not inherited automatically from the primal discretization. Similar to the concept of dual consistency, We introduce the concept of optimality consistency. However, the convergence properties can be restored through suitable consistent modifications of the target functional. Numerical tests confirm the theoretical convergence order for the optimal solution. We then derive a posteriori error estimates for the infinite dimensional optimal solution error, through a suitably chosen error functional. This estimates are constructed using second order derivative information for the target functional. For computational efficiency, the Hessian is replaced by a low order BFGS approximation. The efficiency of the error estimator is confirmed by a numerical experiment with multigrid optimization

    Characterizations of solutions for Bayesian games

    Get PDF
    Bayesian equilibria are characterized by means of consistency and one-person rationality in combination with non-emptiness or converse consistency. Moreover, strong and coalition-proof Bayesian equilibria of extended Bayesian games are introduced and it is seen that these notions can be characterized by means of consistency, one-person rationality, a version of Pareto optimality and a modification of converse consistency. It is shown that, in case of the strong Bayesian equilibrium correspondence, converse consistency can be replaced by non-emptiness. As examples we treat Bayesian potential games and Bayesian congestion games.Game Theory;Bayesian Games;game theory

    Nonrepresentative representative consumers

    Get PDF
    Representative consumers can be very Pareto inconsistent. We describe a cornmunity, with equal income distribution, where all consumers require 56 % higher aggregate income than the representative consumer requires in order to be compensated for the doubling of a price. Such large inconsistencies are ruled out if the representative consumer is homothetic, or if the consumers' income shares are fixed and all goods are normal. We show that optimality of the income distribution rule is not necessary for Pareto consistency of the representative consumer, and we give a weaker sufficient condition for Pareto consistency in cornmunities with two goods and two consumers
    corecore