6,831 research outputs found

General Stopping Behaviors of Naive and Non-Committed Sophisticated Agents, with Application to Probability Distortion

Author: Huang Yu-Jui
Nguyen-Huu Adrien
Zhou Xun Yu
Publication date: 01/01/2019

We consider the problem of stopping a diffusion process with a payoff functional that renders the problem time-inconsistent. We study stopping decisions of naive agents who reoptimize continuously in time, as well as equilibrium strategies of sophisticated agents who anticipate but lack control over their future selves' behaviors. When the state process is one-dimensional and the payoff functional satisfies some regularity conditions, we prove that any equilibrium can be obtained as a fixed point of an operator. This operator represents strategic reasoning that takes the future selves' behaviors into account. We then apply the general results to the case when the agents distort probability and the diffusion process is a geometric Brownian motion. The problem is inherently time-inconsistent as the level of distortion of the same event changes over time. We show how the strategic reasoning may turn a naive agent into a sophisticated one. Moreover, we derive stopping strategies of the two types of agents for various parameter specifications of the problem, illustrating rich behaviors beyond the extreme ones such as "never-stopping" or "never-starting".

arXiv.org e-Print Archive

HAL Descartes

Abstraction in decision-makers with limited information processing capabilities

Author: Braun Daniel A.
Genewein Tim
Publication date: 01/12/2013

A distinctive property of human and animal intelligence is the ability to form abstractions by neglecting irrelevant information, which allows structure to be separated from noise. From an information-theoretic point of view, abstractions are desirable because they allow for very efficient information processing. In artificial systems, abstractions are often implemented through computationally costly formations of groups or clusters. In this work we establish the relation between the free-energy framework for decision making and rate-distortion theory and demonstrate how the application of rate-distortion for decision-making leads to the emergence of abstractions. We argue that abstractions are induced due to a limit in information processing capacity. Comment: Presented at the NIPS 2013 Workshop on Planning with Information Constraint

arXiv.org e-Print Archive

MPG.PuRe

Autonomous Exploration over Continuous Domains

Author: Francis Gilad
Publication venue: Faculty of Engineering and Information Technologies, School of Computer Science
Publication date: 13/03/2018

Motion planning is an essential aspect of robot autonomy, and as such it has been studied for decades, producing a wide range of planning methodologies. Path planners are generally categorised as either trajectory optimisers or sampling-based planners. The latter is the predominant planning paradigm as it can resolve a path efficiently while explicitly reasoning about path safety. Yet, with a limited budget, the resulting paths are far from optimal. In contrast, state-of-the-art trajectory optimisers explicitly trade off path safety against efficiency to produce locally optimal paths. However, these planners cannot incorporate updates from a partially observed model such as an occupancy map, and they fail to plan around information gaps caused by incomplete sensor coverage. Autonomous exploration adds another twist to path planning. The objective of exploration is to safely and efficiently traverse an unknown environment in order to map it. The desired output of such a process is a sequence of paths that efficiently and safely minimise the uncertainty of the map. However, optimising over the entire space of trajectories is computationally intractable. Therefore, most exploration algorithms relax the general formulation by optimising a simpler one, for example finding the single next best view, resulting in suboptimal performance. This thesis investigates methodologies for optimal and safe exploration over continuous paths. Contrary to existing exploration algorithms that break exploration into independent sub-problems of finding goal points and planning safe paths to these points, our holistic approach simultaneously optimises the coupled problems of where and how to explore, thus offering a shift in paradigm from next best view to next best path. With exploration defined as an optimisation problem over continuous paths, this thesis explores two different optimisation paradigms: Bayesian and functional

Sydney eScholarship

An Entropy Search Portfolio for Bayesian Optimization

Author: Bouchard-Côté Alexandre
de Freitas Nando
Hoffman Matthew W.
Shahriari Bobak
Wang Ziyu
Publication date: 01/01/2014

Bayesian optimization is a sample-efficient method for black-box global optimization. However, the performance of a Bayesian optimization method very much depends on its exploration strategy, i.e. the choice of acquisition function, and it is not clear a priori which choice will result in superior performance. While portfolio methods provide an effective, principled way of combining a collection of acquisition functions, they are often based on measures of past performance which can be misleading. To address this issue, we introduce the Entropy Search Portfolio (ESP): a novel approach to portfolio construction which is motivated by information-theoretic considerations. We show that ESP outperforms existing portfolio methods on several real and synthetic problems, including geostatistical datasets and simulated control tasks. We not only show that ESP is able to offer performance as good as the best, but unknown, acquisition function, but surprisingly it often gives better performance. Finally, over a wide range of conditions we find that ESP is robust to the inclusion of poor acquisition functions. Comment: 10 pages, 5 figures

arXiv.org e-Print Archive

CiteSeerX

Oxford University Research Archive

Assessing the Value of Time Travel Savings – A Feasibility Study on Humberside.

Author: Gunn H.F.
Mackie P.J.
Ortuzar J. de D.
Publication venue: Institute of Transport Studies, University of Leeds
Publication date: 01/01/1980

It is expected that the opening of the Humber Bridge will cause major changes to travel patterns around Humberside. Given the level of tolls as currently stated, many travellers will face decisions involving a trade-off between travel time, money outlay on tolls or fares and money outlay on private vehicle running costs, whether in the context of destination choice, mode choice or route choice. This report sets out the conclusions of a preliminary study of the feasibility of inferring values of travel time savings from observations made on the outcomes of these decisions. Methods based on aggregate data of destination choice are found to be inefficient; a disaggregate mode choice study is recommended, subject to caveats on sample size.

White Rose Research Online