398 research outputs found

    Heuristics for dynamic and stochastic routing in industrial shipping

    Get PDF
    Maritime transportation plays a central role in international trade, being responsible for the majority of long-distance shipments in terms of volume. One of the key aspects in the planning of maritime transportation systems is the routing of ships. While static and deterministic vehicle routing problems have been extensively studied in the last decades and can now be solved effectively with metaheuristics, many industrial applications are both dynamic and stochastic. In this spirit, this paper addresses a dynamic and stochastic maritime transportation problem arising in industrial shipping. Three heuristics adapted to this problem are considered and their performance in minimizing transportation costs is assessed. Extensive computational experiments show that the use of stochastic information within the proposed solution methods yields average cost savings of 2.5% on a set of realistic test instances

    Dynamic commitment and imperfect policy rules

    Get PDF
    Considering the dynamics of commitment highlights, some neglected features of time inconsistency problems. We modify the standard rules-versus-discretion question in three ways: (1) A government that does not commit today retains the option to do so tomorrow, (2) the government's commitment capability is restricted to a class of simple rules, and (3) the government's ability to make irrevocable commitments is restricted. Three results stand out. First, the option to wait makes the incumbent regime (rules or discretion) relatively more attractive. Second, the option to wait means that increased uncertainly makes the incumbent regime more attractive. Third, because the commitment decision takes place in 'real time,' policy choice displays hysteresis.Monetary policy

    Dynamic commitment and imperfect policy rules

    Get PDF
    An examination of the dynamics of commitment, showing that because the decision regarding rules versus discretion occurs in real time, opting for discretion is often the better choice, since it leaves open the possibility of adopting rules later on.Monetary policy

    Information gathering control via exploratory path planning

    Full text link

    Multi-vehicle refill scheduling with queueing

    Full text link
    © 2017 We consider the problem of refill scheduling for a team of vehicles or robots that must contend for access to a single physical location for refilling. The objective is to minimise time spent in travelling to/from the refill station, and also time lost to queuing (waiting for access). In this paper, we present principled results for this problem in the context of agricultural operations. We first establish that the problem is NP-hard and prove that the maximum number of vehicles that can usefully work together is bounded. We then focus on the design of practical algorithms and present two solutions. The first is an exact algorithm based on dynamic programming that is suitable for small problem instances. The second is an approximate anytime algorithm based on the branch and bound approach that is suitable for large problem instances with many robots. We present simulated results of our algorithms for three classes of agricultural work that cover a range of operations: spot spraying, broadcast spraying and slurry application. We show that the algorithm is reasonably robust to inaccurate prediction of resource utilisation rate, which is difficult to estimate in cases such as spot application of herbicide for weed control, and validate its performance in simulation using realistic scenarios with up to 30 robots

    Model-based Reinforcement Learning with Parametrized Physical Models and Optimism-Driven Exploration

    Full text link
    In this paper, we present a robotic model-based reinforcement learning method that combines ideas from model identification and model predictive control. We use a feature-based representation of the dynamics that allows the dynamics model to be fitted with a simple least squares procedure, and the features are identified from a high-level specification of the robot's morphology, consisting of the number and connectivity structure of its links. Model predictive control is then used to choose the actions under an optimistic model of the dynamics, which produces an efficient and goal-directed exploration strategy. We present real time experimental results on standard benchmark problems involving the pendulum, cartpole, and double pendulum systems. Experiments indicate that our method is able to learn a range of benchmark tasks substantially faster than the previous best methods. To evaluate our approach on a realistic robotic control task, we also demonstrate real time control of a simulated 7 degree of freedom arm.Comment: 8 page
    • …
    corecore