Search CORE

59 research outputs found

Linear Programming for Large-Scale Markov Decision Problems

Author: Abbasi-Yadkori Yasin
Bartlett Peter L.
Malek Alan
Publication venue
Publication date: 01/01/2014
Field of study

We consider the problem of controlling a Markov decision process (MDP) with a large state space, so as to minimize average cost. Since it is intractable to compete with the optimal policy for large scale problems, we pursue the more modest goal of competing with a low-dimensional family of policies. We use the dual linear programming formulation of the MDP average cost problem, in which the variable is a stationary distribution over state-action pairs, and we consider a neighborhood of a low-dimensional subset of the set of stationary distributions (defined in terms of state-action features) as the comparison class. We propose two techniques, one based on stochastic convex optimization, and one based on constraint sampling. In both cases, we give bounds that show that the performance of our algorithms approaches the best achievable by any policy in the comparison class. Most importantly, these results depend on the size of the comparison class, but not on the size of the state space. Preliminary experiments show the effectiveness of the proposed algorithms in a queuing application.Comment: 27 pages, 3 figure

arXiv.org e-Print Archive

CiteSeerX

Queensland University of Technology ePrints Archive

Rail Infrastructure Manager Problem: Analyzing Capacity Pricing and Allocation in Shared Railway System

Author: Pena-Alcaraz Maite
Ramos Andres
Sussman Joseph M.
Webster Mort D.
Publication venue: Massachusetts Institute of Technology. Engineering Systems Division
Publication date: 01/03/2014
Field of study

This paper proposes a train timetabling model for shared railway systems. The model is formulated as a mixed integer linear programming problem and solved both using commercial software and a novel algorithm based on approximate dynamic programming. The results of the train timetabling model can be used to simulate and evaluate the behavior of the infrastructure manager in shared railway systems under different capacity pricing and allocation mechanisms. This would allow regulators and decision makers to identify the implications of these mechanisms for different stakeholders considering the specific characteristics of the system

DSpace@MIT

Random projections for linear programming

Author: Liberti Leo
Poirion Pierre-Louis
Vu Ky
Publication venue
Publication date: 08/06/2017
Field of study

Random projections are random linear maps, sampled from appropriate distributions, that approx- imately preserve certain geometrical invariants so that the approximation improves as the dimension of the space grows. The well-known Johnson-Lindenstrauss lemma states that there are random ma- trices with surprisingly few rows that approximately preserve pairwise Euclidean distances among a set of points. This is commonly used to speed up algorithms based on Euclidean distances. We prove that these matrices also preserve other quantities, such as the distance to a cone. We exploit this result to devise a probabilistic algorithm to solve linear programs approximately. We show that this algorithm can approximately solve very large randomly generated LP instances. We also showcase its application to an error correction coding problem.Comment: 26 pages, 1 figur

arXiv.org e-Print Archive

Crossref

HAL-Polytechnique