340 research outputs found

    International Conference on Continuous Optimization (ICCOPT) 2019 Conference Book

    Get PDF
    The Sixth International Conference on Continuous Optimization took place on the campus of the Technical University of Berlin, August 3-8, 2019. The ICCOPT is a flagship conference of the Mathematical Optimization Society (MOS), organized every three years. ICCOPT 2019 was hosted by the Weierstrass Institute for Applied Analysis and Stochastics (WIAS) Berlin. It included a Summer School and a Conference with a series of plenary and semi-plenary talks, organized and contributed sessions, and poster sessions. This book comprises the full conference program. It contains, in particular, the scientific program in survey style as well as with all details, and information on the social program, the venue, special meetings, and more

    MARKOV DECISION PROCESS MODELS FOR IMPROVING EQUITY IN LIVER ALLOCATION

    Get PDF
    In the United States, end-stage liver disease (ESLD) patients are prioritized primarily by their Model for End-stage Liver Disease score (MELD) to receive organ offers. Therefore, patients are required to update their MELD score at predefined frequencies that depend on the patient's last reported score. One aim of this dissertation is to mitigate inequities that stem from patients' flexibility regarding MELD score updates. We develop a Markov decision process (MDP) model to examine the degree to which an individual patient can benefit from the updating flexibility, and provide a menu of updating requirements that balance inequity and data processing more efficiently than the current updating requirements. We also derive sufficient conditions under which a structured optimal updating policy exists. As the coordinator of the harvesting Organ Procurement Organization (OPO) extends offers according to MELD score prioritization, the organ becomes less desirable. To avoid not placing the organ, the OPO coordinator can initiate an expedited placement, i.e., offer the organ to a transplant center, which can then allocate it to any of its patients. A second aim of this dissertation is to mitigate inequities induced by the OPO coordinator's premature departure from the prioritized list of patients via an expedited placement. As a preliminary step to studying the inequity induced by expedited liver placement, we conduct an extensive analysis of the current expedited liver placement practice based on recent data. We investigate different aspects of extending offers, e.g., the number of offers extended concurrently, and patients' response characteristics. Several of the results from this analysis serve as inputs for a second MDP model that examines how many concurrent offers the OPO coordinator should extend and when the coordinator should initiate an expedited placement. Numerical experimentation reveals a structured optimal policy, and we test the sensitivity of the model outcomes with respect to changes in model inputs. Lastly, we examine how our model outputs compare to the analogous measures observed in current practice and how they can be used to improve current practice

    Doubly-Asynchronous Value Iteration: Making Value Iteration Asynchronous in Actions

    Full text link
    Value iteration (VI) is a foundational dynamic programming method, important for learning and planning in optimal control and reinforcement learning. VI proceeds in batches, where the update to the value of each state must be completed before the next batch of updates can begin. Completing a single batch is prohibitively expensive if the state space is large, rendering VI impractical for many applications. Asynchronous VI helps to address the large state space problem by updating one state at a time, in-place and in an arbitrary order. However, Asynchronous VI still requires a maximization over the entire action space, making it impractical for domains with large action space. To address this issue, we propose doubly-asynchronous value iteration (DAVI), a new algorithm that generalizes the idea of asynchrony from states to states and actions. More concretely, DAVI maximizes over a sampled subset of actions that can be of any user-defined size. This simple approach of using sampling to reduce computation maintains similarly appealing theoretical properties to VI without the need to wait for a full sweep through the entire action space in each update. In this paper, we show DAVI converges to the optimal value function with probability one, converges at a near-geometric rate with probability 1-delta, and returns a near-optimal policy in computation time that nearly matches a previously established bound for VI. We also empirically demonstrate DAVI's effectiveness in several experiments
    corecore