Search CORE

27,538 research outputs found

Towards Robust Deep Reinforcement Learning for Traffic Signal Control: Demand Surges, Incidents and Sensor Failures

Author: Azevedo Carlos Lima
Rodrigues Filipe
Publication venue
Publication date: 01/01/2019
Field of study

Reinforcement learning (RL) constitutes a promising solution for alleviating the problem of traffic congestion. In particular, deep RL algorithms have been shown to produce adaptive traffic signal controllers that outperform conventional systems. However, in order to be reliable in highly dynamic urban areas, such controllers need to be robust with the respect to a series of exogenous sources of uncertainty. In this paper, we develop an open-source callback-based framework for promoting the flexible evaluation of different deep RL configurations under a traffic simulation environment. With this framework, we investigate how deep RL-based adaptive traffic controllers perform under different scenarios, namely under demand surges caused by special events, capacity reductions from incidents and sensor failures. We extract several key insights for the development of robust deep RL algorithms for traffic control and propose concrete designs to mitigate the impact of the considered exogenous uncertainties.Comment: 8 page

arXiv.org e-Print Archive

Perspectives on Bayesian Optimization for HCI

Author: Larsen Jan
Nielsen Jens Brehm
Sand Jensen Bjørn
Publication venue
Publication date: 01/01/2015
Field of study

In this position paper we discuss optimization in the HCI domain based on our experiences with Bayesian methods for modeling and optimization of audio systems, including challenges related to evaluating, designing, and optimizing such interfaces. We outline and demonstrate how a combined Bayesian modeling and optimization approach provides a flexible framework for integrating various user and content attributes, while also supporting model-based optimization of HCI systems. Finally, we discuss current and future research direction and applications, such as inferring user needs and optimizing interfaces for computer assisted teaching

Enlighten

Reliability-based economic model predictive control for generalized flow-based networks including actuators' health-aware capabilities

Author: Grosso Pérez Juan Manuel
Ocampo-Martínez Carlos
Puig Cayuela Vicenç
Publication venue: 'Walter de Gruyter GmbH'
Publication date: 01/01/2016
Field of study

This paper proposes a reliability-based economic model predictive control (MPC) strategy for the management of generalized flow-based networks, integrating some ideas on network service reliability, dynamic safety stock planning, and degradation of equipment health. The proposed strategy is based on a single-layer economic optimisation problem with dynamic constraints, which includes two enhancements with respect to existing approaches. The first enhancement considers chance-constraint programming to compute an optimal inventory replenishment policy based on a desired risk acceptability level, leading to dynamically allocate safety stocks in flow-based networks to satisfy non-stationary flow demands. The second enhancement computes a smart distribution of the control effort and maximises actuators’ availability by estimating their degradation and reliability. The proposed approach is illustrated with an application of water transport networks using the Barcelona network as the considered case study.Peer ReviewedPostprint (author's final draft

Food supply chain network robustness : a literature review and research agenda

Author: Hendrix E.M.T.
Vlajic J.V.
Vorst J.G.A.J., van der
Publication venue: Wageningen University
Publication date
Field of study

Today’s business environment is characterized by challenges of strong global competition where companies tend to achieve leanness and maximum responsiveness. However, lean supply chain networks (SCNs) become more vulnerable to all kind of disruptions. Food SCNs have to become robust, i.e. they should be able to continue to function in the event of disruption as well as in normal business environment. Current literature provides no explicit clarification related to robustness issue in food SCN context. This paper explores the meaning of SCN robustness and highlights further research direction