Towards Robust Deep Reinforcement Learning for Traffic Signal Control: Demand Surges, Incidents and Sensor Failures
Reinforcement learning (RL) constitutes a promising solution for alleviating
the problem of traffic congestion. In particular, deep RL algorithms have been
shown to produce adaptive traffic signal controllers that outperform
conventional systems. However, in order to be reliable in highly dynamic urban
areas, such controllers need to be robust with respect to a series of
exogenous sources of uncertainty. In this paper, we develop an open-source
callback-based framework for promoting the flexible evaluation of different
deep RL configurations under a traffic simulation environment. With this
framework, we investigate how deep RL-based adaptive traffic controllers
perform under different scenarios, namely under demand surges caused by special
events, capacity reductions from incidents and sensor failures. We extract
several key insights for the development of robust deep RL algorithms for
traffic control and propose concrete designs to mitigate the impact of the
considered exogenous uncertainties.
Comment: 8 page
Perspectives on Bayesian Optimization for HCI
In this position paper we discuss optimization in the HCI
domain based on our experiences with Bayesian methods
for modeling and optimization of audio systems, including
challenges related to evaluating, designing, and optimizing
such interfaces. We outline and demonstrate how a
combined Bayesian modeling and optimization approach
provides a flexible framework for integrating various user
and content attributes, while also supporting model-based
optimization of HCI systems. Finally, we discuss current
and future research directions and applications, such as
inferring user needs and optimizing interfaces for
computer-assisted teaching.
Reliability-based economic model predictive control for generalized flow-based networks including actuators' health-aware capabilities
This paper proposes a reliability-based economic model predictive control (MPC) strategy for the management of generalized flow-based networks, integrating ideas on network service reliability, dynamic safety stock planning, and degradation of equipment health. The proposed strategy is based on a single-layer economic optimisation problem with dynamic constraints, which includes two enhancements with respect to existing approaches. The first enhancement uses chance-constrained programming to compute an optimal inventory replenishment policy based on a desired risk acceptability level, so that safety stocks are allocated dynamically in flow-based networks to satisfy non-stationary flow demands. The second enhancement computes a smart distribution of the control effort and maximises actuators’ availability by estimating their degradation and reliability. The proposed approach is illustrated with an application to water transport networks, using the Barcelona network as the case study.
Peer Reviewed. Postprint (author's final draft)
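The link between a risk acceptability level and a safety stock in the abstract above can be illustrated with a minimal sketch. It is not the paper's formulation: it assumes, purely for illustration, that the demand at each time step is Gaussian with a known mean and standard deviation, in which case the chance constraint P(demand <= stock) >= 1 - risk has the deterministic equivalent stock >= mean + z * sigma, with z the standard normal quantile at 1 - risk. The function names and the demand figures are hypothetical.

```python
from statistics import NormalDist


def safety_stock(sigma: float, risk: float) -> float:
    """Extra stock above the mean demand so that demand is covered
    with probability at least 1 - risk.

    Assumption (not from the paper): demand is Gaussian with
    standard deviation `sigma`.
    """
    if not 0.0 < risk < 1.0:
        raise ValueError("risk must lie strictly between 0 and 1")
    if sigma < 0.0:
        raise ValueError("sigma must be non-negative")
    # Standard normal quantile at the required service level.
    z = NormalDist().inv_cdf(1.0 - risk)
    return z * sigma


def required_stock(forecast, risk):
    """Per-step stock levels for a non-stationary demand forecast.

    `forecast` is a sequence of (mean, sigma) pairs, one per time step
    (hypothetical input format).
    """
    return [mean + safety_stock(sigma, risk) for mean, sigma in forecast]


if __name__ == "__main__":
    # Hypothetical three-step forecast with growing uncertainty.
    forecast = [(100.0, 5.0), (120.0, 10.0), (90.0, 20.0)]
    for step, level in enumerate(required_stock(forecast, risk=0.05)):
        print(f"step {step}: hold at least {level:.1f} units")
```

A lower accepted risk raises the quantile and therefore the safety stock, and a time-varying standard deviation makes the allocation dynamic. Within an MPC scheme such bounds would typically enter as constraints on the predicted states; that part is not shown here.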
Food supply chain network robustness : a literature review and research agenda
Today’s business environment is characterized by strong global competition, in which companies strive for leanness and maximum responsiveness. However, lean supply chain networks (SCNs) become more vulnerable to all kinds of disruptions. Food SCNs have to become robust, i.e. they should be able to continue to function in the event of a disruption as well as in a normal business environment. The current literature provides no explicit clarification of the robustness issue in the food SCN context. This paper explores the meaning of SCN robustness and highlights directions for further research.