7 research outputs found
What Matters In On-Policy Reinforcement Learning? A Large-Scale Empirical Study
In recent years, on-policy reinforcement learning (RL) has been successfully
applied to many different continuous control tasks. While RL algorithms are
often conceptually simple, their state-of-the-art implementations take numerous
low- and high-level design decisions that strongly affect the performance of
the resulting agents. Those choices are usually not extensively discussed in
the literature, leading to discrepancy between published descriptions of
algorithms and their implementations. This makes it hard to attribute progress
in RL and slows down overall progress [Engstrom'20]. As a step towards filling
that gap, we implement >50 such ``choices'' in a unified on-policy RL
framework, allowing us to investigate their impact in a large-scale empirical
study. We train over 250'000 agents in five continuous control environments of
different complexity and provide insights and practical recommendations for
on-policy training of RL agents
Solving N-player dynamic routing games with congestion: a mean field approach
The recent emergence of navigational tools has changed traffic patterns and has now enabled new types of congestion-aware routing control like dynamic road pricing. Using the fundamental diagram of traffic flows - applied in macroscopic and mesoscopic traffic modeling - the article introduces a new N-player dynamic routing game with explicit congestion dynamics. The model is well-posed and can reproduce heterogeneous departure times and congestion spill back phenomena. However, as Nash equilibrium computations are PPAD-complete, solving the game becomes intractable for large but realistic numbers of vehicles N. Therefore, the corresponding mean field game is also introduced. Experiments were performed on several classical benchmark networks of the traffic community: the Pigou, Braess, and Sioux Falls networks with heterogeneous origin, destination and departure time tuples. The Pigou and the Braess examples reveal that the mean field approximation is generally very accurate and computationally efficient as soon as the number of vehicles exceeds a few dozen. On the Sioux Falls network (76 links, 100 time steps), this approach enables learning traffic dynamics with more than 14,000 vehicles
Solving N-player dynamic routing games with congestion: a mean field approach
The recent emergence of navigational tools has changed traffic patterns and has now enabled new types of congestion-aware routing control like dynamic road pricing. Using the fundamental diagram of traffic flows - applied in macroscopic and mesoscopic traffic modeling - the article introduces a new N-player dynamic routing game with explicit congestion dynamics. The model is well-posed and can reproduce heterogeneous departure times and congestion spill back phenomena. However, as Nash equilibrium computations are PPAD-complete, solving the game becomes intractable for large but realistic numbers of vehicles N. Therefore, the corresponding mean field game is also introduced. Experiments were performed on several classical benchmark networks of the traffic community: the Pigou, Braess, and Sioux Falls networks with heterogeneous origin, destination and departure time tuples. The Pigou and the Braess examples reveal that the mean field approximation is generally very accurate and computationally efficient as soon as the number of vehicles exceeds a few dozen. On the Sioux Falls network (76 links, 100 time steps), this approach enables learning traffic dynamics with more than 14,000 vehicles
Solving N-player dynamic routing games with congestion: a mean field approach
The recent emergence of navigational tools has changed traffic patterns and has now enabled new types of congestion-aware routing control like dynamic road pricing. Using the fundamental diagram of traffic flows - applied in macroscopic and mesoscopic traffic modeling - the article introduces a new N-player dynamic routing game with explicit congestion dynamics. The model is well-posed and can reproduce heterogeneous departure times and congestion spill back phenomena. However, as Nash equilibrium computations are PPAD-complete, solving the game becomes intractable for large but realistic numbers of vehicles N. Therefore, the corresponding mean field game is also introduced. Experiments were performed on several classical benchmark networks of the traffic community: the Pigou, Braess, and Sioux Falls networks with heterogeneous origin, destination and departure time tuples. The Pigou and the Braess examples reveal that the mean field approximation is generally very accurate and computationally efficient as soon as the number of vehicles exceeds a few dozen. On the Sioux Falls network (76 links, 100 time steps), this approach enables learning traffic dynamics with more than 14,000 vehicles
Successful thrombectomy is beneficial in patients with pre-stroke disability: Results from an international multicenter cohort study
International audienceAbstract Excessive alcohol consumption is the leading cause of liver diseases in Western countries, especially in France. Alcoholârelated liver disease (ARLD) is an extremely broad context and there remains much to accomplish in terms of identifying patients, improving prognosis and treatment, and standardising practices. The French Association for the Study of the Liver wished to organise guidelines together with the French Alcohol Society in order to summarise the best evidence available about several key clinical points in ARLD. These guidelines have been elaborated based on the level of evidence available in the literature and each recommendation has been analysed, discussed and voted by the panel of experts. They describe how patients with ARLD should be managed nowadays and discuss the main unsettled issues in the field