Search CORE

27 research outputs found

Real-Time Reinforcement Learning

Author: Ramstedt Simon
Publication date: 01/09/2019

Markov Decision Processes (MDPs), the mathematical framework underlying most algorithms in Reinforcement Learning (RL), are often used in a way that wrongfully assumes that the state of an agent's environment does not change during action selection. As RL systems based on MDPs begin to find application in real-world safety-critical situations, this mismatch between the assumptions underlying classical MDPs and the reality of real-time computation may lead to undesirable outcomes. In this thesis, we introduce a new framework in which states and actions evolve simultaneously, and we show how it is related to the classical MDP formulation. We analyze existing algorithms under the new real-time formulation and show why they are suboptimal when used in real time. We then use those insights to create a new algorithm, Real-Time Actor Critic (RTAC), that outperforms the existing state-of-the-art continuous control algorithm Soft Actor Critic in both real-time and non-real-time settings.

Dépôt Institutionnel Numérique

Real-Time Reinforcement Learning

Author: Pal Christopher
Ramstedt Simon
Publication date: 01/01/2019

Markov Decision Processes (MDPs), the mathematical framework underlying most algorithms in Reinforcement Learning (RL), are often used in a way that wrongfully assumes that the state of an agent's environment does not change during action selection. As RL systems based on MDPs begin to find application in real-world safety-critical situations, this mismatch between the assumptions underlying classical MDPs and the reality of real-time computation may lead to undesirable outcomes. In this paper, we introduce a new framework in which states and actions evolve simultaneously, and we show how it is related to the classical MDP formulation. We analyze existing algorithms under the new real-time formulation and show why they are suboptimal when used in real time. We then use those insights to create a new algorithm, Real-Time Actor-Critic (RTAC), that outperforms the existing state-of-the-art continuous control algorithm Soft Actor-Critic in both real-time and non-real-time settings. Code and videos can be found at https://github.com/rmst/rtrl. Comment: Neural Information Processing Systems (2019)

arXiv.org e-Print Archive

PolyPublie

Reinforcement Learning with Random Delays

Author: Beltrame Giovanni
Binas Jonathan
Bouteiller Yann
Pal Christopher
Ramstedt Simon
Publication date: 08/10/2020

Action and observation delays commonly occur in many Reinforcement Learning applications, such as remote control scenarios. We study the anatomy of randomly delayed environments and show that partially resampling trajectory fragments in hindsight allows for off-policy multi-step value estimation. We apply this principle to derive Delay-Correcting Actor-Critic (DCAC), an algorithm based on Soft Actor-Critic with significantly better performance in environments with delays. This is shown theoretically and also demonstrated practically on a delay-augmented version of the MuJoCo continuous control benchmark.

arXiv.org e-Print Archive

PolyPublie

Suicidal Behavior and Alcohol Abuse

Author: Agerbo
Agren
Aharonovich
Alireza Ayatollahi
Allebeck
Allgulander
Anderson
Arcudi
Barak
Barr
Barraclough
Bartels
Bartels
Barter
Batki
Beautrais
Bech
Beck
Beck
Bedard
Bender
Berglund
Bergman
Bernal
Bernard
Bertolote
Bhave
Bhave
Bie
Bobak
Bridge
Buckley
Burns
Caces
Canetto
Cavanagh
Chatterji
Cheng
Cherpitel
Chiu
Christiansen
Clark
Clarke
Cohen-Sandler
Conner
Conner
Conner
Conner
Conwell
Conwell
Copello
Cornelius
Cornelius
Crews
Curran
Dalton
David Lester
Davis
Dawson
De Leo
De Luca
Degenhardt
Duberstein
Duberstein
Duffy
Dumais
Eaton
Evenden
Evren
Fawcett
Ferrada-Noli
Field
Flensborg-Madsen
Foster
Foster
Freed
Garfinkel
Garlow
Gay
Gianluca Serafini
Gibbons
Ginter
Giorgio D. Kotzalidis
Giovanni Dominici
Giulia Serra
Glass
Glowinski
Goldman
Gorwood
Gothert
Gould
Gould
Grant
Grant
Gruenewald
Gunnell
Hansen
Harris
Harrison
Harwood
Harwood
Hawton
Hayward
Heila
Heinz
Helseth
Hiroeh
Hoyer
Hucks
Hufford
Ikeda
Ilgen
Ilgen
Inskip
James
Joiner
Julien
June
Kamali
Kessler
Khantzian
Knapp
Knop
Koller
Kolves
Koob
Kraepelin
Krajnc
Kresnow
Leo Sher
Leon
Lesage
Lester
Lester
Levi
Lewohl
Lewohl
Light
Lindberg
Link
Lipsey
Lovinger
Luigi Janiri
Mackie
Magne Ingvar
Makela
Mann
Mann
Mann
Marco Innamorati
Marshal
Maurizio Pompili
McBride
McGirr
Meehan
Mendelson
Menninger
Merali
Merrill
Moran
Muhonen
Murphy
Murphy
Murphy
Murphy
Möller
Nemeroff
Nemtsov
Neves
Nie
Nie
Nie
Nie
Nordentoft
Norstrom
Norstrom
Norstrom
Nowak
O'Carroll
Ohara
Ohberg
Oquendo
Oscar-Berman
Oslin
Ozalp
Pacher
Paolo Girardi
Patterson
Petersen
Pirkola
Pirkola
Platt
Pompili
Preuss
Pridemore
Prinstein
Ramstedt
Ratsma
Razvodovsky
Razvodovsky
Razvodovsky
Regier
Reimer
Renaud
Rich
Ries
Roberto Tatarelli
Robins
Rossow
Rossow
Rossow
Roy
Roy
Rusch
Sareen
Schuckit
Scocco
Sher
Sher
Sher
Sher
Sher
Sher
Sher
Sher
Sherif
Shneidman
Silverman
Simon
Singh
Skog
Skog
Spak
Spirito
Spirito
Steele
Steele
Stefano Ferracuti
Steffensen
Stenback
Storvik
Strat
Sublette
Sullivan
Suokas
Suominen
Swahn
Szanto
Tall
Tapert
Townsend
Tyndale
Ukai
Umene-Nakano
Underwood
Uzun
Varnik
Velleman
Vijayakumar
Waern
Waern
Wallner
Wang
Wasserman
Wasserman
Wheeler
Whittington
Widiger
Wilcox
Wilhelmsen
Williams
Withers
Wojnar
Wolk-Wasserman
Worden
Yaldizli
Young
Zahl
Zhu
Publication venue: Molecular Diversity Preservation International (MDPI)
Publication date: 01/01/2010

Suicide is an escalating public health problem, and alcohol use has consistently been implicated in the precipitation of suicidal behavior. Alcohol abuse may lead to suicidality through disinhibition, impulsiveness and impaired judgment, but it may also be used as a means to ease the distress associated with committing an act of suicide. We reviewed evidence of the relationship between alcohol use and suicide through a search of the MEDLINE and PsycINFO electronic databases. Multiple genetically related intermediate phenotypes might influence the relationship between alcohol and suicide. Psychiatric disorders, including psychosis, mood disorders and anxiety disorders, as well as susceptibility to stress, might increase the risk of suicidal behavior, but may also have reciprocal influences with alcohol drinking patterns. Increased suicide risk may be heralded by social withdrawal, breakdown of social bonds, and social marginalization, which are common outcomes of untreated alcohol abuse and dependence. People with alcohol dependence or depression should be screened for other psychiatric symptoms and for suicidality. Programs for suicide prevention must take into account drinking habits and should reinforce healthy behavioral patterns.

Crossref

PubliCatt

Directory of Open Access Journals

PubMed Central

Archivio istituzionale della ricerca - Università di Genova

Archivio della ricerca- Università di Roma La Sapienza

Psykisk ohälsa hos ungdomar så som det beskrivs i dagstidningar: En kvalitativ textanalys [Mental ill-health among young people as described in daily newspapers: a qualitative text analysis]

Author: Kindbom Höydahl Simon
Ramstedt Adam
Publication venue: Örebro universitet, Institutionen för juridik, psykologi och socialt arbete
Publication date: 01/01/2019

Publikationer från Örebro universitet

Digitala Vetenskapliga Arkivet - Academic Archive On-line

matthiasplappert/keras-rl: v0.2.0rc1

Author: Alan Nichol
Ben Johnson
Bruno Marques
Dr. Kashif Rasul
Jonathan Rahn
Matthias Plappert
Simon Ramstedt

Deep Reinforcement Learning for Keras

ZENODO

Alcohol‐attributed disease burden in four Nordic countries between 2000 and 2017: Are the gender gaps narrowing? A comparison using the Global Burden of Disease, Injury and Risk Factor 2017 study

Author: Agardh Emilie E.
Allebeck Peter
Danielsson Anna-Karin
Eikemo Terje Andreas
Flodin Pär
Gakidou Emmanuela
Gissler Mika
Iburg Kim Moesgaard
Juel Knud
Kinge Jonas Minet
Knudsen Ann Kristin
McGrath John J.
Mäkelä Pia
Naghavi Mohsen
Ramstedt Mats
Skogen Jens Christoffer
Tollånes Mette C.
Vollset Stein Emil
Wenneberg Peter
Øverland Simon Nygaard
Publication venue: Wiley
Publication date: 01/01/2020

Introduction and Aims: The gender difference in alcohol use seems to have narrowed in the Nordic countries, but it is not clear to what extent this may have affected differences in levels of harm. We compared gender differences in all‐cause and cause‐specific alcohol‐attributed disease burden, as measured by disability‐adjusted life‐years (DALY), in four Nordic countries in 2000–2017, to find out if gender gaps in DALYs had narrowed. Design and Methods: Alcohol‐attributed disease burden by DALYs per 100 000 population with 95% uncertainty intervals were extracted from the Global Burden of Disease database. Results: In 2017, all‐cause DALYs in males varied between 2531 in Finland and 976 in Norway, and in females between 620 in Denmark and 270 in Norway. Finland had the largest gender differences and Norway the smallest, closely followed by Sweden. During 2000–2017, absolute gender differences in all‐cause DALYs declined by 31% in Denmark, 26% in Finland, 19% in Sweden and 18% in Norway. In Finland, this was driven by a larger relative decline in males than females; in Norway, it was due to increased burden in females. In Denmark, the burden in females declined slightly more than in males, in relative terms, while in Sweden the relative decline was similar in males and females. Discussion and Conclusions: The gender gaps in harm narrowed to a different extent in the Nordic countries, with the differences driven by different conditions. Findings are informative about how inequality, policy and sociocultural differences affect levels of harm by gender.

University of Bergen

Crossref

NORA - Norwegian Open Research Archives

UiS Brage