122 research outputs found
Randomized Optimal Stopping Problem in Continuous time and Reinforcement Learning Algorithm
In this paper, we study the optimal stopping problem in the so-called
exploratory framework, in which the agent takes actions randomly conditioning
on current state and an entropy-regularized term is added to the reward
functional. Such a transformation reduces the optimal stopping problem to a
standard optimal control problem. We derive the related HJB equation and prove
its solvability. Furthermore, we give a convergence rate of policy iteration
and the comparison to classical optimal stopping problem. Based on the
theoretical analysis, a reinforcement learning algorithm is designed and
numerical results are demonstrated for several models
Weak Limits of Random Coefficient Autoregressive Processes and their Application in Ruin Theory
We prove that a large class of discrete-time insurance surplus processes converge weakly to a generalized Ornstein-Uhlenbeck process, under a suitable re-normalization and when the time-step goes to 0. Motivated by ruin theory, we use this result to obtain approximations for the moments, the ultimate ruin probability and the discounted penalty function of the discrete-time process
- …