122 research outputs found

    Randomized Optimal Stopping Problem in Continuous time and Reinforcement Learning Algorithm

    Full text link
    In this paper, we study the optimal stopping problem in the so-called exploratory framework, in which the agent takes actions randomly conditioning on current state and an entropy-regularized term is added to the reward functional. Such a transformation reduces the optimal stopping problem to a standard optimal control problem. We derive the related HJB equation and prove its solvability. Furthermore, we give a convergence rate of policy iteration and the comparison to classical optimal stopping problem. Based on the theoretical analysis, a reinforcement learning algorithm is designed and numerical results are demonstrated for several models

    Weak Limits of Random Coefficient Autoregressive Processes and their Application in Ruin Theory

    Get PDF
    We prove that a large class of discrete-time insurance surplus processes converge weakly to a generalized Ornstein-Uhlenbeck process, under a suitable re-normalization and when the time-step goes to 0. Motivated by ruin theory, we use this result to obtain approximations for the moments, the ultimate ruin probability and the discounted penalty function of the discrete-time process
    corecore