3 research outputs found

    Optimal employee retention when inferring unknown learning curves

    No full text
    This paper formulates an employer’s hiring and retention decisions as an infinite-armed bandit problem and characterizes the structure of optimal hiring and retention policies. We develop approximations that allow us to explicitly calculate these policies and to evaluate their benefit. The solution involves a balance of two types of learning: the learning that reflects the improvement in performance of employees as they gain experience, and the Bayesian learning of employers as they infer properties of employees ’ abilities to inform the decision of whether to retain or replace employees. Numerical experiments with Monte Carlo simulation suggest that the gains to active screening and monitoring of employees can be substantial.

    Approximate Dynamic Programming: Health Care Applications

    Get PDF
    This dissertation considers different approximate solutions to Markov decision problems formulated within the dynamic programming framework in two health care applications. Dynamic formulations are appropriate for problems which require optimization over time and a variety of settings for different scenarios and policies. This is similar to the situation in a lot of health care applications for which because of the curses of dimensionality, exact solutions do not always exist. Thus, approximate analysis to find near optimal solutions are motivated. To check the quality of approximation, additional evidence such as boundaries, consistency analysis, or asymptotic behavior evaluation are required. Emergency vehicle management and dose-finding clinical trials are the two heath care applications considered here in order to investigate dynamic formulations, approximate solutions, and solution quality assessments. The dynamic programming formulation for real-time ambulance dispatching and relocation policies, response-adaptive dose-finding clinical trial, and optimal stopping of adaptive clinical trials is presented. Approximate solutions are derived by multiple methods such as basis function regression, one-step look-ahead policy, simulation-based gridding algorithm, and diffusion approximation. Finally, some boundaries to assess the optimality gap and a proof of consistency for approximate solutions are presented to ensure the quality of approximation
    corecore