5 research outputs found

    Linear programming-based solution methods for constrained partially observable Markov decision processes

    Full text link
    Constrained partially observable Markov decision processes (CPOMDPs) have been used to model various real-world phenomena. However, they are notoriously difficult to solve to optimality, and there exist only a few approximation methods for obtaining high-quality solutions. In this study, grid-based approximations are used in combination with linear programming (LP) models to generate approximate policies for CPOMDPs. A detailed numerical study is conducted with six CPOMDP problem instances considering both their finite and infinite horizon formulations. The quality of approximation algorithms for solving unconstrained POMDP problems is established through a comparative analysis with exact solution methods. Then, the performance of the LP-based CPOMDP solution approaches for varying budget levels is evaluated. Finally, the flexibility of LP-based approaches is demonstrated by applying deterministic policy constraints, and a detailed investigation into their impact on rewards and CPU run time is provided. For most of the finite horizon problems, deterministic policy constraints are found to have little impact on expected reward, but they introduce a significant increase to CPU run time. For infinite horizon problems, the reverse is observed: deterministic policies tend to yield lower expected total rewards than their stochastic counterparts, but the impact of deterministic constraints on CPU run time is negligible in this case. Overall, these results demonstrate that LP models can effectively generate approximate policies for both finite and infinite horizon problems while providing the flexibility to incorporate various additional constraints into the underlying model.Comment: 42 pages, 8 figure

    Hidden Markov Model Based Visual Perception Filtering in Robotic Soccer

    No full text
    Autonomous robots can initiate their mission plans only after gathering sufficient information about the environment. Therefore reliable perception information plays a major role in the overall success of an autonomous robot. The Hidden Markov Model based post-perception filtering module proposed in this paper aims to identify and remove spurious perception information in a given perception sequence using the generic meta-pose definition. This method allows representing uncertainty in more abstract terms compared to the common physical representations. Our experiments with the four legged AIBO robot indicated that the proposed module improved perception and localization performance significantly

    Enhancing the construction safety training by using virtual environment : V-SAFE

    No full text
    Construction is one of the most high-risk industries in the world. The safety records of the construction sector report that, construction workers are approximately over three times more likely to be exposed to serious accidents comparing to other industries. In addition to these injuries and fatalities, work-related accidents also cause financial damage, conflictual cases and pecuniary penalties for the construction companies. Therefore, the significance of the safety management process has been increasing in the construction industry. However, since the majority of the construction activities are complex and require collaboration between the workers, the provision of the safety has become one of the challenging tasks. So, the behavior-based skills of the workers play a crucial role in the safety management. Traditional safety training methods have been merely focusing on information-based techniques such as lectures, videos, written materials, etc. On the other hand, previous research has indicated that adequate safety training should also involve behavioral modeling and hands-on training, together with traditional learning methods. Due to the nature of the construction projects, hands-on training in the construction field is not practicable. In this sense, using virtual environments is an effective method that enables a safe environment for the users without being exposed to adverse effects of the failed tasks. Thus, virtual environments allow visual simulation that is helpful for the improvement of the trainees’ behavior-based skills. Therefore, virtual environments provide an important opportunity to advance the level of safety training. The main aim of the study is to describe the developed virtual safety training environment called V-SAFE (Virtual Safety Analysis For Engineering applications), which involves methods to simulate, and visualize construction operation scenarios. V-SAFE is based on the Unreal game engine for the visualization of the environment, and USARSim is used for the high-fidelity simulation of the robot behavior and environment mapping. V-SAFE is projected to establish a base to identify construction-specific safety risks and to improve the behavior-based skills of the construction project participants. In brief, V-SAFE has high potential to improve the risk recognition capability, and situational awareness of the construction managers, workers, safety managers, field engineers. So, V-SAFE could be beneficial for the construction organizations aim to advance the effectiveness of the safety training.Non UBCUnreviewedFacultyOthe

    A multi-objective constrained POMDP model for breast cancer screening

    Full text link
    Breast cancer is a common and deadly disease, but it is often curable when diagnosed early. While most countries have large-scale screening programs, there is no consensus on a single globally accepted policy for breast cancer screening. The complex nature of the disease; limited availability of screening methods such as mammography, magnetic resonance imaging (MRI), and ultrasound screening; and public health policies all factor into the development of screening policies. Resource availability concerns necessitate the design of policies which conform to a budget, a problem which can be modelled as a constrained partially observable Markov decision process (CPOMDP). In this study, we propose a multi-objective CPOMDP model for breast cancer screening with two objectives: minimize the lifetime risk of dying due to breast cancer and maximize the quality-adjusted life years. Additionally, we consider an expanded action space which allows for screening methods beyond mammography. Each action has a unique impact on quality-adjusted life years and lifetime risk, as well as a unique cost. Our results reveal the Pareto frontier of optimal solutions for average and high risk patients at different budget levels, which can be used by decision makers to set policies in practice
    corecore