827 research outputs found

    Goal-oriented Dialogue Policy Learning from Failures

    Reinforcement learning methods have been used for learning dialogue policies. However, learning an effective dialogue policy frequently requires prohibitively many conversations. This is partly because of the sparse rewards in dialogues and the very few successful dialogues in the early learning phase. Hindsight experience replay (HER) enables learning from failures, but vanilla HER is inapplicable to dialogue learning because the goals are implicit. In this work, we develop two complex HER methods that provide different trade-offs between complexity and performance and, for the first time, enable HER-based dialogue policy learning. Experiments using a realistic user simulator show that our HER methods perform better in learning rate than existing experience replay methods (as applied to deep Q-networks).

    Equipment Reliability Data Collection: A Journey to Operational Excellence in the Offshore Industry

    Presentation
    The offshore industry has witnessed catastrophic incidents, which continue to occur. There is a need to learn from various best practices and incidents and to continue moving towards safer operations. Data in different forms on equipment reliability, near misses, key performance indicators and more exist within organizations and agencies in this industry. Most of these databases, if collected and connected, could be used to prevent an event and/or assess its consequences. Near-miss databases can help to assess the barriers that would prevent an event from escalating to consequences, and reliability databases can be used to assess the barriers that can prevent an event. This paper describes the experience, initiatives and major challenges of the Ocean Energy Safety Institute (OESI) and the Mary Kay O’Connor Process Safety Center (MKOPSC) in data collection projects and initiatives. The paper concludes with next steps for improving existing databases in areas such as data quality, data validation, and increased accessibility and searchability, and provides a list of potential research projects.