786 research outputs found

    Semi-Supervised Dialogue Policy Learning via Stochastic Reward Estimation

    Full text link
    Dialogue policy optimization often obtains feedback until task completion in task-oriented dialogue systems. This is insufficient for training intermediate dialogue turns since supervision signals (or rewards) are only provided at the end of dialogues. To address this issue, reward learning has been introduced to learn from state-action pairs of an optimal policy to provide turn-by-turn rewards. This approach requires complete state-action annotations of human-to-human dialogues (i.e., expert demonstrations), which is labor intensive. To overcome this limitation, we propose a novel reward learning approach for semi-supervised policy learning. The proposed approach learns a dynamics model as the reward function which models dialogue progress (i.e., state-action sequences) based on expert demonstrations, either with or without annotations. The dynamics model computes rewards by predicting whether the dialogue progress is consistent with expert demonstrations. We further propose to learn action embeddings for a better generalization of the reward function. The proposed approach outperforms competitive policy learning baselines on MultiWOZ, a benchmark multi-domain dataset

    Laser Deposition Cladding On-Line Inspection Using 3-D Scanner

    Get PDF
    Laser deposition directly deposits metal cladding to fabricate and repair components. In order to finish the fabrication or repair, 3-D shape of the deposition needs to be inspected, and thus it can be determined if it has sufficient cladding to fabricate a part after deposition process. In the present hybrid system in the Laser Aided Manufacturing Lab (LAMP) at the University of Missouri - Rolla, a CMM system is used to do the inspection. A CMM requires point-by-point contact, which is time consuming and difficult to plan for an irregular deposition geometry. Also, the CMM is a separate device, which requires removal of the part from the hybrid system, which can induce fixture errors. The 3-D scanner is a non-contact tool to measure the 3-D shape of laser deposition cladding which is fast and accurate. In this paper, A prototype non-contact 3-D scanner approach has been implemented to inspect the free-form and complex parts built by laser deposition. Registration of the measured model and 3-D CAD model allows the comparison between the two models. It enables us to determine if the deposition is sufficient before machining.Mechanical Engineerin

    Towards Certain Fixes with Editing Rules and Master Data

    Get PDF
    A variety of integrity constraints have been studied for data cleaning. While these constraints can detect the presence of errors, they fall short of guiding us to correct the errors. Indeed, data repairing based on these constraints may not find certain fixes that are absolutely correct, and worse, may introduce new errors when repairing the data. We propose a method for finding certain fixes, based on master data, a notion of certain regions , and a class of editing rules . A certain region is a set of attributes that are assured correct by the users. Given a certain region and master data, editing rules tell us what attributes to fix and how to update them. We show how the method can be used in data monitoring and enrichment. We develop techniques for reasoning about editing rules, to decide whether they lead to a unique fix and whether they are able to fix all the attributes in a tuple, relative to master data and a certain region. We also provide an algorithm to identify minimal certain regions, such that a certain fix is warranted by editing rules and master data as long as one of the regions is correct. We experimentally verify the effectiveness and scalability of the algorithm. </jats:p

    A Uniform Method of Mechanical Disturbance Torque Measurement and Reduction for the Seeker Gimbal in the Assembly Process

    Get PDF
    In the manufacturing process of seekers, the reduction of disturbance torques (DTs) is a critical but time-consuming work. The innovation of the paper is to present a uniform method to measure and reduce mechanical DTs during gimbal’s assembly process. Firstly, the relationships between assembly parameters and DTs are established and analyzed by theoretical model. And then, a measuring system is established to measure the driven torque of the gimbal’s torque motor. With the goal of stabilizing and minimizing the driven torque, all assembly parameters relating to DTs could be adjusted. Through the proof of a lot of experiments, this proposed method could reduce the bias and fluctuation of these mechanical DTs. This method could also be used for the mechanical DTs reduction of most similar productions and improve the quality and efficiency during their system assembly process

    Reverse Nearest Neighbor Heat Maps: A Tool for Influence Exploration

    Full text link
    We study the problem of constructing a reverse nearest neighbor (RNN) heat map by finding the RNN set of every point in a two-dimensional space. Based on the RNN set of a point, we obtain a quantitative influence (i.e., heat) for the point. The heat map provides a global view on the influence distribution in the space, and hence supports exploratory analyses in many applications such as marketing and resource management. To construct such a heat map, we first reduce it to a problem called Region Coloring (RC), which divides the space into disjoint regions within which all the points have the same RNN set. We then propose a novel algorithm named CREST that efficiently solves the RC problem by labeling each region with the heat value of its containing points. In CREST, we propose innovative techniques to avoid processing expensive RNN queries and greatly reduce the number of region labeling operations. We perform detailed analyses on the complexity of CREST and lower bounds of the RC problem, and prove that CREST is asymptotically optimal in the worst case. Extensive experiments with both real and synthetic data sets demonstrate that CREST outperforms alternative algorithms by several orders of magnitude.Comment: Accepted to appear in ICDE 201

    Enhanced Strategies for Seismic Resilient Posttensioned Reinforced Concrete Bridge Piers: Experimental Tests and Numerical Simulations

    Get PDF
    Recent studies have proposed and investigated the use of unbonded posttensioned (PT) bars in bridge substructure systems to improve their self-centering behavior. However, the lateral loading resistance and self-centering capacities of reinforced concrete (RC) piers with PT bars (i.e., the posttensioned RC bridge piers: PRC piers) could be easily compromised by early crushing of the base compression toe during a seismic event. Hence, in order to enhance the pier base integrity, the present study proposes and experimentally investigates three simple strategies for enhancing the seismic-damage resistance of PRC piers based on the use of (1) a steel tube to encase the PRC pier’s end segment; (2) ultrahigh performance concrete at the PRC pier’s end segment; and (3) engineered cementitious composite mortar bed underneath the pier bottom. The performance of these three PRC piers was assessed by comparing them with a conventional PRC pier under cyclic loading and considering the damage evolution and the cyclic force-displacement response. The comparison shows that the proposed enhanced PRC solutions allow improving the seismic performance of the system sustaining large lateral drifts with good self-centering behavior. Moreover, based on the test results, finite element models accounting for PT force loss and the rocking characteristics were validated to reproduce the piers’ cyclic response, thus highlighting the importance of considering PT force loss during numerical simulations
    corecore