59,934 research outputs found

    Active Perception in Adversarial Scenarios using Maximum Entropy Deep Reinforcement Learning

    Full text link
    We pose an active perception problem where an autonomous agent actively interacts with a second agent with potentially adversarial behaviors. Given the uncertainty in the intent of the other agent, the objective is to collect further evidence to help discriminate potential threats. The main technical challenges are the partial observability of the agent intent, the adversary modeling, and the corresponding uncertainty modeling. Note that an adversary agent may act to mislead the autonomous agent by using a deceptive strategy that is learned from past experiences. We propose an approach that combines belief space planning, generative adversary modeling, and maximum entropy reinforcement learning to obtain a stochastic belief space policy. By accounting for various adversarial behaviors in the simulation framework and minimizing the predictability of the autonomous agent's action, the resulting policy is more robust to unmodeled adversarial strategies. This improved robustness is empirically shown against an adversary that adapts to and exploits the autonomous agent's policy when compared with a standard Chance-Constraint Partially Observable Markov Decision Process robust approach

    Identification of the major cause of endemically poor mobilities in SiC/SiO2 structures

    Full text link
    Materials with good carrier mobilities are desired for device applications, but in real devices the mobilities are usually limited by the presence of interfaces and contacts. Mobility degradation at semiconductor-dielectric interfaces is generally attributed to defects at the interface or inside the dielectric, as is the case in Si/SiO2 structures, where processing does not introduce detrimental defects in the semiconductor. In the case of SiC/SiO2 structures, a decade of research focused on reducing or passivating interface and oxide defects, but the low mobilities have persisted. By invoking theoretical results and available experimental evidence, we show that thermal oxidation generates carbon di-interstitial defects inside the semiconductor substrate and that they are a major cause of the poor mobility in SiC/SiO2 structures

    Transferable Pedestrian Motion Prediction Models at Intersections

    Full text link
    One desirable capability of autonomous cars is to accurately predict the pedestrian motion near intersections for safe and efficient trajectory planning. We are interested in developing transfer learning algorithms that can be trained on the pedestrian trajectories collected at one intersection and yet still provide accurate predictions of the trajectories at another, previously unseen intersection. We first discussed the feature selection for transferable pedestrian motion models in general. Following this discussion, we developed one transferable pedestrian motion prediction algorithm based on Inverse Reinforcement Learning (IRL) that infers pedestrian intentions and predicts future trajectories based on observed trajectory. We evaluated our algorithm on a dataset collected at two intersections, trained at one intersection and tested at the other intersection. We used the accuracy of augmented semi-nonnegative sparse coding (ASNSC), trained and tested at the same intersection as a baseline. The result shows that the proposed algorithm improves the baseline accuracy by 40% in the non-transfer task, and 16% in the transfer task
    • …
    corecore