10,871 research outputs found

    Learn to Interpret Atari Agents

    Full text link
    Deep Reinforcement Learning (DeepRL) agents surpass human-level performances in a multitude of tasks. However, the direct mapping from states to actions makes it hard to interpret the rationale behind the decision making of agents. In contrast to previous a-posteriori methods of visualizing DeepRL policies, we propose an end-to-end trainable framework based on Rainbow, a representative Deep Q-Network (DQN) agent. Our method automatically learns important regions in the input domain, which enables characterizations of the decision making and interpretations for non-intuitive behaviors. Hence we name it Region Sensitive Rainbow (RS-Rainbow). RS-Rainbow utilizes a simple yet effective mechanism to incorporate visualization ability into the learning model, not only improving model interpretability, but leading to improved performance. Extensive experiments on the challenging platform of Atari 2600 demonstrate the superiority of RS-Rainbow. In particular, our agent achieves state of the art at just 25% of the training frames. Demonstrations and code are available at https://github.com/yz93/Learn-to-Interpret-Atari-Agents

    Integrable Open Spin Chains from Flavored ABJM Theory

    Full text link
    We compute the two-loop anomalous dimension matrix in the scalar sector of planar N=3{\cal N}=3 flavored ABJM theory. Using coordinate Bethe ansatz, we obtain the reflection matrix and confirm that the boundary Yang-Baxter equations are satisfied. This establishes the integrability of this theory in the scalar sector at the two-loop order.Comment: v2, 25 pages, 2 figures, minor corrections, references adde

    ηQ\eta_{Q} meson photoproduction in ultrarelativistic heavy ion collisions

    Get PDF
    The transverse momentum distributions for inclusive ηc,b\eta_{c,b} meson described by gluon-gluon interactions from photoproduction processes in relativistic heavy ion collisions are calculated. We considered the color singlet (CS) and color octet (CO) components with the framework of non-relativistic Quantum Chromodynamics (NRQCD) into the production of heavy quarkonium. The phenomenological values of the matrix elements for the color-singlet and color-octet components give the main contribution to the production of heavy quarkonium from the gluon-gluon interaction caused by the emission of additional gluon in the initial state. The numerical results indicate that the contribution of photoproduction processes cannot be negligible for mid-rapidity in p-p and Pb-Pb collisions at the Large Hadron Collider (LHC) energies.Comment: 11 pages, 2 figure

    Two Sides of the Same Coin: White-box and Black-box Attacks for Transfer Learning

    Full text link
    Transfer learning has become a common practice for training deep learning models with limited labeled data in a target domain. On the other hand, deep models are vulnerable to adversarial attacks. Though transfer learning has been widely applied, its effect on model robustness is unclear. To figure out this problem, we conduct extensive empirical evaluations to show that fine-tuning effectively enhances model robustness under white-box FGSM attacks. We also propose a black-box attack method for transfer learning models which attacks the target model with the adversarial examples produced by its source model. To systematically measure the effect of both white-box and black-box attacks, we propose a new metric to evaluate how transferable are the adversarial examples produced by a source model to a target model. Empirical results show that the adversarial examples are more transferable when fine-tuning is used than they are when the two networks are trained independently
    • …
    corecore