64 research outputs found

    Learning to refer informatively by amortizing pragmatic reasoning

    Full text link
    A hallmark of human language is the ability to effectively and efficiently convey contextually relevant information. One theory for how humans reason about language is presented in the Rational Speech Acts (RSA) framework, which captures pragmatic phenomena via a process of recursive social reasoning (Goodman & Frank, 2016). However, RSA represents ideal reasoning in an unconstrained setting. We explore the idea that speakers might learn to amortize the cost of RSA computation over time by directly optimizing for successful communication with an internal listener model. In simulations with grounded neural speakers and listeners across two communication game datasets representing synthetic and human-generated data, we find that our amortized model is able to quickly generate language that is effective and concise across a range of contexts, without the need for explicit pragmatic reasoning.Comment: Accepted to CogSci 202

    Learning Outside the Box: Discourse-level Features Improve Metaphor Identification.

    Get PDF
    Most current approaches to metaphor identi-fication use restricted linguistic contexts, e.g.by considering only a verb’s arguments or the sentence containing a phrase.Inspired by pragmatic accounts of metaphor, we argue that broader discourse features are crucial for bet-ter metaphor identification. We train simple gradient boosting classifiers on representations of an utterance and its surrounding discourse learned with a variety of document embedding methods, obtaining near state-of-the-art results on the 2018 VU Amsterdam metaphor iden-tification task without the complex metaphor-specific features or deep neural architectures employed by other systems.A qualitative analysis further confirms the need for broader context in metaphor processing

    Improving Intrinsic Exploration with Language Abstractions

    Get PDF
    Reinforcement learning (RL) agents are particularly hard to train when rewards are sparse. One common solution is to use intrinsic rewards to encourage agents to explore their environment. However, recent intrinsic exploration methods often use state-based novelty measures which reward low-level exploration and may not scale to domains requiring more abstract skills. Instead, we explore natural language as a general medium for highlighting relevant abstractions in an environment. Unlike previous work, we evaluate whether language can improve over existing exploration methods by directly extending (and comparing to) competitive intrinsic exploration baselines: AMIGo (Campero et al., 2021) and NovelD (Zhang et al., 2021). These language-based variants outperform their non-linguistic forms by 47-85% across 13 challenging tasks from the MiniGrid and MiniHack environment suites.Comment: NeurIPS 202

    Clinical significance of vasculogenic mimicry in human gliomas

    Get PDF
    Vasculogenic mimicry (VM) is known as non-endothelial tumor cell-lined microvascular channels in aggressive tumors. We have previously found the presence of VM in high-grade gliomas. In this study, we aimed to identify VM patterns in gliomas and to explore their clinical significance. Tumor samples as well as their detailed clinical/prognostic data were collected from 101 patients. Vasculogenic mimicry in the glioma samples was determined by dual staining for endothelial marker CD34 and periodic acid–Schiff (PAS). Tumor samples were also immunohistochemically stained for Ki-67, VEGF, COX-2 and MMP-9. The association between VM and the clinical characteristics of the patients were analyzed. A Kaplan–Meier survival analysis and log-rank tests were performed to compare survival times of the patients. Vasculogenic mimicry was present in 13 out of 101 samples. The higher grade gliomas had a higher incidence of VM than that of lower grade gliomas (P = 0.006). Vasculogenic mimicry channels were associated with the expression of COX-2 and MMP-9 (P < 0.05). While there was no association between the existence of VM and the sex, age and preoperative epilepsy of the patients, or expression of Ki-67 and VEGF. However, patients with VM-positive gliomas survived a shorter period of time than those with VM negative gliomas (P = 0.027). Interestingly, in high-grade gliomas, the level of microvascular density was lower in VM positive tumors than those VM negative tumors (P = 0.039). Our results suggest that VM channels in gliomas correlate with increasing malignancy and higher aggressiveness, and may provide a complementation to the tumor’s blood supply, especially in less vascularized regions, which may aid in the identification of glioma patients with a poorer prognosis

    Sleeper Agents: Training Deceptive LLMs that Persist Through Safety Training

    Full text link
    Humans are capable of strategically deceptive behavior: behaving helpfully in most situations, but then behaving very differently in order to pursue alternative objectives when given the opportunity. If an AI system learned such a deceptive strategy, could we detect it and remove it using current state-of-the-art safety training techniques? To study this question, we construct proof-of-concept examples of deceptive behavior in large language models (LLMs). For example, we train models that write secure code when the prompt states that the year is 2023, but insert exploitable code when the stated year is 2024. We find that such backdoor behavior can be made persistent, so that it is not removed by standard safety training techniques, including supervised fine-tuning, reinforcement learning, and adversarial training (eliciting unsafe behavior and then training to remove it). The backdoor behavior is most persistent in the largest models and in models trained to produce chain-of-thought reasoning about deceiving the training process, with the persistence remaining even when the chain-of-thought is distilled away. Furthermore, rather than removing backdoors, we find that adversarial training can teach models to better recognize their backdoor triggers, effectively hiding the unsafe behavior. Our results suggest that, once a model exhibits deceptive behavior, standard techniques could fail to remove such deception and create a false impression of safety.Comment: updated to add missing acknowledgement

    Pathogenic Mechanism of the FIG4 Mutation Responsible for Charcot-Marie-Tooth Disease CMT4J

    Get PDF
    CMT4J is a severe form of Charcot-Marie-Tooth neuropathy caused by mutation of the phosphoinositide phosphatase FIG4/SAC3. Affected individuals are compound heterozygotes carrying the missense allele FIG4-I41T in combination with a null allele. Analysis using the yeast two-hybrid system demonstrated that the I41T mutation impairs interaction of FIG4 with the scaffold protein VAC14. The critical role of this interaction was confirmed by the demonstration of loss of FIG4 protein in VAC14 null mice. We developed a mouse model of CMT4J by expressing a Fig4-I41T cDNA transgene on the Fig4 null background. Expression of the mutant transcript at a level 5× higher than endogenous Fig4 completely rescued lethality, whereas 2× expression gave only partial rescue, providing a model of the human disease. The level of FIG4-I41T protein in transgenic tissues is only 2% of that predicted by the transcript level, as a consequence of the protein instability caused by impaired interaction of the mutant protein with VAC14. Analysis of patient fibroblasts demonstrated a comparably low level of mutant I41T protein. The abundance of FIG4-I41T protein in cultured cells is increased by treatment with the proteasome inhibitor MG-132. The data demonstrate that FIG4-I41T is a hypomorphic allele encoding a protein that is unstable in vivo. Expression of FIG4-I41T protein at 10% of normal level is sufficient for long-term survival, suggesting that patients with CMT4J could be treated by increased production or stabilization of the mutant protein. The transgenic model will be useful for testing in vivo interventions to increase the abundance of the mutant protein
    corecore