168 research outputs found

    On the Robustness of Reading Comprehension Models to Entity Renaming

    Full text link
    We study the robustness of machine reading comprehension (MRC) models to entity renaming -- do models make more wrong predictions when the same questions are asked about an entity whose name has been changed? Such failures imply that models overly rely on entity information to answer questions, and thus may generalize poorly when facts about the world change or questions are asked about novel entities. To systematically audit this issue, we present a pipeline to automatically generate test examples at scale, by replacing entity names in the original test sample with names from a variety of sources, ranging from names in the same test set, to common names in life, to arbitrary strings. Across five datasets and three pretrained model architectures, MRC models consistently perform worse when entities are renamed, with particularly large accuracy drops on datasets constructed via distant supervision. We also find large differences between models: SpanBERT, which is pretrained with span-level masking, is more robust than RoBERTa, despite having similar accuracy on unperturbed test data. We further experiment with different masking strategies as the continual pretraining objective and find that entity-based masking can improve the robustness of MRC models.Comment: Accepted to NAACL 202

    Tree-Level Formalism

    Full text link
    We review two novel techniques used to calculate tree-level scattering amplitudes efficiently: MHV diagrams, and on-shell recursion relations. For the MHV diagrams, we consider applications to tree-level amplitudes and focus in particular on the N=4 supersymmetric formulation. We also briefly describe the derivation of loop amplitudes using MHV diagrams. For the recursion relations, after presenting their general proof, we discuss several applications to massless theories with and without supersymmetry, to theories with massive particles, and to graviton amplitudes in General Relativity. This article is an invited review for a special issue of Journal of Physics A devoted to "Scattering Amplitudes in Gauge Theories".Comment: 40 pages, 8 figures, invited review for a special issue of Journal of Physics A devoted to "Scattering Amplitudes in Gauge Theories", R. Roiban(ed), M. Spradlin(ed), A. Volovich(ed); v2: minor corrections, references adde

    Accretion of Planetary Material onto Host Stars

    Full text link
    Accretion of planetary material onto host stars may occur throughout a star's life. Especially prone to accretion, extrasolar planets in short-period orbits, while relatively rare, constitute a significant fraction of the known population, and these planets are subject to dynamical and atmospheric influences that can drive significant mass loss. Theoretical models frame expectations regarding the rates and extent of this planetary accretion. For instance, tidal interactions between planets and stars may drive complete orbital decay during the main sequence. Many planets that survive their stars' main sequence lifetime will still be engulfed when the host stars become red giant stars. There is some observational evidence supporting these predictions, such as a dearth of close-in planets around fast stellar rotators, which is consistent with tidal spin-up and planet accretion. There remains no clear chemical evidence for pollution of the atmospheres of main sequence or red giant stars by planetary materials, but a wealth of evidence points to active accretion by white dwarfs. In this article, we review the current understanding of accretion of planetary material, from the pre- to the post-main sequence and beyond. The review begins with the astrophysical framework for that process and then considers accretion during various phases of a host star's life, during which the details of accretion vary, and the observational evidence for accretion during these phases.Comment: 18 pages, 5 figures (with some redacted), invited revie

    Optimization of MicroCT Imaging and Blood Vessel Diameter Quantitation of Preclinical Specimen Vasculature with Radiopaque Polymer Injection Medium

    Get PDF
    Vascular networks within a living organism are complex, multi-dimensional, and challenging to image capture. Radio-angiographic studies in live animals require a high level of infrastructure and technical investment in order to administer costly perfusion mediums whose signals metabolize and degrade relatively rapidly, diminishing within a few hours or days. Additionally, live animal specimens must not be subject to long duration scans, which can cause high levels of radiation exposure to the specimen, limiting the quality of images that can be captured. Lastly, despite technological advances in live-animal specimen imaging, it is quite difficult to minimize or prevent movement of a live animal, which can cause motion artifacts in the final data output. It is demonstrated here that through the use of postmortem perfusion protocols of radiopaque silicone polymer mediums and ex-vivo organ harvest, it is possible to acquire a high level of vascular signal in preclinical specimens through the use of micro-computed tomographic (microCT) imaging. Additionally, utilizing high-order rendering algorithms, it is possible to further derive vessel morphometrics for qualitative and quantitative analysis

    Evasion of anti-growth signaling: a key step in tumorigenesis and potential target for treatment and prophylaxis by natural compounds

    Get PDF
    The evasion of anti-growth signaling is an important characteristic of cancer cells. In order to continue to proliferate, cancer cells must somehow uncouple themselves from the many signals that exist to slow down cell growth. Here, we define the anti-growth signaling process, and review several important pathways involved in growth signaling: p53, phosphatase and tensin homolog (PTEN), retinoblastoma protein (Rb), Hippo, growth differentiation factor 15 (GDF15), AT-rich interactive domain 1A (ARID1A), Notch, insulin-like growth factor (IGF), and KrĂŒppel-like factor 5 (KLF5) pathways. Aberrations in these processes in cancer cells involve mutations and thus the suppression of genes that prevent growth, as well as mutation and activation of genes involved in driving cell growth. Using these pathways as examples, we prioritize molecular targets that might be leveraged to promote anti-growth signaling in cancer cells. Interestingly, naturally-occurring phytochemicals found in human diets (either singly or as mixtures) may promote anti-growth signaling, and do so without the potentially adverse effects associated with synthetic chemicals. We review examples of naturally-occurring phytochemicals that may be applied to prevent cancer by antagonizing growth signaling, and propose one phytochemical for each pathway. These are: epigallocatechin-3-gallate (EGCG) for the Rb pathway, luteolin for p53, curcumin for PTEN, porphyrins for Hippo, genistein for GDF15, resveratrol for ARID1A, withaferin A for Notch and diguelin for the IGF1-receptor pathway. The coordination of anti-growth signaling and natural compound studies will provide insight into the future application of these compounds in the clinical setting

    Software-Hardware Co-design for Fast and Scalable Training of Deep Learning Recommendation Models

    Full text link
    Deep learning recommendation models (DLRMs) are used across many business-critical services at Facebook and are the single largest AI application in terms of infrastructure demand in its data-centers. In this paper we discuss the SW/HW co-designed solution for high-performance distributed training of large-scale DLRMs. We introduce a high-performance scalable software stack based on PyTorch and pair it with the new evolution of Zion platform, namely ZionEX. We demonstrate the capability to train very large DLRMs with up to 12 Trillion parameters and show that we can attain 40X speedup in terms of time to solution over previous systems. We achieve this by (i) designing the ZionEX platform with dedicated scale-out network, provisioned with high bandwidth, optimal topology and efficient transport (ii) implementing an optimized PyTorch-based training stack supporting both model and data parallelism (iii) developing sharding algorithms capable of hierarchical partitioning of the embedding tables along row, column dimensions and load balancing them across multiple workers; (iv) adding high-performance core operators while retaining flexibility to support optimizers with fully deterministic updates (v) leveraging reduced precision communications, multi-level memory hierarchy (HBM+DDR+SSD) and pipelining. Furthermore, we develop and briefly comment on distributed data ingestion and other supporting services that are required for the robust and efficient end-to-end training in production environments
    • 

    corecore