3,049 research outputs found

    Two Failures of Self-Consistency in the Multi-Step Reasoning of LLMs

    Full text link
    Large language models (LLMs) have achieved widespread success on a variety of in-context few-shot tasks, but this success is typically evaluated via correctness rather than consistency. We argue that self-consistency is an important criteria for valid multi-step reasoning in tasks where the solution is composed of the answers to multiple sub-steps. We propose two types of self-consistency that are particularly important for multi-step reasoning -- hypothetical consistency (a model's ability to predict what its output would be in a hypothetical other context) and compositional consistency (consistency of a model's final outputs when intermediate sub-steps are replaced with the model's outputs for those steps). We demonstrate that multiple variants of the GPT-3/-4 models exhibit poor consistency rates across both types of consistency on a variety of tasks.Comment: Added GPT-4 result

    Overview of the NASA Entry, Descent and Landing Systems Analysis Study

    Get PDF
    NASA senior management commissioned the Entry, Descent and Landing Systems Analysis (EDL-SA) Study in 2008 to identify and roadmap the Entry, Descent and Landing (EDL) technology investments that the agency needed to make in order to successfully land large payloads at Mars for both robotic and human-scale missions. This paper summarizes the approach and top-level results from Year 1 of the Study, which focused on landing 10-50 mt on Mars, but also included a trade study of the best advanced parachute design for increasing the landed payloads within the EDL architecture of the Mars Science Laboratory (MSL) mission

    Atmosphere Assessment for MARS Science Laboratory Entry, Descent and Landing Operations

    Get PDF
    On August 6, 2012, the Mars Science Laboratory rover, Curiosity, successfully landed on the surface of Mars. The Entry, Descent and Landing (EDL) sequence was designed using atmospheric conditions estimated from mesoscale numerical models. The models, developed by two independent organizations (Oregon State University and the Southwest Research Institute), were validated against observations at Mars from three prior years. In the weeks and days before entry, the MSL "Council of Atmospheres" (CoA), a group of atmospheric scientists and modelers, instrument experts and EDL simulation engineers, evaluated the latest Mars data from orbiting assets including the Mars Reconnaissance Orbiter's Mars Color Imager (MARCI) and Mars Climate Sounder (MCS), as well as Mars Odyssey's Thermal Emission Imaging System (THEMIS). The observations were compared to the mesoscale models developed for EDL performance simulation to determine if a spacecraft parameter update was necessary prior to entry. This paper summarizes the daily atmosphere observations and comparison to the performance simulation atmosphere models. Options to modify the atmosphere model in the simulation to compensate for atmosphere effects are also presented. Finally, a summary of the CoA decisions and recommendations to the MSL project in the days leading up to EDL is provided

    Cardiometabolic risk factors, peripheral arterial tonometry and metformin in adults with type 1 diabetes participating in the REducing with MetfOrmin Vascular Adverse Lesions trial

    Get PDF
    BACKGROUND: Peripheral arterial tonometry (PAT) provides non-invasive measures of vascular health. Beneficial effects of metformin on vascular function have been reported in youth with type 1 diabetes (T1D). In the REducing with MetfOrmin Vascular Adverse Lesions (REMOVAL) trial in adults with T1D and high cardiovascular risk, we examined: (i) the extent to which routinely-measured cardiometabolic risk factors explain variance in baseline PAT; and (ii) the effects of metformin on PAT measures. METHODS: Cross-sectional univariable and multivariable analyses of baseline reactive hyperaemia index (RHI) and augmentation index (AI) (EndoPAT® (Itamar, Israel); and analysis of 36-months metformin versus placebo on vascular tonometry. RESULTS: In 364 adults ((mean ± SD) age 55.2 ± 8.5 years, T1D 34.0 ± 10.6 years, HbA1c 64.5 ± 9.0 mmol/mol (8.1 ± 0.8%)), RHI was 2.26 ± 0.74 and AI was 15.9 ± 19.2%. In an exhaustive search, independent associates of (i) RHI were smoking, waist circumference, systolic blood pressure and vitamin B12 (adjusted R2 = 0.11) and (ii) AI were male sex, pulse pressure, heart rate and waist circumference (adjusted R2 = 0.31). Metformin did not significantly affect RHI or AI. CONCLUSION: Cardiometabolic risk factors explained only a modest proportion of variance in PAT measures of vascular health in adults with T1D and high cardiovascular risk. PAT measures were not affected by metformin

    Cancer cells exploit an orphan RNA to drive metastatic progression.

    Get PDF
    Here we performed a systematic search to identify breast-cancer-specific small noncoding RNAs, which we have collectively termed orphan noncoding RNAs (oncRNAs). We subsequently discovered that one of these oncRNAs, which originates from the 3' end of TERC, acts as a regulator of gene expression and is a robust promoter of breast cancer metastasis. This oncRNA, which we have named T3p, exerts its prometastatic effects by acting as an inhibitor of RISC complex activity and increasing the expression of the prometastatic genes NUPR1 and PANX2. Furthermore, we have shown that oncRNAs are present in cancer-cell-derived extracellular vesicles, raising the possibility that these circulating oncRNAs may also have a role in non-cell autonomous disease pathogenesis. Additionally, these circulating oncRNAs present a novel avenue for cancer fingerprinting using liquid biopsies

    Quality of Life During Treatment With Chemohormonal Therapy: Analysis of E3805 Chemohormonal Androgen Ablation Randomized Trial in Prostate Cancer

    Get PDF
    Purpose Chemohormonal therapy with docetaxel and androgen deprivation therapy (ADT+D) for metastatic hormone-sensitive prostate cancer improves overall survival as compared with androgen deprivation therapy (ADT) alone. We compared the quality of life (QOL) between patients with metastatic hormone-sensitive prostate cancer who were treated with ADT+D and those who were treated with ADT alone. Methods Men were randomly assigned to ADT+ D (six cycles) or to ADT alone. QOL was assessed by Functional Assessment of Cancer Therapy-Prostate (FACT-P), FACT-Taxane, Functional Assessment of Chronic Illness Therapy-Fatigue, and the Brief Pain Inventory at baseline and at 3, 6, 9, and 12 months. The Wilcoxon signed rank test was used to examine changes over time. Mixed-effect models compared the QOL between arms at each time point. Results Seven hundred ninety men were randomly assigned (ADT+D [n = 397] and ADT[ n = 393]) and completed FACT-P (90% at baseline, 86% at 3 months, 83% at 6 months, 78% at 9 months, and 77% at 12 months). ADT+D patients reported a statistically significant decline in FACT-P at 3 months (P \u3c .001) but FACT-P did not differ significantly between baseline and 12 months (P = .38). ADT+D FACT-P scores were significantly lower at 3 months (P = .02) but significantly higher at 12 months (P = .04) when compared with ADT FACT-P scores. Differences did not exceed the minimal clinically important difference at any time point. ADT+D patients reported significantly lower Functional Assessment of Chronic Illness Therapy-Fatigue scores at 3 months than did ADT patients (P \u3c .001). Over time, both arms reported significantly poorer FACT-Taxane scores (P \u3c .001) when compared with baseline. Brief Pain Inventory scores were similar between arms. Conclusion Although ADT+D was associated with statistically worse QOL at 3 months, QOL was better at 12 months for ADT+D patients than for ADT patients. Both arms reported a similar minimally changed QOL over time, suggesting that ADT+D is not associated with a greater long-term negative impact on QOL

    CD103+ Dendritic Cells Control Th17 Cell Function in the Lung

    Get PDF
    Th17 cells express diverse functional programs while retaining their Th17 identity, in some cases exhibiting a stem-cell-like phenotype. Whereas the importance of Th17 cell regulation in autoimmune and infectious diseases is firmly established, the signaling pathways controlling their plasticity are undefined. Using a mouse model of invasive pulmonary aspergillosis, we found that lung CD103+ dendritic cells (DCs) would produce IL-2, dependent on NFAT signaling, leading to an optimally protective Th17 response. The absence of IL-2 in DCs caused unrestrained production of IL-23 and fatal hyperinflammation, which was characterized by strong Th17 polarization and the emergence of a Th17 stem-cell-like population. Although several cell types may be affected by deficient IL-2 production in DCs, our findings identify the balance between IL-2 and IL-23 productions by lung DCs as an important regulator of the local inflammatory response to infection

    Differential sensitivity of target genes to translational repression by miR-17~92

    Full text link
    MicroRNAs (miRNAs) are thought to exert their functions by modulating the expression of hundreds of target genes and each to a small degree, but it remains unclear how small changes in hundreds of target genes are translated into the specific function of a miRNA. Here, we conducted an integrated analysis of transcriptome and translatome of primary B cells from mutant mice expressing miR-17~92 at three different levels to address this issue. We found that target genes exhibit differential sensitivity to miRNA suppression and that only a small fraction of target genes are actually suppressed by a given concentration of miRNA under physiological conditions. Transgenic expression and deletion of the same miRNA gene regulate largely distinct sets of target genes. miR-17~92 controls target gene expression mainly through translational repression and 5’UTR plays an important role in regulating target gene sensitivity to miRNA suppression. These findings provide molecular insights into a model in which miRNAs exert their specific functions through a small number of key target genesCX is a Pew Scholar in Biomedical Sciences. This study is supported by the PEW Charitable Trusts, Cancer Research Institute, National Institute of Health (R01AI087634, R01AI089854, RC1CA146299, R56AI110403, and R01AI121155 to CX), National Natural Science Foundation of China (31570882 to WHL, 31570883 to NX, 31570911 to GF, 91429301 to JH, 31671428 and 31500665 to YZ), 1000 Young Talents Program of China (K08008 to NX), 100 Talents Program of The Chinese Academy of Sciences (YZ), National Program on Key Basic Research Project of China (2016YFA0501900 to YZ), the Fundamental Research Funds for the Central Universities of China (20720150065 to NX and GF), Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Science, ICT & Future Planning (NRF-2015R1C1A1A01052387 to SGK, NRF-2016R1A4A1010115 to SGK and PHK), and 2016 Research Grant from Kangwon National University (SGK)

    Upper limits on the strength of periodic gravitational waves from PSR J1939+2134

    Get PDF
    The first science run of the LIGO and GEO gravitational wave detectors presented the opportunity to test methods of searching for gravitational waves from known pulsars. Here we present new direct upper limits on the strength of waves from the pulsar PSR J1939+2134 using two independent analysis methods, one in the frequency domain using frequentist statistics and one in the time domain using Bayesian inference. Both methods show that the strain amplitude at Earth from this pulsar is less than a few times 102210^{-22}.Comment: 7 pages, 1 figure, to appear in the Proceedings of the 5th Edoardo Amaldi Conference on Gravitational Waves, Tirrenia, Pisa, Italy, 6-11 July 200
    corecore