3,049 research outputs found
Two Failures of Self-Consistency in the Multi-Step Reasoning of LLMs
Large language models (LLMs) have achieved widespread success on a variety of
in-context few-shot tasks, but this success is typically evaluated via
correctness rather than consistency. We argue that self-consistency is an
important criteria for valid multi-step reasoning in tasks where the solution
is composed of the answers to multiple sub-steps. We propose two types of
self-consistency that are particularly important for multi-step reasoning --
hypothetical consistency (a model's ability to predict what its output would be
in a hypothetical other context) and compositional consistency (consistency of
a model's final outputs when intermediate sub-steps are replaced with the
model's outputs for those steps). We demonstrate that multiple variants of the
GPT-3/-4 models exhibit poor consistency rates across both types of consistency
on a variety of tasks.Comment: Added GPT-4 result
Overview of the NASA Entry, Descent and Landing Systems Analysis Study
NASA senior management commissioned the Entry, Descent and Landing Systems Analysis (EDL-SA) Study in 2008 to identify and roadmap the Entry, Descent and Landing (EDL) technology investments that the agency needed to make in order to successfully land large payloads at Mars for both robotic and human-scale missions. This paper summarizes the approach and top-level results from Year 1 of the Study, which focused on landing 10-50 mt on Mars, but also included a trade study of the best advanced parachute design for increasing the landed payloads within the EDL architecture of the Mars Science Laboratory (MSL) mission
Recommended from our members
Developmental exposures to perfluorooctanesulfonic acid (PFOS) impact embryonic nutrition, pancreatic morphology, and adiposity in the zebrafish, \u3cem\u3eDanio rerio\u3c/em\u3e
Perfluorooctanesulfonic acid (PFOS) is a persistent environmental contaminant previously found in consumer surfactants and industrial fire-fighting foams. PFOS has been widely implicated in metabolic dysfunction across the lifespan, including diabetes and obesity. However, the contributions of the embryonic environment to metabolic disease remain uncharacterized. This study seeks to identify perturbations in embryonic metabolism, pancreas development, and adiposity due to developmental and subchronic PFOS exposures and their persistence into later larval and juvenile periods. Zebrafish embryos were exposed to 16 or 32 μM PFOS developmentally (1–5 days post fertilization; dpf) or subchronically (1–15 dpf). Embryonic fatty acid and macronutrient concentrations and expression of peroxisome proliferator-activated receptor (PPAR) isoforms were quantified in embryos. Pancreatic islet morphometry was assessed at 15 and 30 dpf, and adiposity and fish behavior were assessed at 15 dpf. Concentrations of lauric (C12:0) and myristic (C14:0) saturated fatty acids were increased by PFOS at 4 dpf, and PPAR gene expression was reduced. Incidence of aberrant islet morphologies, principal islet areas, and adiposity were increased in 15 dpf larvae and 30 dpf juvenile fish. Together, these data suggest that the embryonic period is a susceptible window of metabolic programming in response to PFOS exposures, and that these early exposures alone can have persisting effects later in the lifecourse
Atmosphere Assessment for MARS Science Laboratory Entry, Descent and Landing Operations
On August 6, 2012, the Mars Science Laboratory rover, Curiosity, successfully landed on the surface of Mars. The Entry, Descent and Landing (EDL) sequence was designed using atmospheric conditions estimated from mesoscale numerical models. The models, developed by two independent organizations (Oregon State University and the Southwest Research Institute), were validated against observations at Mars from three prior years. In the weeks and days before entry, the MSL "Council of Atmospheres" (CoA), a group of atmospheric scientists and modelers, instrument experts and EDL simulation engineers, evaluated the latest Mars data from orbiting assets including the Mars Reconnaissance Orbiter's Mars Color Imager (MARCI) and Mars Climate Sounder (MCS), as well as Mars Odyssey's Thermal Emission Imaging System (THEMIS). The observations were compared to the mesoscale models developed for EDL performance simulation to determine if a spacecraft parameter update was necessary prior to entry. This paper summarizes the daily atmosphere observations and comparison to the performance simulation atmosphere models. Options to modify the atmosphere model in the simulation to compensate for atmosphere effects are also presented. Finally, a summary of the CoA decisions and recommendations to the MSL project in the days leading up to EDL is provided
Cardiometabolic risk factors, peripheral arterial tonometry and metformin in adults with type 1 diabetes participating in the REducing with MetfOrmin Vascular Adverse Lesions trial
BACKGROUND: Peripheral arterial tonometry (PAT) provides non-invasive measures of vascular health. Beneficial effects of metformin on vascular function have been reported in youth with type 1 diabetes (T1D). In the REducing with MetfOrmin Vascular Adverse Lesions (REMOVAL) trial in adults with T1D and high cardiovascular risk, we examined: (i) the extent to which routinely-measured cardiometabolic risk factors explain variance in baseline PAT; and (ii) the effects of metformin on PAT measures. METHODS: Cross-sectional univariable and multivariable analyses of baseline reactive hyperaemia index (RHI) and augmentation index (AI) (EndoPAT® (Itamar, Israel); and analysis of 36-months metformin versus placebo on vascular tonometry. RESULTS: In 364 adults ((mean ± SD) age 55.2 ± 8.5 years, T1D 34.0 ± 10.6 years, HbA1c 64.5 ± 9.0 mmol/mol (8.1 ± 0.8%)), RHI was 2.26 ± 0.74 and AI was 15.9 ± 19.2%. In an exhaustive search, independent associates of (i) RHI were smoking, waist circumference, systolic blood pressure and vitamin B12 (adjusted R2 = 0.11) and (ii) AI were male sex, pulse pressure, heart rate and waist circumference (adjusted R2 = 0.31). Metformin did not significantly affect RHI or AI. CONCLUSION: Cardiometabolic risk factors explained only a modest proportion of variance in PAT measures of vascular health in adults with T1D and high cardiovascular risk. PAT measures were not affected by metformin
Cancer cells exploit an orphan RNA to drive metastatic progression.
Here we performed a systematic search to identify breast-cancer-specific small noncoding RNAs, which we have collectively termed orphan noncoding RNAs (oncRNAs). We subsequently discovered that one of these oncRNAs, which originates from the 3' end of TERC, acts as a regulator of gene expression and is a robust promoter of breast cancer metastasis. This oncRNA, which we have named T3p, exerts its prometastatic effects by acting as an inhibitor of RISC complex activity and increasing the expression of the prometastatic genes NUPR1 and PANX2. Furthermore, we have shown that oncRNAs are present in cancer-cell-derived extracellular vesicles, raising the possibility that these circulating oncRNAs may also have a role in non-cell autonomous disease pathogenesis. Additionally, these circulating oncRNAs present a novel avenue for cancer fingerprinting using liquid biopsies
Quality of Life During Treatment With Chemohormonal Therapy: Analysis of E3805 Chemohormonal Androgen Ablation Randomized Trial in Prostate Cancer
Purpose
Chemohormonal therapy with docetaxel and androgen deprivation therapy (ADT+D) for metastatic hormone-sensitive prostate cancer improves overall survival as compared with androgen deprivation therapy (ADT) alone. We compared the quality of life (QOL) between patients with metastatic hormone-sensitive prostate cancer who were treated with ADT+D and those who were treated with ADT alone.
Methods
Men were randomly assigned to ADT+ D (six cycles) or to ADT alone. QOL was assessed by Functional Assessment of Cancer Therapy-Prostate (FACT-P), FACT-Taxane, Functional Assessment of Chronic Illness Therapy-Fatigue, and the Brief Pain Inventory at baseline and at 3, 6, 9, and 12 months. The Wilcoxon signed rank test was used to examine changes over time. Mixed-effect models compared the QOL between arms at each time point.
Results
Seven hundred ninety men were randomly assigned (ADT+D [n = 397] and ADT[ n = 393]) and completed FACT-P (90% at baseline, 86% at 3 months, 83% at 6 months, 78% at 9 months, and 77% at 12 months). ADT+D patients reported a statistically significant decline in FACT-P at 3 months (P \u3c .001) but FACT-P did not differ significantly between baseline and 12 months (P = .38). ADT+D FACT-P scores were significantly lower at 3 months (P = .02) but significantly higher at 12 months (P = .04) when compared with ADT FACT-P scores. Differences did not exceed the minimal clinically important difference at any time point. ADT+D patients reported significantly lower Functional Assessment of Chronic Illness Therapy-Fatigue scores at 3 months than did ADT patients (P \u3c .001). Over time, both arms reported significantly poorer FACT-Taxane scores (P \u3c .001) when compared with baseline. Brief Pain Inventory scores were similar between arms.
Conclusion
Although ADT+D was associated with statistically worse QOL at 3 months, QOL was better at 12 months for ADT+D patients than for ADT patients. Both arms reported a similar minimally changed QOL over time, suggesting that ADT+D is not associated with a greater long-term negative impact on QOL
CD103+ Dendritic Cells Control Th17 Cell Function in the Lung
Th17 cells express diverse functional programs while retaining their Th17 identity, in some cases exhibiting a stem-cell-like phenotype. Whereas the importance of Th17 cell regulation in autoimmune and infectious diseases is firmly established, the signaling pathways controlling their plasticity are undefined. Using a mouse model of invasive pulmonary aspergillosis, we found that lung CD103+ dendritic cells (DCs) would produce IL-2, dependent on NFAT signaling, leading to an optimally protective Th17 response. The absence of IL-2 in DCs caused unrestrained production of IL-23 and fatal hyperinflammation, which was characterized by strong Th17 polarization and the emergence of a Th17 stem-cell-like population. Although several cell types may be affected by deficient IL-2 production in DCs, our findings identify the balance between IL-2 and IL-23 productions by lung DCs as an important regulator of the local inflammatory response to infection
Differential sensitivity of target genes to translational repression by miR-17~92
MicroRNAs (miRNAs) are thought to exert their functions by modulating the expression of hundreds of target genes and each to a small degree, but it remains unclear how small changes in hundreds of target genes are translated into the specific function of a miRNA. Here, we conducted an integrated analysis of transcriptome and translatome of primary B cells from mutant mice expressing miR-17~92 at three different levels to address this issue. We found that target genes exhibit differential sensitivity to miRNA suppression and that only a small fraction of target genes are actually suppressed by a given concentration of miRNA under physiological conditions. Transgenic expression and deletion of the same miRNA gene regulate largely distinct sets of target genes. miR-17~92 controls target gene expression mainly through translational repression and 5’UTR plays an important role in regulating target gene sensitivity to miRNA suppression. These findings provide molecular insights into a model in which miRNAs exert their specific functions through a small number of key target genesCX is a Pew Scholar in Biomedical
Sciences. This study is supported by the PEW
Charitable Trusts, Cancer Research Institute,
National Institute of Health (R01AI087634,
R01AI089854, RC1CA146299, R56AI110403, and
R01AI121155 to CX), National Natural Science
Foundation of China (31570882 to WHL, 31570883
to NX, 31570911 to GF, 91429301 to JH,
31671428 and 31500665 to YZ), 1000 Young
Talents Program of China (K08008 to NX), 100
Talents Program of The Chinese Academy of
Sciences (YZ), National Program on Key Basic
Research Project of China (2016YFA0501900 to
YZ), the Fundamental Research Funds for the
Central Universities of China (20720150065 to NX
and GF), Basic Science Research Program through
the National Research Foundation of Korea (NRF)
funded by the Ministry of Science, ICT & Future
Planning (NRF-2015R1C1A1A01052387 to SGK,
NRF-2016R1A4A1010115 to SGK and PHK), and
2016 Research Grant from Kangwon National
University (SGK)
Upper limits on the strength of periodic gravitational waves from PSR J1939+2134
The first science run of the LIGO and GEO gravitational wave detectors
presented the opportunity to test methods of searching for gravitational waves
from known pulsars. Here we present new direct upper limits on the strength of
waves from the pulsar PSR J1939+2134 using two independent analysis methods,
one in the frequency domain using frequentist statistics and one in the time
domain using Bayesian inference. Both methods show that the strain amplitude at
Earth from this pulsar is less than a few times .Comment: 7 pages, 1 figure, to appear in the Proceedings of the 5th Edoardo
Amaldi Conference on Gravitational Waves, Tirrenia, Pisa, Italy, 6-11 July
200
- …