11 research outputs found

    Aligning Offline Metrics and Human Judgments of Value for Code Generation Models

    Full text link
    Large language models have demonstrated great potential to assist programmers in generating code. For such human-AI pair programming scenarios, we empirically demonstrate that while generated code is most often evaluated in terms of their functional correctness (i.e., whether generations pass available unit tests), correctness does not fully capture (e.g., may underestimate) the productivity gains these models may provide. Through a user study with N = 49 experienced programmers, we show that while correctness captures high-value generations, programmers still rate code that fails unit tests as valuable if it reduces the overall effort needed to complete a coding task. Finally, we propose a hybrid metric that combines functional correctness and syntactic similarity and show that it achieves a 14% stronger correlation with value and can therefore better represent real-world gains when evaluating and comparing models.Comment: Accepted at ACL 2023 (Findings

    Effect of early tranexamic acid administration on mortality, hysterectomy, and other morbidities in women with post-partum haemorrhage (WOMAN): an international, randomised, double-blind, placebo-controlled trial

    Get PDF
    Background Post-partum haemorrhage is the leading cause of maternal death worldwide. Early administration of tranexamic acid reduces deaths due to bleeding in trauma patients. We aimed to assess the effects of early administration of tranexamic acid on death, hysterectomy, and other relevant outcomes in women with post-partum haemorrhage. Methods In this randomised, double-blind, placebo-controlled trial, we recruited women aged 16 years and older with a clinical diagnosis of post-partum haemorrhage after a vaginal birth or caesarean section from 193 hospitals in 21 countries. We randomly assigned women to receive either 1 g intravenous tranexamic acid or matching placebo in addition to usual care. If bleeding continued after 30 min, or stopped and restarted within 24 h of the first dose, a second dose of 1 g of tranexamic acid or placebo could be given. Patients were assigned by selection of a numbered treatment pack from a box containing eight numbered packs that were identical apart from the pack number. Participants, care givers, and those assessing outcomes were masked to allocation. We originally planned to enrol 15 000 women with a composite primary endpoint of death from all-causes or hysterectomy within 42 days of giving birth. However, during the trial it became apparent that the decision to conduct a hysterectomy was often made at the same time as randomisation. Although tranexamic acid could influence the risk of death in these cases, it could not affect the risk of hysterectomy. We therefore increased the sample size from 15 000 to 20 000 women in order to estimate the effect of tranexamic acid on the risk of death from post-partum haemorrhage. All analyses were done on an intention-to-treat basis. This trial is registered with ISRCTN76912190 (Dec 8, 2008); ClinicalTrials.gov, number NCT00872469; and PACTR201007000192283. Findings Between March, 2010, and April, 2016, 20 060 women were enrolled and randomly assigned to receive tranexamic acid (n=10 051) or placebo (n=10 009), of whom 10 036 and 9985, respectively, were included in the analysis. Death due to bleeding was significantly reduced in women given tranexamic acid (155 [1·5%] of 10 036 patients vs 191 [1·9%] of 9985 in the placebo group, risk ratio [RR] 0·81, 95% CI 0·65–1·00; p=0·045), especially in women given treatment within 3 h of giving birth (89 [1·2%] in the tranexamic acid group vs 127 [1·7%] in the placebo group, RR 0·69, 95% CI 0·52–0·91; p=0·008). All other causes of death did not differ significantly by group. Hysterectomy was not reduced with tranexamic acid (358 [3·6%] patients in the tranexamic acid group vs 351 [3·5%] in the placebo group, RR 1·02, 95% CI 0·88–1·07; p=0·84). The composite primary endpoint of death from all causes or hysterectomy was not reduced with tranexamic acid (534 [5·3%] deaths or hysterectomies in the tranexamic acid group vs 546 [5·5%] in the placebo group, RR 0·97, 95% CI 0·87-1·09; p=0·65). Adverse events (including thromboembolic events) did not differ significantly in the tranexamic acid versus placebo group. Interpretation Tranexamic acid reduces death due to bleeding in women with post-partum haemorrhage with no adverse effects. When used as a treatment for postpartum haemorrhage, tranexamic acid should be given as soon as possible after bleeding onset. Funding London School of Hygiene & Tropical Medicine, Pfizer, UK Department of Health, Wellcome Trust, and Bill & Melinda Gates Foundation

    Evolution of Mobile Money Technologies in Developing Nations: Successes and Lessons.

    No full text

    Data2Vis: Automatic Generation of Data Visualizations Using Sequence-to-Sequence Recurrent Neural Networks

    No full text

    Understanding the Disclosure of Private Healthcare Information within Online Quantified Self 2.0 Platforms

    No full text
    The quantified self-movement encourages a continuous tracking of data points regarding a person’s daily activities through wearable sensors, and thus has important implications for health and wellness. With the advent of sophisticated low-cost wearable computing devices, online communities that facilitate social interaction and exchange of wearable data (Quantified Self 2.0 platforms) have also emerged. Although security and privacy disclosure has been studied within online social networks and online health communities, little has been done to understand how individual and group characteristics influence the disclosure behaviour regarding highly sensitive personal information gathered from wearable sensors (e.g., sleep, nutrition, mood, performance, ambient conditions). Using data collected from 43 Fitbit groups which consist of 5300 Asian users within the Fitbit online community, we examine the influence of group characteristics (size, posts, average steps) and individual attributes on privacy disclosure behaviour. Results from our hierarchical linear modelling analysis suggests that attributes such as group size and individual posts are associated with increased privacy data disclosure, whilst we surprisingly find that when other group members have higher health performance or are more active, individuals are more likely to disclose less healthcare information. Based on these findings, theoretical and practical implications are discussed

    A Cognitive Assistant for Visualizing and Analyzing Exoplanets

    No full text
    We demonstrate an embodied cognitive agent that helps scientists visualize and analyze exo-planets and their host stars. The prototype is situated in a room equipped with a large display, microphones, cameras, speakers, and pointing devices. Users communicate with the agent via speech, gestures, and combinations thereof, and it responds by displaying content and generating synthesized speech. Extensive use of context facilitates natural interaction with the agent
    corecore