3 research outputs found

    RLTF: Reinforcement Learning from Unit Test Feedback

    Full text link
    The goal of program synthesis, or code generation, is to generate executable code based on given descriptions. Recently, there has been an increasing number of studies employing reinforcement learning (RL) to improve the performance of large language models (LLMs) for code. However, these RL methods have only used offline frameworks, limiting their exploration of new sample spaces. Additionally, current approaches that utilize unit test signals are rather simple, not accounting for specific error locations within the code. To address these issues, we proposed RLTF, i.e., Reinforcement Learning from Unit Test Feedback, a novel online RL framework with unit test feedback of multi-granularity for refining code LLMs. Our approach generates data in real-time during training and simultaneously utilizes fine-grained feedback signals to guide the model towards producing higher-quality code. Extensive experiments show that RLTF achieves state-of-the-art performance on the APPS and the MBPP benchmarks. Our code can be found at: https://github.com/Zyq-scut/RLTF

    Spatial-temporal mapping and risk factors for hand foot and mouth disease in northwestern inland China.

    No full text
    BackgroundHand foot and mouth disease (HFMD) is becoming one of the common human infectious diseases in China. Previous studies have described HFMD in tropical or coastal areas of Asia-Pacific countries. However, limited studies have thoroughly studied the epidemiology and potential risk factors for HFMD in inland areas with complex environmental conditions.Methodology/principal findingsUsing the data from 2009 to 2018 on reported cases of Xinjiang Uighur Autonomous Region, we characterized the epidemic features of HFMD. Panel negative binomial model was used to identify climate, geographical and demographic determinants for HFMD incidence. A total of 70856 HFMD cases (average annual incidence: 305 per million persons) were reported in Xinjiang during the 10-year study period, of which 10393 (14.7%) were laboratory-confirmed and 98 (0.1%) were severe. HFMD peaked in summer every year during the study period, and incidence in 2012, 2015, 2016 and 2018 had minor peaks in autumn. After adjusting the school or holiday month, multiple factors were found to affect HFMD epidemiology: urban area being major land cover type (incidence risk ratio, IRR 2.08; 95% CI 1.50, 2.89), higher gross domestic product per capita (IRR 1.14; 95% CI 1.11, 1.16), rise in monthly average temperature (IRR 1.65; 95% CI 1.61, 1.69) and monthly accumulative precipitation (IRR 1.20; 95% CI 1.16, 1.24) predicted increase in the incidence of HFMD; farmland being major land cover type (IRR 0.72; 95% CI 0.64, 0.81), an increase of percentage of the minority (IRR 0.91; 95% CI 0.89, 0.93) and population density (IRR 0.98; 95% CI 0.98, 0.99) were related to a decrease in the incidence of HFMD.Conclusions/significanceIn conclusion, the epidemic status of HFMD in Xinjiang is characterized by low morbidity and fatality. Multiple factors have significant influences on the occurrence and transmission of HFMD in Xinjiang

    Magnetic sensitivity of cryptochrome 4 from a migratory songbird

    No full text
    Night-migratory songbirds are remarkably proficient navigators. Flying alone and often over great distances, they use various directional cues including, crucially, a light-dependent magnetic compass. The mechanism of this compass has been suggested to rely on the quantum spin dynamics of photoinduced radical pairs in cryptochrome flavoproteins located in the retinas of the birds. Here we show that the photochemistry of cryptochrome 4 (CRY4) from the night-migratory European robin (Erithacus rubecula) is magnetically sensitive in vitro, and more so than CRY4 from two non-migratory bird species, chicken (Gallus gallus) and pigeon (Columba livia). Site-specific mutations of ErCRY4 reveal the roles of four successive flavin–tryptophan radical pairs in generating magnetic field effects and in stabilizing potential signalling states in a way that could enable sensing and signalling functions to be independently optimized in night-migratory birds
    corecore