61 research outputs found

    Knowledge Transfer from High-Resource to Low-Resource Programming Languages for Code LLMs

    Full text link
    Over the past few years, Large Language Models of Code (Code LLMs) have started to have a significant impact on programming practice. Code LLMs are also emerging as a building block for research in programming languages and software engineering. However, the quality of code produced by a Code LLM varies significantly by programming languages. Code LLMs produce impressive results on programming languages that are well represented in their training data (e.g., Java, Python, or JavaScript), but struggle with low-resource languages, like OCaml and Racket. This paper presents an effective approach for boosting the performance of Code LLMs on low-resource languages using semi-synthetic data. Our approach generates high-quality datasets for low-resource languages, which can then be used to fine-tune any pretrained Code LLM. Our approach, called MultiPL-T, translates training data from high-resource languages into training data for low-resource languages. We apply our approach to generate tens of thousands of new, validated training items for Racket, OCaml, and Lua from Python. Moreover, we use an open dataset (The Stack) and model (StarCoderBase), which allow us to decontaminate benchmarks and train models on this data without violating the model license. With MultiPL-T generated data, we present fine-tuned versions of StarCoderBase that achieve state-of-the-art performance for Racket, OCaml, and Lua on benchmark problems. For Lua, our fine-tuned model achieves the same performance as StarCoderBase as Python -- a very high-resource language -- on the MultiPL-E benchmarks. For Racket and OCaml, we double their performance on MultiPL-E, bringing their performance close to higher-resource languages such as Ruby and C#

    Governing culture: legislators, interpreters and accountants

    Get PDF
    Cultural policy has become dominated by questions of how to account for the intangible value of government investments. This is as a result of longstanding developments within government’s approaches to policy making, most notably those influenced by practices of audit and accounting. This paper will outline these developments with reference to Peter Miller’s concept of calculative practices, and will argue two central points: first, that there are practical solutions to the problem of measuring the value of culture that connect central government discourses with the discourses of the cultural sector; and second, the paper will demonstrate how academic work has been central to this area of policy making. As a result of the centrality of accounting academics in cultural policy, for example in providing advice on the appropriate measurement tools and techniques, questions are raised about the role academia might take vis-à-vis public policy. Accounting professionals and academics not only provide technical expertise that informs state calculative practices, but also play a surveillance role through the audit and evaluation of government programmes, and act as interpreters in defining terms of performance measurement, success and failure. The paper therefore concludes by reflecting on recent work by Phillip Schlesinger to preserve academic integrity whilst allowing accounting scholars and academics influence and partnership in policy

    Understanding the environmental impacts of large fissure eruptions: Aerosol and gas emissions from the 2014-2015 Holuhraun eruption (Iceland)

    Get PDF
    The 2014-2015 Holuhraun eruption in Iceland, emitted ~11 Tg of SO2 into the troposphere over 6 months, and caused one of the most intense and widespread volcanogenic air pollution events in centuries. This study provides a number of source terms for characterisation of plumes in large fissure eruptions, in Iceland and elsewhere. We characterised the chemistry of aerosol particle matter (PM) and gas in the Holuhraun plume, and its evolution as the plume dispersed, both via measurements and modelling. The plume was sampled at the eruptive vent, and in two populated areas in Iceland. The plume caused repeated air pollution events, exceeding hourly air quality standards (350 ”g/m3) for SO2 on 88 occasions in Reykjahlíð town (100 km distance), and 34 occasions in Reykjavík capital area (250 km distance). Average daily concentration of volcanogenic PM sulphate exceeded 5 ”g/m3 on 30 days in Reykjavík capital area, which is the maximum concentration measured during non-eruptive background interval. There are currently no established air quality standards for sulphate. Combining the results from direct sampling and dispersion modelling, we identified two types of plume impacting the downwind populated areas. The first type was characterised by high concentrations of both SO2 and S-bearing PM, with a high Sgas/SPM mass ratio (SO2(g)/SO42-(PM) >10). The second type had a low Sgas/SPM ratio (<10). We suggest that this second type was a mature plume where sulphur had undergone significant gas-to-aerosol conversion in the atmosphere. Both types of plume were rich in fine aerosol (predominantly PM1 and PM2.5), sulphate (on average ~90% of the PM mass) and various trace species, including heavy metals. The fine size of the volcanic PM mass (75-80% in PM2.5), and the high environmental lability of its chemical components have potential adverse implications for environmental and health impacts. However, only the dispersion of volcanic SO2 was forecast in public warnings and operationally monitored during the eruption. We make a recommendation that sulphur gas-to-aerosol conversion processes, and a sufficiently large model domain to contain the transport of a tropospheric plume on the timescale of days be utilized for public health and environmental impact forecasting in future eruptions in Iceland and elsewhere in the world

    Adaptations Made to Pediatric Consultation-Liaison Psychiatry Service Delivery During the Early Months of the COVID-19 Pandemic: A North American Multisite Survey.

    Get PDF
    BackgroundThe COVID-19 pandemic led to rapid changes in clinical service delivery across hospital systems nationally. Local realities and resources were key driving factors impacting workflow changes, including for pediatric consultation-liaison psychiatry service (PCLPS) providers.ObjectiveThis study aims to describe the early changes implemented by 22 PCLPSs from the United States and Canada during the COVID-19 pandemic. Understanding similarities and differences in adaptations made to PCLPS care delivery can inform best practices and future models of care.MethodsA 20-point survey relating to PCLPS changes during the COVID-19 pandemic was sent to professional listservs. Baseline hospital demographics, hospital and PCLPS workflow changes, and PCLPS experience were collected from March 20 to April 28, 2020, and from August 18 to September 10, 2020. Qualitative data were collected from responding sites. An exploratory thematic analysis approach was used to analyze the qualitative data that were not dependent on predetermined coding themes. Descriptive statistics were calculated using Microsoft Excel.ResultsTwenty-two academic hospitals in the United States and Canada responded to the survey, with an average of 303 beds/hospital. Most respondents (18/22) were children's hospitals. Despite differences in regional impact of COVID-19 and resource availability, there was significant overlap in respondent experiences. Restricted visitation to one caregiver, use of virtual rounding, ongoing trainee involvement, and an overall low number of COVID-positive pediatric patients were common. While there was variability in PCLPS care delivery occurring virtually versus in person, all respondents maintained some level of on-site presence. Technological limitations and pediatric provider preference led to increased on-site presence.ConclusionsTo our knowledge, this is the first multicenter study exploring pandemic-related PCLPS changes in North America. Findings of this study demonstrate that PCLPSs rapidly adapted to COVID-19 realities. Common themes emerged that may serve as a model for future practice. However, important gaps in understanding their effectiveness and acceptability need to be addressed. This multisite survey highlights the importance of establishing consensus through national professional organizations to inform provider and hospital practices

    The face of the other: the particular versus the individual

    Full text link

    Limits to reproduction and seed size-number trade-offs that shape forest dominance and future recovery

    Get PDF
    International audienceThe relationships that control seed production in trees are fundamental to understanding the evolution of forest species and their capacity to recover from increasing losses to drought, fire, and harvest. A synthesis of fecundity data from 714 species worldwide allowed us to examine hypotheses that are central to quantifying reproduction, a foundation for assessing fitness in forest trees. Four major findings emerged. First, seed production is not constrained by a strict trade-off between seed size and numbers. Instead, seed numbers vary over ten orders of magnitude, with species that invest in large seeds producing more seeds than expected from the 1:1 trade-off. Second, gymnosperms have lower seed production than angiosperms, potentially due to their extra investments in protective woody cones. Third, nutrient-demanding species, indicated by high foliar phosphorus concentrations, have low seed production. Finally, sensitivity of individual species to soil fertility varies widely, limiting the response of community seed production to fertility gradients. In combination, these findings can inform models of forest response that need to incorporate reproductive potential

    Limits to reproduction and seed size-number tradeoffs that shape forest dominance and future recovery

    Get PDF
    The relationships that control seed production in trees are fundamental to understanding the evolution of forest species and their capacity to recover from increasing losses to drought, fire, and harvest. A synthesis of fecundity data from 714 species worldwide allowed us to examine hypotheses that are central to quantifying reproduction, a foundation for assessing fitness in forest trees. Four major findings emerged. First, seed production is not constrained by a strict trade-off between seed size and numbers. Instead, seed numbers vary over ten orders of magnitude, with species that invest in large seeds producing more seeds than expected from the 1:1 trade-off. Second, gymnosperms have lower seed production than angiosperms, potentially due to their extra investments in protective woody cones. Third, nutrient-demanding species, indicated by high foliar phosphorus concentrations, have low seed production. Finally, sensitivity of individual species to soil fertility varies widely, limiting the response of community seed production to fertility gradients. In combination, these findings can inform models of forest response that need to incorporate reproductive potential

    Increased Replicative Fitness of a Dengue Virus 2 Clade in Native Mosquitoes: Potential Contribution to a Clade Replacement Event in Nicaragua

    No full text
    UnlabelledThe four dengue virus (DENV) serotypes (DENV serotype 1 [DENV-1] to DENV-4) are transmitted by Aedes aegypti and A. albopictus mosquitoes, causing up to 390 million DENV infections worldwide each year. We previously reported a clade replacement of the DENV-2 Asian-American genotype NI-1 clade by the NI-2B clade in Managua, Nicaragua. Here, we describe our studies of the replicative ability of NI-1 and NI-2B viruses in an A. aegypti cell line (Aag2) and A. aegypti mosquitoes reared from eggs collected in Managua. In coinfection experiments, several different pairs of NI-1 and NI-2B clinical isolates were used to infect Aag2 cells or blood-fed A. aegypti mosquitoes. Results consistently showed a significant replicative advantage of NI-2B over NI-1 viruses early after infection in vitro, and in mosquitoes, NI-2B viruses attained a higher replicative index than NI-1 isolates 3 to 7 days postinfection (dpi). At 7 dpi, NI-2B viruses displayed a significantly higher replicative index in legs and salivary glands; however, this advantage was lost by 14 and 21 dpi. We also found that the percentage of mosquitoes in which NI-2B viruses were dominant was significantly higher than that in which NI-1 viruses were dominant on day 7 but not at later time points. Taken together, these data demonstrate that clade NI-2B holds a replicative advantage over clade NI-1 early in infection that wanes at later time points. This early fitness advantage of NI-2B viruses over NI-1 viruses in the native vector, A. aegypti, suggests a shorter extrinsic incubation period for NI-2B viruses, which could have contributed to the clade replacement event in Managua.ImportanceDengue virus (DENV), one of the most medically important arthropod-borne viruses, is transmitted to humans by Aedes aegypti and A. albopictus mosquitoes in tropical and subtropical regions worldwide. Dengue epidemics continue to increase in frequency, geographic range, and severity and are a major public health concern. This is due to globalization, unplanned urbanization, and climate change, as well as host genetics and immune responses and viral genetic changes. DENV consists of four serotypes, in turn composed of genotypes and genetically distinct clades. What drives the frequent replacement of a previously circulating DENV clade by another is unclear. Here, we investigate the replicative fitness of two clades of DENV serotype 2 in Aedes aegypti cells and mosquitoes collected from the region where the viruses circulated and conclude that increased replicative fitness could have contributed to a DENV clade replacement event in Nicaragua. These findings provide insight into vector-driven evolution of DENV epidemics
    • 

    corecore