1,875 research outputs found

    Large Language Models are Few-shot Testers: Exploring LLM-based General Bug Reproduction

    Full text link
    Many automated test generation techniques have been developed to aid developers with writing tests. To facilitate full automation, most existing techniques aim to either increase coverage, or generate exploratory inputs. However, existing test generation techniques largely fall short of achieving more semantic objectives, such as generating tests to reproduce a given bug report. Reproducing bugs is nonetheless important, as our empirical study shows that the number of tests added in open source repositories due to issues was about 28% of the corresponding project test suite size. Meanwhile, due to the difficulties of transforming the expected program semantics in bug reports into test oracles, existing failure reproduction techniques tend to deal exclusively with program crashes, a small subset of all bug reports. To automate test generation from general bug reports, we propose LIBRO, a framework that uses Large Language Models (LLMs), which have been shown to be capable of performing code-related tasks. Since LLMs themselves cannot execute the target buggy code, we focus on post-processing steps that help us discern when LLMs are effective, and rank the produced tests according to their validity. Our evaluation of LIBRO shows that, on the widely studied Defects4J benchmark, LIBRO can generate failure reproducing test cases for 33% of all studied cases (251 out of 750), while suggesting a bug reproducing test in first place for 149 bugs. To mitigate data contamination, we also evaluate LIBRO against 31 bug reports submitted after the collection of the LLM training data terminated: LIBRO produces bug reproducing tests for 32% of the studied bug reports. Overall, our results show LIBRO has the potential to significantly enhance developer efficiency by automatically generating tests from bug reports.Comment: Accepted to IEEE/ACM International Conference on Software Engineering 2023 (ICSE 2023

    Basic block coverage for search-based unit testing and crash reproduction

    Get PDF
    Search-based techniques have been widely used for white-box test generation. Many of these approaches rely on the approach level and branch distance heuristics to guide the search process and generate test cases with high line and branch coverage. Despite the positive results achieved by these two heuristics, they only use the information related to the coverage of explicit branches (e.g., indicated by conditional and loop statements), but ignore potential implicit branchings within basic blocks of code. If such implicit branching happens at runtime (e.g., if an exception is thrown in a branchless-method), the existing fitness functions cannot guide the search process. To address this issue, we introduce a new secondary objective, called Basic Block Coverage (BBC), which takes into account the coverage level of relevant basic blocks in the control flow graph. We evaluated the impact of BBC on search-based unit test generation (using the DynaMOSA algorithm) and search-based crash reproduction (using the STDistance and WeightedSum fitness functions). Our results show that for unit test generation, BBC improves the branch coverage of the generated tests. Although small (∼ 1.5%), this improvement in the branch coverage is systematic and leads to an increase of the output domain coverage and implicit runtime exception coverage, and of the diversity of runtime states. In terms of crash reproduction, in the combination of STDistance and WeightedSum, BBC helps in reproducing 3 new crashes for each fitness function. BBC significantly decreases the time required to reproduce 43.5% and 45.1% of the crashes using STDistance and WeightedSum, respectively. For these crashes, BBC reduces the consumed time by 71.7% (for STDistance) and 68.7% (for WeightedSum) on average.Software Engineerin

    An examination of the relationship between teacher burnout and organizational design

    Get PDF
    The purpose of this study was to examine the relationship between teacher burnout and organizational design. To that end, the Maslach Burnout Inventory (BMI) and the organizational design survey (ods) developed by this researcher were completed by 306 teachers in northwestern North Carolina. Demographic differences among the respondents were compared on the MBI and the ods. The hypothesis that no relationship existed between teacher burnout and organizational design was rejected after factor analysis and canonical correlation analysis. The hypothesis that there was no difference in demographic subgroups when compared on the six MBI subscales was rejected. ANOVA performed to test this hypothesis resulted in significant differences on 7 of 13 demographic variables. Significant findings were these: Blacks experience less Emotional Exhaustion than Whites; Those in graduate schools reported more Personal Accomplishment than those not in school; Academic teachers experience more Depersonalization than vocational teachers; Those reporting salary dissatisfaction experience more Depersonalization and Emotional Exhaustion; Those who would not teach again experience more Depersonalization, Emotional Exhaustion, and less Personal Accomplishment

    Simulation analysis of anti-rollover mechanism for vehicles

    Get PDF
    Rollover accidents are considered the most significant safety problems for all classes of light vehicles, especially pickups, Sport Utility Vehicles (SUV), Light Truck Vehicle (LTV) and vans. The main objective of the research is the design of a new mechanism able to keep the vehicle stable under various road conditions and high speeds, to prevent the vehicle from rolling over and to maintain the stability for the vehicle by creating an anti-rolling torque on the vehicle body capable of turning the vehicle smoothly to stable position

    Fundamental Approaches to Software Engineering

    Get PDF
    computer software maintenance; computer software selection and evaluation; formal logic; formal methods; formal specification; programming languages; semantics; software engineering; specifications; verificatio

    Population dynamics of an island population of water voles Arvicola amphibius (Linnaeus, 1758) with one major predator, the eagle owl Bubo bubo (Linnaeus, 1758), in northern Norway

    Get PDF
    Predator–prey relationships are of great significance to ecosystems, and their effects on the population dynamics of voles and lemmings (Microtinae) in Boreal and Arctic environments have long been of particular interest. A simple ecosystem with one major prey and one major predator could be an ideal setting for a study of their interactions. This is the situation on several small islands on the coast of northern Norway just below the Arctic Circle, with populations of water voles Arvicola amphibius preyed upon by the eagle owl Bubo bubo. The population dynamics of the water vole was studied by trapping and tagging in 2003–2018, eagle owl pellets were collected for analyses, eagle owl breeding attempts were recorded, and some weather variables collected from official recordings. After having been introduced well into the study period, the number of sheep Ovis aries was also recorded. Water voles were the main prey of the eagle owl, with 89% occurrence in pellets, with an overrepresentation of adults and males. Both predation, sheep grazing and extreme weather events influenced the vole population. Predator exclusion, as happened in three summers due to an intensive radio tracking study, especially increased the number of surviving young (in particular from the early cohorts) and the mass of adults. Extreme weather events, such as flooding in summer and deeply frozen ground in winter, most significantly reduced vole populations. Sheep grazing may exacerbate the effects of predation. A similar multitude of factors may affect populations of other rodent species as well

    A vulnerability assesment framework for the IMS

    Get PDF
    Includes bibliography.With multimedia services being made available via more and more devices to end users, it is no longer feasible to develop a delivery platform for each new type of service. The IP multimedia subsystem (IMS) aims to provide a unified service delivery platform capable of supporting a wide range of multimedia, data and voice services. It has been developed with a focus on content delivery and rich communications, and has already begun to replace existing legacy GSM network components. The IMS is intended to be an access agnostic platform, capable of providing services over both mobile and fixed networks using a multi-access all-IP platform. By providing a feature-rich all IP platform, operators are able to deploy open IP-based networks, allowing for easy deployment and development of new, rich multimedia centric communication services. With the IMS in place, an operator may take the role of a service broker, providing them with far more revenue generating opportunities than just traditional voice and data. Application services may leverage the functionality provided by the IMS to create new services quickly while allowing them to be easily integrated into the network infrastructure. With the IMS gaining more and more attention from telecoms operators, and already being adopted by some, the ability to assess the security of the system becomes critical to the success of the IMS platform. While the 3GPP has placed emphasis on security throughout the development of the IMS, implementation is left up to vendors looking to create their own IMS systems. Implementation specific vulnerabilities may be missed by standard quality assurance testing, as they may be triggered only by boundary or near boundary conditions, or non-standard or unexpected state transitions
    • …
    corecore