
    Mutation Testing Advances: An Analysis and Survey


    Learning Code Transformations via Neural Machine Translation

    Source code inevitably evolves to remain useful, secure, correct, readable, and efficient. Developers perform software evolution and maintenance activities by transforming existing source code via corrective, adaptive, perfective, and preventive changes. These code changes are usually managed and stored by a variety of tools and infrastructures such as version control, issue trackers, and code review systems. Software Evolution and Maintenance researchers have been mining these code archives in order to distill useful insights into the nature of developers' activities. One of the long-standing goals of Software Engineering research is to better support and automate the different types of code changes performed by developers. In this thesis we depart from classic manually crafted rule- or heuristic-based approaches and propose a novel technique to learn code transformations by leveraging the vast amount of publicly available code changes performed by developers. We rely on Deep Learning, and in particular on Neural Machine Translation (NMT), to train models able to learn code change patterns and apply them to novel, unseen source code.

    First, we tackle the problem of generating source code mutants for Mutation Testing. In contrast with classic approaches, which rely on handcrafted mutation operators, we propose to automatically learn how to mutate source code by observing real faults. We mine millions of bug-fixing commits from GitHub, then process and abstract their source code. This data is used to train and evaluate an NMT model that translates fixed code into buggy code (i.e., the mutated code). In the second project, we rely on the same dataset of bug fixes to learn code transformations for the purpose of Automated Program Repair (APR). This is one of the most challenging research problems in Software Engineering, whose goal is to automatically fix bugs without developers' intervention. We train a model to translate buggy code into fixed code (i.e., learning patches) and, in conjunction with Beam Search, generate many different candidate patches for a given buggy method. In our empirical investigation we found that such a model is able to fix thousands of unique buggy methods in the wild. Finally, in our third project we push our novel technique to its limits and enlarge the scope to consider not only bug-fixing activities but any type of meaningful code change performed by developers. We focus on accepted and merged code changes that underwent a Pull Request (PR) process. We quantitatively and qualitatively investigate the code transformations learned by the model to build a taxonomy. The taxonomy shows that NMT can replicate a wide variety of meaningful code changes, especially refactorings and bug-fixing activities.

    In this dissertation we illustrate and evaluate the proposed techniques, which represent a significant departure from earlier approaches in the literature. The promising results corroborate the potential applicability of learning techniques, such as NMT, to a variety of Software Engineering tasks.
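    As a rough sketch of the abstraction step described above, the following Python fragment replaces identifiers and literals with typed, numbered placeholders so that an NMT model sees a small, consistent vocabulary. The regex tokenizer, the keyword list, and the VAR_n/INT_n naming scheme are illustrative assumptions, not the thesis's exact pipeline.

```python
# Minimal sketch: abstract a Java-like snippet by mapping identifiers and
# integer literals to numbered placeholders (assumed scheme, for illustration).
import re

JAVA_KEYWORDS = {"public", "int", "return", "if", "else", "for", "while", "void"}

def abstract_code(source: str):
    tokens = re.findall(r"[A-Za-z_]\w*|\d+|\S", source)
    mapping, abstracted = {}, []
    counters = {"VAR": 0, "INT": 0}
    for tok in tokens:
        is_word = tok[0].isalpha() or tok[0] == "_" or tok.isdigit()
        if tok in JAVA_KEYWORDS or not is_word:
            abstracted.append(tok)          # keep keywords and punctuation
            continue
        kind = "INT" if tok.isdigit() else "VAR"
        if tok not in mapping:              # first occurrence gets a fresh id
            counters[kind] += 1
            mapping[tok] = f"{kind}_{counters[kind]}"
        abstracted.append(mapping[tok])
    return " ".join(abstracted), mapping

abstracted, mapping = abstract_code("public int add(int a, int b) { return a + b + 42; }")
print(abstracted)
# public int VAR_1 ( int VAR_2 , int VAR_3 ) { return VAR_2 + VAR_3 + INT_1 ; }
```

    The mapping is kept so that, once the model has translated the abstracted code, placeholders can be mapped back to concrete identifiers and literals.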

    Search-based Unit Test Generation for Evolving Software

    Search-based software testing has been successfully applied to generate unit test cases for object-oriented software. Typically, in search-based test generation approaches, evolutionary search algorithms are guided by code coverage criteria such as branch coverage to generate tests for individual coverage objectives. Although this approach has been shown to be effective, fundamental open questions remain. In particular, which criteria should test generation use in order to produce the best test suites? Which evolutionary algorithms are more effective at generating test cases with high coverage? How can search-based unit test generation scale up to software projects consisting of large numbers of components that evolve and change frequently over time? As a result of these open questions, the applicability of search-based test generation techniques in practice is still fundamentally limited.

    In order to answer them, we investigate the following improvements to search-based testing. First, we propose the simultaneous optimisation of several coverage criteria using an evolutionary algorithm, rather than optimising for individual criteria. We then perform an empirical evaluation of different evolutionary algorithms to understand the influence of each one on the test optimisation problem. We further extend coverage-based test generation with a non-functional criterion to increase the likelihood of detecting faults as well as to help developers identify their locations. Finally, we propose several strategies and tools to efficiently apply search-based test generation techniques to large and evolving software projects.

    Our results show that, overall, the optimisation of several coverage criteria is efficient; there is indeed an evolutionary algorithm that clearly works better for the test generation problem than others; the extended coverage-based test generation is effective at revealing and localising faults; and our proposed strategies, specifically designed to test entire software projects in a continuous way, improve efficiency and lead to higher code coverage. Consequently, the techniques and toolset presented in this thesis, which supports all the contributions described here, bring search-based software testing one step closer to practical usage by equipping software engineers with the state of the art in automated test generation.
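    To make the simultaneous optimisation of several criteria concrete, here is a minimal, self-contained genetic algorithm that evolves small test suites for a toy function and scores each suite on two criteria at once: branch coverage and output diversity. The toy program, the fitness weighting, and the mutation scheme are assumptions for illustration only; they do not reproduce any specific tool.

```python
# Minimal sketch: a GA evolving test suites (lists of ints) under a
# combined two-criterion fitness. All parameters are illustrative.
import random

ALL_BRANCHES = {"neg", "nonneg", "even", "odd"}

def program_under_test(x):
    branches = set()
    if x < 0:
        branches.add("neg"); y = -x
    else:
        branches.add("nonneg"); y = x
    branches.add("even" if y % 2 == 0 else "odd")
    return y, branches

def fitness(suite):
    covered, outputs = set(), set()
    for x in suite:
        y, branches = program_under_test(x)
        covered |= branches
        outputs.add(y)
    # two criteria, each normalised to [0, 1], optimised simultaneously
    return len(covered) / len(ALL_BRANCHES) + min(len(outputs), 4) / 4

def evolve(pop_size=20, suite_len=4, generations=50):
    pop = [[random.randint(-10, 10) for _ in range(suite_len)]
           for _ in range(pop_size)]
    for _ in range(generations):
        pop.sort(key=fitness, reverse=True)
        parents = pop[: pop_size // 2]           # truncation selection
        children = [[g if random.random() > 0.3 else random.randint(-10, 10)
                     for g in p] for p in parents]
        pop = parents + children
    return max(pop, key=fitness)

best = evolve()
print(best, fitness(best))  # e.g. a suite covering all four branches
```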

    Debugging Type Errors with a Blackbox Compiler

    Type error debugging can be a laborious yet necessary process for programmers of statically typed functional programming languages. A compiler often compounds this by inaccurately reporting the location of a type error, a problem that has been a subject of research for over thirty years. Despite this long history, the proposed solutions usually rely on direct modifications to the compiler, often distributed in the form of patches. These patches add another layer of arduous activity to the task of debugging, since they must be kept up to date with the ever-changing programming language they support. This thesis investigates an additional option: the blackbox compiler. Split into three central parts, it shows the individual solutions involved in using a blackbox compiler to debug type errors in functional programming languages. First, it demonstrates how the combination of a blackbox compiler and a generic debugging algorithm can successfully locate type errors. Next, it tackles a side effect of this new combination, the introduction of extra errors, with a new speed-boosted algorithm, evaluated using a proposed framework based on Data Science techniques that quantifies the quality of a type error debugger. Lastly, the algorithms employed throughout this thesis, together with the blackbox compiler, are language-agnostic: they do not need language-specific knowledge. The final part therefore uses these agnostic abilities to build an agnostic debugger that locates type errors.
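    The core blackbox idea can be sketched in a few lines: treat the compiler as an external oracle and implicate a definition if replacing it with a trivially well-typed stub makes the type error disappear. The sketch below assumes GHC as the blackbox (invoked with -fno-code, which type-checks without generating code) and an "undefined" stub; both are illustrative choices, not the thesis's actual algorithm.

```python
# Minimal sketch: locate a Haskell type error by stubbing out one
# definition at a time and asking a blackbox compiler whether the
# program now type-checks. Assumes GHC is on the PATH.
import os, subprocess, tempfile

def compiles(source: str) -> bool:
    with tempfile.NamedTemporaryFile("w", suffix=".hs", delete=False) as f:
        f.write(source)
        path = f.name
    try:
        result = subprocess.run(["ghc", "-fno-code", path], capture_output=True)
        return result.returncode == 0
    finally:
        os.remove(path)

def locate_type_error(lines):
    suspects = []
    for i, line in enumerate(lines):
        if "=" not in line:              # only stub out definitions
            continue
        stub = line.split("=")[0] + "= undefined"
        candidate = "\n".join(lines[:i] + [stub] + lines[i + 1:])
        if compiles(candidate):          # error gone: this line is suspect
            suspects.append(i + 1)
    return suspects

program = ["f :: Int -> Int",
           "f x = x + True",             # ill-typed definition
           "main = print (f 1)"]
print(locate_type_error(program))        # expected: [2]
```

    Because the compiler is only ever run as an external process, the same loop works with any compiler that reports success or failure, which is what makes the approach agnostic.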

    Grammar-based fuzzing using input features

    In grammar-based fuzz testing, a formal grammar is used to produce test inputs that are syntactically valid in order to reach the business logic of a program under test. In this setting, it is advantageous to ensure a high diversity of inputs in order to test more of the program's behavior. How can we characterize the features that make inputs diverse and associate them with the execution of particular parts of the program? Previous work does not answer this question satisfactorily: most attempts consider mainly superficial features defined by the structure of the grammar, such as the presence of production rules or terminal symbols, regardless of their context. We present a measure of input coverage called k-path coverage, which takes into account combinations of grammar entities up to a given context depth k, and makes it possible to efficiently express, assess, and achieve input diversity. In a series of experiments, we demonstrate and evaluate how to systematically attain k-path coverage, and how it correlates with code coverage and can thus be used as its predictor. By automatically inferring explicit associations between k-path features and the coverage of individual methods, we further show how to generate inputs that specifically target the execution of given code locations. We expect the presented instrument of k-paths to prove useful in numerous additional applications, such as assessing the quality of grammars, serving as an adequacy criterion for input test suites, enabling test case prioritization, facilitating program comprehension, and perhaps beyond.
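    As a sketch of how k-path coverage can be computed, the fragment below walks a derivation tree and collects every window of k consecutive symbols along root-to-leaf paths; the set of distinct windows is the input-diversity measure. The nested-tuple tree encoding and the toy expression grammar are illustrative assumptions.

```python
# Minimal sketch: extract k-paths (length-k windows along root-to-leaf
# paths) from a derivation tree encoded as (symbol, children) tuples.
def k_paths(tree, k):
    paths = set()

    def walk(node, ancestors):
        symbol, children = node
        chain = ancestors + [symbol]
        if len(chain) >= k:
            paths.add(tuple(chain[-k:]))   # window ending at this symbol
        for child in children:
            walk(child, chain)

    walk(tree, [])
    return paths

# derivation tree of the input "1+2" under a toy expression grammar
tree = ("<expr>", [
    ("<expr>", [("<digit>", [("1", [])])]),
    ("+", []),
    ("<expr>", [("<digit>", [("2", [])])]),
])
for path in sorted(k_paths(tree, 3)):
    print(path)
# ('<expr>', '<digit>', '1'), ('<expr>', '<digit>', '2'),
# ('<expr>', '<expr>', '<digit>')
```

    A fuzzer can then steer generation toward productions whose k-paths have not yet been covered, which is how input diversity is achieved rather than merely measured.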

    A User-aware Intelligent Refactoring for Discrete and Continuous Software Integration

    Successful software products evolve through a process of continual change. However, this process may weaken the design of the software and make it unnecessarily complex, leading to significantly reduced productivity and increased fault-proneness. Refactoring improves the software design while preserving overall functionality and behavior, and is an important technique in managing the growing complexity of software systems. Most of the existing work on software refactoring uses either an entirely manual or a fully automated approach. Manual refactoring is time-consuming, error-prone, and unsuitable for large-scale, radical refactoring. Fully automated refactoring, meanwhile, yields a static list of refactorings which, when applied, leads to a new and often hard-to-comprehend design. In addition, it is challenging to merge these refactorings with other changes performed in parallel by developers.

    In this thesis, we propose a refactoring recommendation approach that dynamically adapts and interactively suggests refactorings to developers and takes their feedback into consideration. Our approach uses the Non-dominated Sorting Genetic Algorithm (NSGA-II) to find a set of good refactoring solutions that improve software quality while minimizing the deviation from the initial design. These refactoring solutions are then analyzed to extract interesting common features, such as the refactorings that occur frequently in the best non-dominated solutions. We combine our interactive approach with unsupervised learning to reduce the developer's interaction effort when refactoring a system. The unsupervised learning algorithm clusters the different trade-off solutions, called the Pareto front, to guide developers in selecting their regions of interest and to reduce the number of refactoring options to explore. To reduce the interaction effort further, we propose an approach that converts the multi-objective search into a mono-objective one after interacting with the developer to identify a good refactoring solution based on their preferences. Since developers may want to focus on specific code locations, the "Decision Space" is also important. Our interactive approach therefore enables developers to pinpoint their preferences simultaneously in the objective (quality metrics) and decision (code location) spaces. Finally, given the urgent need for refactoring tools that can support continuous integration and recent development processes such as DevOps that are based on rapid releases, we propose, for the first time, an intelligent software refactoring bot, called RefBot. Our bot continuously monitors the software repository and finds the best sequence of refactorings to fix quality issues in Continuous Integration/Continuous Development (CI/CD) environments, delivered as a set of pull requests generated after mining previous code changes to understand the profiles of developers. We quantitatively and qualitatively evaluated the performance and effectiveness of our proposed approaches via a set of studies conducted with experienced developers who used our tools on both open source and industry projects.
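    The multi-objective core of the approach can be illustrated with a small Pareto-dominance computation: candidate refactoring solutions are scored on two objectives (quality improvement to maximise, deviation from the initial design to minimise), and only the non-dominated ones form the Pareto front offered to the developer. The toy candidates and the two-objective formulation are illustrative assumptions; the thesis applies NSGA-II over a richer objective space.

```python
# Minimal sketch: Pareto front over (quality_gain, design_deviation),
# where higher gain and lower deviation are better. Values are invented.
def dominates(a, b):
    return (a[0] >= b[0] and a[1] <= b[1]) and (a[0] > b[0] or a[1] < b[1])

def pareto_front(solutions):
    return [s for s in solutions
            if not any(dominates(o, s) for o in solutions if o is not s)]

candidates = {
    "move_method + extract_class": (0.9, 0.7),
    "extract_class":               (0.6, 0.3),
    "rename_only":                 (0.2, 0.1),
    "inline + move_field":         (0.5, 0.5),  # dominated by extract_class
}
front = pareto_front(list(candidates.values()))
print([name for name, score in candidates.items() if score in front])
# ['move_method + extract_class', 'extract_class', 'rename_only']
```

    Clustering this front, as the thesis describes, then lets a developer explore one region of trade-offs at a time instead of the full set of solutions.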

    Automated Realistic Test Input Generation and Cost Reduction in Service-centric System Testing

    Service-centric System Testing (ScST) is more challenging than testing traditional software due to the complexity of service technologies and the limitations imposed by the SOA environment. One of the most important problems in ScST is realistic test data generation. Realistic test data is often generated manually or drawn from an existing source, which makes it hard to automate and laborious to produce. Another limitation that makes ScST challenging is the cost associated with invoking services during the testing process. This thesis aims to provide solutions to these two problems: automated realistic input generation and cost reduction in ScST.

    To automate realistic test data generation, the concept of Service-centric Test Data Generation (ScTDG) is presented, in which existing services are used as realistic data sources. ScTDG minimises the need for tester input and the dependence on existing data sources by automatically generating service compositions that can generate the required test data. In our experimental analysis, the approach achieved success rates between 93% and 100% in generating realistic data, while state-of-the-art automated test data generation achieved only between 2% and 34%. The thesis addresses cost concerns at the test data generation level by enabling data source selection in ScTDG. Source selection in ScTDG has many dimensions, such as cost, reliability, and availability. This thesis formulates the problem as an optimisation problem and presents a multi-objective characterisation of service selection in ScTDG, aiming to reduce the cost of test data generation. A cost-aware Pareto-optimal test suite minimisation approach addressing testing cost concerns during test execution is also presented. The approach adapts traditional multi-objective minimisation approaches to the ScST domain by formulating ScST concerns such as invocation cost and test case reliability. In experimental analysis, the approach achieved reductions between 69% and 98.6% in the monetary cost of service invocations during testing.
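    As a concrete stand-in for the cost-aware minimisation step, the greedy sketch below repeatedly selects the test case with the best coverage-per-cost ratio until no further requirement can be covered. A greedy heuristic is used here instead of the thesis's Pareto-optimal multi-objective formulation, and the costs and coverage sets are invented for illustration.

```python
# Minimal sketch: greedy cost-aware test suite minimisation.
# tests maps a name to (covered_requirements, invocation_cost).
def minimise(tests, requirements):
    selected, covered = [], set()
    remaining = dict(tests)
    while covered < requirements and remaining:
        name, (reqs, cost) = max(
            remaining.items(),
            key=lambda item: len(item[1][0] - covered) / item[1][1],
        )
        if not (reqs - covered):
            break                     # no remaining test adds coverage
        selected.append(name)
        covered |= reqs
        del remaining[name]
    return selected, covered

tests = {
    "t1": ({"r1", "r2"}, 0.10),       # cheap and broad
    "t2": ({"r2", "r3"}, 0.50),       # expensive
    "t3": ({"r3"},       0.05),       # cheap and narrow
}
print(minimise(tests, {"r1", "r2", "r3"}))
# (['t1', 't3'], {'r1', 'r2', 'r3'})
```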