3,974 research outputs found

    Evolutionary improvement of programs

    Most applications of genetic programming (GP) involve the creation of an entirely new function, program or expression to solve a specific problem. In this paper, we propose a new approach that applies GP to improve existing software by optimizing its non-functional properties, such as execution time, memory usage, or power consumption. In general, satisfying non-functional requirements is a difficult task, often addressed in part by optimizing compilers. However, modern compilers are not always able to produce semantically equivalent alternatives that optimize non-functional properties, even when such alternatives are known to exist: this is usually due to the limited, local nature of such optimizations. In this paper, we discuss how best to combine and extend the existing evolutionary methods of GP, multi-objective optimization, and coevolution in order to improve existing software. Given the implementation of a function as input, we attempt to evolve a semantically equivalent version, in this case optimized to reduce execution time subject to a given probability distribution of inputs. We demonstrate on eight example functions that our framework is able to produce non-obvious optimizations that compilers are not yet able to generate. We employ a coevolved population of test cases to encourage the preservation of the function's semantics. We exploit the original program both to seed the population, in order to focus the search, and as an oracle for testing purposes. As well as discussing the issues that arise when attempting to improve software, we employ a rigorous experimental method to provide interesting and practical insights into how to address these issues.
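    The overall loop the abstract describes (seed with the original, use it as an oracle, select lexicographically on correctness then runtime) can be sketched as follows. This is a minimal illustration, not the paper's actual framework: `make_mutant`, the fixed test set, and the lexicographic fitness are simplifying assumptions standing in for the multi-objective and coevolutionary machinery.

```python
import random
import time

def evolve_variant(original, make_mutant, test_inputs,
                   generations=100, pop_size=50):
    """Evolve a faster, semantically equivalent variant of `original`.

    `make_mutant` is a hypothetical operator returning a perturbed copy of a
    candidate program. The population is seeded with the original program.
    """
    population = [original] * pop_size  # seed the search with the original

    def fitness(candidate):
        # Semantic error: disagreements with the original (the oracle)
        # on the current set of test inputs.
        errors = sum(candidate(x) != original(x) for x in test_inputs)
        start = time.perf_counter()
        for x in test_inputs:
            candidate(x)
        runtime = time.perf_counter() - start
        # Lexicographic comparison: correctness dominates, runtime ties.
        return (errors, runtime)

    for _ in range(generations):
        population.sort(key=fitness)
        survivors = population[: pop_size // 2]
        population = survivors + [make_mutant(random.choice(survivors))
                                  for _ in range(pop_size - len(survivors))]
        # A full implementation would also coevolve `test_inputs` here,
        # favouring tests that expose differences from the original.
    return min(population, key=fitness)
```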

    A Fitness Function for Search-based Testing of Java Classes, which is Based on the States Reached by the Object under Test

    Genetic algorithms are among the most efficient search-based techniques for automatically generating unit test cases today. The search is guided by a fitness function which evaluates how close an individual is to satisfying a given coverage goal. Several coverage criteria exist, but the default criterion today is branch coverage. Nevertheless, achieving high or full branch coverage does not imply that the generated test suite has good quality. In object-oriented programs, the state of the object affects its behavior. Therefore, test cases that put the object under test in new states are of interest in the testing context. In this article, we propose a new fitness function which takes three factors into consideration: the approach level, the branch distance, and the new states reached by a test case. The coverage targets are still the branches, but during the search the state of the object under test evolves, with the aim of producing individuals that exercise interesting features of the class and, as a consequence, can discover errors. We implemented this fitness function in the eToc tool. In our experiments, using the proposed fitness function instead of the original one results in a relative increase of 15.6% in the achieved average mutation score, at the cost of a relative increase of 12.6% in the average test suite size.
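    The first two factors are the standard ingredients of search-based branch coverage fitness (approach level plus normalized branch distance); how the third factor is folded in below is an assumption for illustration, since the abstract does not give the exact formula.

```python
def normalize(distance):
    # Standard normalization of branch distance into [0, 1).
    return distance / (distance + 1.0)

def fitness(approach_level, branch_distance, states_reached, seen_states,
            novelty_weight=0.5):
    """Combine the three factors described above; lower is better.

    `states_reached` is the set of abstract object states this test case
    drove the object under test into; `seen_states` are states already
    reached by earlier individuals. The novelty discount is a hypothetical
    combination, not necessarily the paper's exact one.
    """
    base = approach_level + normalize(branch_distance)
    new_states = len(states_reached - seen_states)
    # Reward tests that discover previously unseen object states by
    # discounting their distance to the coverage target.
    return base / (1.0 + novelty_weight * new_states)
```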

    A Study of Equivalent and Stubborn Mutation Operators using Human Analysis of Equivalence

    Though mutation testing has been widely studied for more than thirty years, the prevalence and properties of equivalent mutants remain largely unknown. We report on the causes and prevalence of equivalent mutants and their relationship to stubborn mutants (those that remain undetected by a high-quality test suite, yet are non-equivalent). Our results, based on manual analysis of 1,230 mutants from 18 programs, reveal a highly uneven distribution of equivalence and stubbornness. For example, the ABS class and half of the UOI class generate many equivalent and almost no stubborn mutants, while the LCR class generates many stubborn and few equivalent mutants. We conclude that previous test effectiveness studies based on fault seeding could be skewed, and that developers of mutation testing tools should prioritise those operators that we found to generate disproportionately many stubborn (and few equivalent) mutants.
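    To make the operator classes concrete: ABS inserts absolute-value operators, UOI inserts unary operators, and LCR replaces logical connectors. The function and mutants below are hypothetical illustrations, not examples from the studied programs.

```python
# Hypothetical function illustrating the mutation operator classes above.

def index_of(xs, target):
    i = 0
    while i < len(xs) and xs[i] != target:  # LCR: `and` -> `or` gives a
        i += 1                              # non-equivalent mutant (it can
    return i                                # index past the end of xs)

# ABS (absolute-value insertion): replacing `i` with `abs(i)` inside
# `xs[i]` yields an EQUIVALENT mutant, because `i` is never negative here.
# UOI (unary-operator insertion): returning `-i` instead of `i` is
# non-equivalent and is killed by any test where the match is not at
# index 0.
```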

    MTFuzz: Fuzzing with a Multi-Task Neural Network

    Fuzzing is a widely used technique for detecting software bugs and vulnerabilities. Most popular fuzzers generate new inputs using an evolutionary search to maximize code coverage. Essentially, these fuzzers start with a set of seed inputs, mutate them to generate new inputs, and identify the promising inputs using an evolutionary fitness function for further mutation. Despite their success, evolutionary fuzzers tend to get stuck in long sequences of unproductive mutations. In recent years, machine learning (ML) based mutation strategies have reported promising results. However, existing ML-based fuzzers are limited by the lack of quality and diversity of the training data. As the input space of the target programs is high-dimensional and sparse, it is prohibitively expensive to collect many diverse samples demonstrating successful and unsuccessful mutations to train the model. In this paper, we address these issues by using a multi-task neural network that can learn a compact embedding of the input space based on diverse training samples for multiple related tasks (i.e., predicting different types of coverage). The compact embedding can guide the mutation process by focusing most of the mutations on the parts of the input where the gradient is high. MTFuzz uncovers 11 previously unseen bugs and achieves an average of 2× more edge coverage compared with 5 state-of-the-art fuzzers on 10 real-world programs. Comment: ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering (ESEC/FSE) 2020.
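    A minimal sketch of the idea: a shared encoder produces the compact embedding, one head per coverage-prediction task sits on top, and input-gradient magnitude picks the bytes to mutate. The architecture sizes, the task names, and the saliency scheme are assumptions for illustration, not MTFuzz's actual design.

```python
import torch
import torch.nn as nn

class MultiTaskCoverageNet(nn.Module):
    """Shared input embedding with one prediction head per coverage task."""
    def __init__(self, input_bytes=1024, embed_dim=64, n_edges=512):
        super().__init__()
        self.encoder = nn.Sequential(          # shared, compact embedding
            nn.Linear(input_bytes, 256), nn.ReLU(),
            nn.Linear(256, embed_dim), nn.ReLU(),
        )
        self.heads = nn.ModuleDict({           # one head per coverage type
            "edge":     nn.Linear(embed_dim, n_edges),
            "context":  nn.Linear(embed_dim, n_edges),
            "approach": nn.Linear(embed_dim, n_edges),
        })

    def forward(self, x):
        z = self.encoder(x)
        return {task: torch.sigmoid(head(z))
                for task, head in self.heads.items()}

def hot_bytes(model, x, k=16):
    """Rank input bytes by gradient magnitude to focus mutations on them."""
    x = x.clone().requires_grad_(True)
    model(x)["edge"].sum().backward()
    return torch.topk(x.grad.abs(), k).indices  # mutate these bytes first
```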

    Automatic mapping of free texts to bioinformatics ontology terms

    In the field of bioinformatics, the number of tools and services is ever-increasing. In order to make information about these resources available in a useful way, we annotate them with ontology terms. This is currently done manually, which is time-consuming and error-prone. In this thesis, we set out to build a tool that helps the annotator by providing annotation suggestions. We developed a program that reads in free-text descriptions of tools and services, adds the content of web pages and publications related to each tool, and, based on this, outputs the best-matching ontology terms. We then optimised the program's parameters against manually created annotation sets. Initial results look promising: when comparing performance against these manual annotations, many suggestions agree with them. Moreover, according to experienced annotators, many of the other suggestions are also correct.
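    A plausible baseline for this kind of matching is TF-IDF over term labels and definitions with cosine-similarity ranking; the scoring scheme and the example term texts below are assumptions, not necessarily what the thesis implements.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity

def suggest_terms(description, ontology_terms, top_k=5):
    """Rank ontology terms against a free-text tool description.

    `ontology_terms` maps a term ID to its label/synonym/definition text.
    """
    ids = list(ontology_terms)
    corpus = [ontology_terms[i] for i in ids]
    vectorizer = TfidfVectorizer(stop_words="english")
    term_vectors = vectorizer.fit_transform(corpus)
    query = vectorizer.transform([description])
    scores = cosine_similarity(query, term_vectors).ravel()
    return sorted(zip(ids, scores), key=lambda p: p[1], reverse=True)[:top_k]

# Usage with hypothetical term texts:
# suggest_terms("maps RNA-seq reads to a reference genome",
#               {"operation:Read_mapping": "read mapping alignment of reads",
#                "topic:Proteomics": "mass spectrometry protein analysis"})
```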

    Finding The Lazy Programmer's Bugs

    Traditionally, developers and testers created huge numbers of explicit tests, enumerating interesting cases, perhaps biased by what they believed to be the current boundary conditions of the function being tested. Or at least, they were supposed to. A major step forward was the development of property testing. Property testing requires the user to write a few functional properties that are used to generate tests, and requires an external library or tool to create test data for the tests. As such, many thousands of tests can be created for a single property. For the purely functional programming language Haskell there are several such libraries; for example QuickCheck [CH00], SmallCheck and Lazy SmallCheck [RNL08]. Unfortunately, property testing still requires the user to write explicit properties. Fortunately, we note that there are already many implicit tests present in programs: developers may throw assertion errors, or the compiler may silently insert runtime exceptions for incomplete pattern matches. We attempt to automate the testing process using these implicit tests. Our contributions are in four main areas: (1) we have developed algorithms to automatically infer the appropriate constructors and functions needed to generate test data, without requiring additional programmer work or annotations; (2) to combine the constructors and functions into test expressions, we take advantage of Haskell's lazy evaluation semantics, applying the techniques of needed narrowing and lazy instantiation to guide generation; (3) we keep the type of test data at its most general, in order to avoid committing too early to monomorphic types that cause needless wasted tests; (4) we have developed novel ways of creating Haskell case expressions to inspect elements inside returned data structures, in order to discover exceptions that may be hidden by laziness, and to make our test data generation algorithm more expressive. To validate our claims, we have implemented these techniques in Irulan, a fully automatic tool for generating systematic black-box unit tests for Haskell library code. We have designed Irulan to generate high-coverage test suites and detect common programming errors in the process.
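    Irulan itself targets Haskell; as a language-neutral illustration of the core "implicit tests" idea, the sketch below probes a function with generated arguments and treats any uncaught exception (an assertion, an out-of-bounds access, the analogue of an incomplete pattern match) as a failing implicit test. The generator interface is an assumption for illustration.

```python
import inspect
import itertools

def implicit_test(func, generators, max_cases=1000):
    """Probe `func` with generated arguments; uncaught exceptions are
    treated as failures of the program's implicit tests.

    `generators` maps parameter names to iterables of candidate values.
    """
    params = list(inspect.signature(func).parameters)
    pools = [list(generators[p]) for p in params]
    failures = []
    for case in itertools.islice(itertools.product(*pools), max_cases):
        try:
            func(*case)
        except Exception as exc:       # an implicit test has fired
            failures.append((case, exc))
    return failures

# Example: taking the head of [] raises IndexError, an implicit test.
# implicit_test(lambda xs: xs[0], {"xs": [[], [1], [1, 2]]})
```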

    It is not the length that matters, it is how you control it

    The length of test cases is a little-investigated topic in search-based test generation for object-oriented software, where test cases are sequences of method calls. While intuitively longer tests can achieve higher overall code coverage, there is always the threat of bloat, a complex phenomenon in evolutionary computation in which length grows abnormally over time. In this paper, we show that bloat indeed also occurs in the context of test generation for object-oriented software. We present different techniques to overcome the problem of length bloat, and evaluate all possible combinations of these techniques using different search lengths. Experiments on a set of difficult search targets selected from several open source and industrial projects show that the important choice in search-based testing is not the length of test cases, but how to make sure that this length does not become bloated.
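    Two standard bloat-control measures in this setting are (a) letting length break ties only after coverage, and (b) trimming statements that contribute no coverage. The sketch below illustrates both; it is an assumed illustration of such techniques, not the paper's exact set.

```python
def rank_population(population, coverage, length):
    """Sort candidate tests so coverage decides and length only breaks
    ties, preventing length from being rewarded for its own sake.

    `coverage(t)` returns the number of covered branches of test t;
    `length(t)` returns its number of statements.
    """
    return sorted(population, key=lambda t: (-coverage(t), length(t)))

def truncate_neutral_tail(test, coverage):
    """Keep the shortest prefix of `test` (a statement sequence) that
    achieves the same coverage, dropping the bloated tail."""
    full = coverage(test)
    for i in range(1, len(test) + 1):
        if coverage(test[:i]) == full:
            return test[:i]
    return test
```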

    Symbolic search-based testing

    We present an algorithm for constructing fitness functions that improve the efficiency of search-based testing when trying to generate branch-adequate test data. The algorithm combines symbolic information with dynamic analysis and has two key advantages: it does not require any change to the underlying test data generation technique, and it avoids many problems traditionally associated with symbolic execution, in particular the presence of loops. We have evaluated the algorithm on industrial closed-source and open-source systems using both local and global search-based testing techniques, demonstrating that both are statistically significantly more efficient using our approach. The test for significance was done using a one-sided, paired Wilcoxon signed-rank test. On average, the local search requires 23.41% and the global search 7.78% fewer fitness evaluations when using a symbolic-execution-based fitness function generated by the algorithm.
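    The significance test named above is directly available in SciPy. The sketch below shows how such a one-sided, paired comparison of fitness-evaluation counts would be run; the numbers are made up purely for illustration and are not the paper's data.

```python
from scipy.stats import wilcoxon

# Paired fitness-evaluation counts per test target: baseline fitness
# function vs. the symbolic-execution-based one (illustrative values).
baseline = [5400, 7200, 3100, 9800, 4400, 6100, 2800, 8700]
symbolic = [4100, 5600, 2500, 7400, 3900, 4700, 2300, 6500]

# H1: the baseline needs MORE evaluations, i.e. the paired differences
# (baseline - symbolic) tend to be positive.
stat, p_value = wilcoxon(baseline, symbolic, alternative="greater")
print(f"W={stat}, one-sided p={p_value:.4f}")
```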