DSpot: Test Amplification for Automatic Assessment of Computational Diversity

Author: Allier Simon
Baudry Benoit
Monperrus Martin
Rodriguez-Cancio Marcelino
Publication date: 09/06/2015

Context: Computational diversity, i.e., the presence of a set of programs that all perform compatible services but that exhibit behavioral differences under certain conditions, is essential for fault tolerance and security. Objective: We aim at proposing an approach for automatically assessing the presence of computational diversity. In this work, computationally diverse variants are defined as (i) sharing the same API, (ii) behaving the same according to an input-output based specification (a test-suite) and (iii) exhibiting observable differences when they run outside the specified input space. Method: Our technique relies on test amplification. We propose source code transformations on test cases to explore the input domain and systematically sense the observation domain. We quantify computational diversity as the dissimilarity between observations on inputs that are outside the specified domain. Results: We run our experiments on 472 variants of 7 classes from open-source, large and thoroughly tested Java classes. Our test amplification multiplies by ten the number of input points in the test suite and is effective at detecting software diversity. Conclusion: The key insights of this study are: the systematic exploration of the observable output space of a class provides new insights about its degree of encapsulation; the behavioral diversity that we observe originates from areas of the code that are characterized by their flexibility (caching, checking, formatting, etc.). Comment: 12 pages
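
The idea in this abstract (amplify the inputs of existing tests, then measure how often two variants produce different observations) can be illustrated with a minimal sketch. It is not DSpot itself: the two variants, the literal transformations in `amplify`, and the `diversity` score are all hypothetical choices made for illustration.

```python
from typing import Callable, Iterable, List


def variant_a(values: List[int]) -> int:
    """Hypothetical variant: sums the values, clamping negatives to zero."""
    return sum(max(v, 0) for v in values)


def variant_b(values: List[int]) -> int:
    """Hypothetical variant: plain sum, no clamping."""
    return sum(values)


def amplify(test_input: List[int]) -> List[List[int]]:
    """Derive new inputs from one test input by transforming one literal at a time.

    The four transformations (+1, -1, *2, negation) are assumptions of this sketch.
    """
    amplified = []
    for i, v in enumerate(test_input):
        for new_v in (v + 1, v - 1, v * 2, -v):
            mutated = list(test_input)
            mutated[i] = new_v
            amplified.append(mutated)
    return amplified


def diversity(f: Callable, g: Callable, inputs: Iterable[List[int]]) -> float:
    """Fraction of inputs on which the two variants return different values."""
    inputs = list(inputs)
    if not inputs:
        return 0.0
    differing = sum(1 for x in inputs if f(x) != g(x))
    return differing / len(inputs)


# The "specified" input space: both variants agree on every one of these inputs.
SPECIFIED = [[1, 2, 3], [0, 5], [4]]
# Amplified inputs step outside the specified space (e.g. negative numbers).
AMPLIFIED = [m for t in SPECIFIED for m in amplify(t)]

if __name__ == "__main__":
    print(diversity(variant_a, variant_b, SPECIFIED))
    print(diversity(variant_a, variant_b, AMPLIFIED))
```

In this toy setup the variants are indistinguishable on the original inputs and only diverge on amplified inputs that contain a negative number, which mirrors conditions (ii) and (iii) of the definition above.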

arXiv.org e-Print Archive

HAL-CentraleSupelec

INRIA a CCSD electronic archive server

HAL Descartes

Hal-Diderot

HAL-Rennes 1

An empirical study to quantify the characteristics of Java programs that may influence symbolic execution from a unit testing perspective

Author: Durelli Vinicius H. S.
Eler Marcelo M.
Endo Andre T.
Publication date: 01/11/2016

In software testing, a program is executed in hopes of revealing faults. Over the years, specific testing criteria have been proposed to help testers to devise test cases that cover the most relevant faulty scenarios. Symbolic execution has been used as an effective way of automatically generating test data that meet those criteria. Although this technique has been used for over three decades, several challenges remain and there is a lack of research on how often they appear in real-world applications. In this paper, we analyzed two samples of open source Java projects in order to understand the characteristics that may hinder the generation of unit test data using symbolic execution. The first sample, named SF100, is a third party corpus of classes obtained from 100 projects hosted by SourceForge. The second sample, called R47, is a set of 47 well-known and mature projects we selected from different repositories. Both samples are compared with respect to four dimensions that influence symbolic execution: path explosion, constraint complexity, dependency, and exception-dependent paths. The results provide valuable insight into how researchers and practitioners can tailor symbolic execution techniques and tools to better suit the needs of different Java applications

Proceedings - University of Groningen

University of Groningen

ARTS repository - University of Groningen

Dissertations of the University of Groningen

Uncovering Features in Behaviorally Similar Programs

Author: Su Fang-Hsiang
Publication venue: 'Columbia University Libraries/Information Services'
Publication date: 01/01/2018

The detection of similar code can support many software engineering tasks such as program understanding and program classification. Many excellent approaches have been proposed to detect programs having similar syntactic features. However, these approaches are unable to identify programs dynamically or statistically close to each other, which we call behaviorally similar programs. We believe the detection of behaviorally similar programs can enhance or even automate the tasks relevant to program classification. In this thesis, we will discuss our current approaches to identify programs having similar behavioral features from multiple perspectives. We first discuss how to detect programs having similar functionality. While the definition of a program’s functionality is undecidable, we use inputs and outputs (I/Os) of programs as the proxy of their functionality. We then use I/Os of programs as a behavioral feature to detect which programs are functionally similar: two programs are functionally similar if they share similar inputs and outputs. This approach has been studied and developed in the C language to detect functionally equivalent programs having equivalent I/Os. Nevertheless, some natural problems in Object Oriented languages, such as input generation and comparisons between application-specific data types, hinder the development of this approach. We propose a new technique, in-vivo detection, which uses existing and meaningful inputs to drive applications systematically and then applies a novel similarity model considering both inputs and outputs of programs, to detect functionally similar programs. We develop the tool, HitoshiIO, based on our in-vivo detection. In the subjects that we study, HitoshiIO correctly detects 68.4% of functionally similar programs, where its false positive rate is only 16.6%. In addition to functional I/Os of programs, we attempt to discover programs having similar execution behavior.
Again, the execution behavior of a program can be undecidable, so we use instructions executed at run-time as a behavioral feature of a program. We create DyCLINK, which observes program executions and encodes them in dynamic instruction graphs. A vertex in a dynamic instruction graph is an instruction and an edge is a type of dependency between two instructions. The problem of detecting which programs have similar executions can then be reduced to a problem of solving inexact graph isomorphism. We propose a link analysis based algorithm, LinkSub, which vectorizes each dynamic instruction graph by the importance of every instruction, to solve this graph isomorphism problem efficiently. In a K Nearest Neighbor (KNN) based program classification experiment, DyCLINK achieves 90+% precision. Because HitoshiIO and DyCLINK both rely on dynamic analysis to expose program behavior, they have better capability to locate and search for behaviorally similar programs than traditional static analysis tools. However, they suffer from some common problems of dynamic analysis, such as input generation and run-time overhead. These problems may make our approaches challenging to scale. Thus, we create the system, Macneto, which integrates static analysis with machine topic modeling and deep learning to approximate program behaviors from their binaries without truly executing programs. In our deobfuscation experiments considering two commercial obfuscators that alter lexical information and syntax in programs, Macneto achieves 90+% precision, where the ground truth is that the behavior of a program before and after obfuscation should be the same. In this thesis, we offer a more extensive view of similar programs than the traditional definitions.
While the traditional definitions of similar programs mostly use static features, such as syntax and lexical information, we propose to leverage the power of dynamic analysis and machine learning models to trace/collect behavioral features of programs. These behavioral features of programs can then be applied to detect behaviorally similar programs. We believe the techniques we invented in this thesis to detect behaviorally similar programs can improve the development of software engineering and security applications, such as code search and deobfuscation
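
To make the I/O-as-proxy idea in this abstract concrete, here is a small sketch that records input/output pairs for several functions on shared inputs and compares the recordings. It is not HitoshiIO's similarity model: the Jaccard index is a stand-in chosen for brevity, and every function name below is hypothetical.

```python
import math
from typing import Callable, Hashable, Iterable, Set, Tuple

IOPair = Tuple[Hashable, Hashable]


def io_profile(func: Callable, inputs: Iterable[Hashable]) -> Set[IOPair]:
    """Run func on each input and record the observed (input, output) pairs."""
    return {(x, func(x)) for x in inputs}


def jaccard(a: Set[IOPair], b: Set[IOPair]) -> float:
    """Jaccard index of two I/O profiles (1.0 means identical recordings)."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def abs_branching(x: int) -> int:
    """Absolute value written with a branch."""
    return x if x >= 0 else -x


def abs_squaring(x: int) -> int:
    """Absolute value written with different syntax: integer sqrt of x squared."""
    return math.isqrt(x * x)


def identity(x: int) -> int:
    """A functionally different program, used as a contrast."""
    return x


INPUTS = [-3, -1, 0, 2, 5]

if __name__ == "__main__":
    base = io_profile(abs_branching, INPUTS)
    print(jaccard(base, io_profile(abs_squaring, INPUTS)))
    print(jaccard(base, io_profile(identity, INPUTS)))
```

The two absolute-value implementations share no syntax yet produce identical profiles, while `identity` only agrees on the non-negative inputs. A score like this is evidence limited to the inputs that were exercised, which is why the choice of meaningful inputs matters in the approach described above.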

Columbia University Academic Commons

Software Engineering Laboratory Series: Collected Software Engineering Papers

The Software Engineering Laboratory (SEL) is an organization sponsored by NASA/GSFC and created to investigate the effectiveness of software engineering technologies when applied to the development of application software. The activities, findings, and recommendations of the SEL are recorded in the Software Engineering Laboratory Series, a continuing series of reports that includes this document

NASA Technical Reports Server

Software redundancy: what, where, how

Author: Carzaniga Antonio
Mattavelli Andrea
Pezzè Mauro
Publication date: 12/01/2017

Software systems have become pervasive in everyday life and are the core component of many crucial activities. An inadequate level of reliability may determine the commercial failure of a software product. Still, despite the commitment and the rigorous verification processes employed by developers, software is deployed with faults. To increase the reliability of software systems, researchers have investigated the use of various forms of redundancy. Informally, a software system is redundant when it performs the same functionality through the execution of different elements. Redundancy has been extensively exploited in many software engineering techniques, for example for fault-tolerance and reliability engineering, and in self-adaptive and self-healing programs. Despite the many uses, though, there is no formalization or study of software redundancy to support a proper and effective design of software. Our intuition is that a systematic and formal investigation of software redundancy will lead to more, and more effective, uses of redundancy. This thesis develops this intuition and proposes a set of ways to characterize redundancy qualitatively as well as quantitatively. We first formalize the intuitive notion of redundancy whereby two code fragments are considered redundant when they perform the same functionality through different executions. On the basis of this abstract and general notion, we then develop a practical method to obtain a measure of software redundancy. We prove the effectiveness of our measure by showing that it distinguishes between shallow differences, where apparently different code fragments reduce to the same underlying code, and deep code differences, where the algorithmic nature of the computations differs. We also demonstrate that our measure is useful for developers, since it is a good predictor of the effectiveness of techniques that exploit redundancy.
Besides formalizing the notion of redundancy, we investigate the pervasiveness of redundancy intrinsically found in modern software systems. Intrinsic redundancy is a form of redundancy that occurs as a by-product of modern design and development practices. We have observed that intrinsic redundancy is indeed present in software systems, and that it can be successfully exploited for good purposes. This thesis proposes a technique to automatically identify equivalent method sequences in software systems to help developers assess the presence of intrinsic redundancy. We demonstrate the effectiveness of the technique by showing that it identifies the majority of equivalent method sequences in a system with good precision and performance
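
As a rough illustration of what "equivalent method sequences" means, the sketch below checks whether two call sequences leave a Python list in the same state for a handful of sample states and arguments. This is a testing-based approximation written for this listing, not the technique proposed in the thesis; the helper names and the sample data are assumptions.

```python
import copy
from typing import Any, Callable, Iterable, List

MethodSequence = Callable[[List[Any], Any], None]


def seq_append(lst: List[Any], x: Any) -> None:
    """Sequence 1: a single append call."""
    lst.append(x)


def seq_insert_at_end(lst: List[Any], x: Any) -> None:
    """Sequence 2: insert at the index equal to the current length."""
    lst.insert(len(lst), x)


def seq_insert_front(lst: List[Any], x: Any) -> None:
    """Sequence 3: insert at the front (not equivalent to appending)."""
    lst.insert(0, x)


def likely_equivalent(
    seq_a: MethodSequence,
    seq_b: MethodSequence,
    states: Iterable[List[Any]],
    args: Iterable[Any],
) -> bool:
    """Return False as soon as one sampled state/argument distinguishes the sequences.

    True only means that no counterexample was found among the samples.
    """
    args = list(args)
    for state in states:
        for arg in args:
            a, b = copy.deepcopy(state), copy.deepcopy(state)
            seq_a(a, arg)
            seq_b(b, arg)
            if a != b:
                return False
    return True


SAMPLE_STATES = [[], [1], [1, 2, 3]]
SAMPLE_ARGS = [0, 7]

if __name__ == "__main__":
    print(likely_equivalent(seq_append, seq_insert_at_end, SAMPLE_STATES, SAMPLE_ARGS))
    print(likely_equivalent(seq_append, seq_insert_front, SAMPLE_STATES, SAMPLE_ARGS))
```

Including the empty list among the sample states shows why several states are needed: appending and inserting at the front agree on an empty list and only differ once the list has elements.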

RERO DOC Digital Library

Quantifying and Predicting the Influence of Execution Platform on Software Component Performance

Author: Kuperberg Michael
Publication venue: KIT Scientific Publishing, Karlsruhe
Publication date: 01/01/2010

The performance of software components depends on several factors, including the execution platform on which the software components run. To simplify cross-platform performance prediction in relocation and sizing scenarios, a novel approach is introduced in this thesis which separates the application performance profile from the platform performance profile. The approach is evaluated using transparent instrumentation of Java applications and with automated benchmarks for Java Virtual Machines

KITopen

CONFPROFITT: A CONFIGURATION-AWARE PERFORMANCE PROFILING, TESTING, AND TUNING FRAMEWORK

Author: Han Xue
Publication venue: UKnowledge
Publication date: 01/01/2019

Modern computer software systems are complicated. Developers can change the behavior of the software system through software configurations. The large number of configuration options and their interactions make the task of software tuning, testing, and debugging very challenging. Performance is one of the key aspects of non-functional qualities, where performance bugs can cause significant performance degradation and lead to poor user experience. However, performance bugs are difficult to expose, primarily because detecting them requires specific inputs, as well as specific configurations. While researchers have developed techniques to analyze, quantify, detect, and fix performance bugs, many of these techniques are not effective in highly-configurable systems. To improve the non-functional qualities of configurable software systems, testing engineers need to be able to understand the performance influence of configuration options, adjust the performance of a system under different configurations, and detect configuration-related performance bugs. This research will provide an automated framework that allows engineers to effectively analyze performance-influence configuration options, detect performance bugs in highly-configurable software systems, and adjust configuration options to achieve higher long-term performance gains. To understand real-world performance bugs in highly-configurable software systems, we first perform a performance bug characteristics study of three large-scale open-source projects. Many researchers have studied the characteristics of performance bugs from bug reports, but few have reported what the experience is when trying to replicate confirmed performance bugs from the perspective of non-domain experts such as researchers. This study is meant to report the challenges of, and potential workarounds for, replicating confirmed performance bugs.
We also want to share a performance benchmark to provide real-world performance bugs to evaluate future performance testing techniques. Inspired by our performance bug study, we propose a performance profiling approach that can help developers to understand how configuration options and their interactions can influence the performance of a system. The approach uses a combination of dynamic analysis and machine learning techniques, together with configuration sampling techniques, to profile the program execution and analyze the configuration options relevant to performance. Next, the framework leverages natural language processing and information retrieval techniques to automatically generate test inputs and configurations to expose performance bugs. Finally, the framework combines reinforcement learning and dynamic state reduction techniques to guide the subject application towards achieving higher long-term performance gains

University of Kentucky

Fundamental Approaches to Software Engineering

Publication venue: 'Springer Science and Business Media LLC'

This open access book constitutes the proceedings of the 24th International Conference on Fundamental Approaches to Software Engineering, FASE 2021, which took place during March 27–April 1, 2021, and was held as part of the Joint Conferences on Theory and Practice of Software, ETAPS 2021. The conference was planned to take place in Luxembourg but changed to an online format due to the COVID-19 pandemic. The 16 full papers presented in this volume were carefully reviewed and selected from 52 submissions. The book also contains 4 Test-Comp contributions

OAPEN Library

Performance Benchmarks for Custom Applications: Considerations and Strategies

Author: Cabral Braulio J.
Publication venue: ePublications at Regis University
Publication date: 28/09/2006

The motivation for this research came from the need to solve a problem affecting not only the company used in this study, but also the many other companies in the information technology industry facing a similar problem: how to conduct performance benchmarks for custom applications in an effective, unbiased, and accurate manner. This paper presents the pros and cons of existing benchmark methodologies. It proposes a combination of the best characteristics of these benchmarks into a methodology that addresses the problem from an application perspective considering the overall synergy between operating system and software. The author also discusses a software design to implement the proposed methodology. The methodology proposed is generic enough to be adapted to any particular application performance-benchmarking situation

ePublications at Regis University

Mutation Testing Advances: An Analysis and Survey

Author: Abraham
Abreu
Adra
Ahmed
Aichernig
Al-Hajjaji
Alberto
Alipour
Ammann
Anand
Anbalagan
Andrews
Andrés
Aranega
Arcaini
Arcuri
Ayari
Aydal
Baker
Bardin
Barr
Bartel
Baudry
Belli
Bertolino
Binder
Binkley
Black
Bottaci
Boubeta-Puig
Bowes
Bradbury
Briand
Brodersen
Brown
Cadar
Chandra
Chekam
Chen
Ciupa
Coles
Dadeau
Dan
Debroy
Delamare
Delamaro
Delgado-Pérez
DeMillo
Deng
Derezinska
Derezińska
Devroey
Do
Dobolyi
Domínguez-Jiménez
Durelli
Durães
El-Fakih
Ellims
Elrakaiby
Enoiu
Estero-Botaro
Fabbri
Feng
Fernandes
Ferrari
Filho
Foster
Frankl
Fraser
Galeotti
Garvin
Gay
Geist
Gligoric
Gong
Goodenough
Gopinath
Groce
Grün
Guan
Hamlet
Hao
Hariri
Harman
Hassan
Henard
Holling
Hong
Howden
Hu
Hwang
Iida
Inozemtseva
Jabbarvand
Jagannath
Jahangirova
Jamrozik
Jia
Just
Kakarla
Kaminski
Kapfhammer
Kaplan
Khan
Kim
King
Kintis
Knauth
Krenn
Kurtz
Kusano
Lakehal
Langdon
Larsen
Laurent
Le
Le Goues
Lelli
Li
Linares-Vásquez
Lindström
Lisper
Loise
Long
Lou
Ma
Madeyski
Madiraju
Maezawa
Mahajan
Marcozzi
Maruchi
Mateo
Matinnejad
Mirshokraie
Moon
Moore
Morell
Mouelhi
Murtaza
Musco
Márki
Nam
Namin
Nanavati
Nardo
Nguyen
Nica
Ocariza
Offutt
Oliveira
Omar
Pankumhang
Papadakis
Parsai
Patrick
Petke
Pill
Polo
Praphamontripong
Rajan
Ramler
Riener
Rojas
Rothermel
Roy
Rutherford
Saifan
Schirp
Schuler
Schwarz
Shi
Shin
Silva
Simao
Souza
Sridharan
Staats
Stephan
Su
Sullivan
Sun
Svajlenko
Tai
Tan
Tengeri
Tisi
Tokumoto
Trakhtenbrot
Troya
Tuya
Untch
Usaola
Visser
Voas
Walsh
Wang
Wei
Weiglhofer
Weimer
Winbladh
Wotawa
Wright
Wu
Xie
Xu
Yao
Ye
Yoshida
Zhang
Zhou
Zhu
Publication date: 01/01/2019

Crossref

Open Repository and Bibliography - Luxembourg