
    Tracing Naming Semantics in Unit Tests of Popular Github Android Projects

    Tests are so closely linked to the source code that we can consider them up-to-date documentation. Developers are aware of recommended naming conventions and other best practices that should be used to write tests. In this paper we focus on how developers test in practice and which conventions they use. For the analysis, five very popular Android projects from GitHub were selected. The results show that 49% of tests contain the full, and 76% of tests a partial, unit under test (UUT) method name in their name. Further, we observed that a UUT was only rarely tested by multiple test classes, and then only when the tester wanted to distinguish the way he or she worked with the tested object. The analysis also shows that the word "test" in the test title is not a reliable metric for identifying a test. Apart from assertions, developers use constructs such as verify, try-catch, and thrown exceptions to check the correctness of UUT functionality. We also found that test titles contained keywords which could lead to the identification of the UUT, the use case of the test, or the data used for the test. Finally, the words in a test title were very often found in its body and, to a lesser extent, in the UUT body, which indicates the use of a similar vocabulary in tests and UUTs.
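    As an illustration of the naming and verification patterns described above, the following minimal JUnit 4 sketch uses a hypothetical Calculator class as the UUT (the class and method names are assumptions, not taken from the analyzed projects): the test names embed the full UUT method name (divide) together with the use case, and correctness is checked both with an assertion and with an expected exception instead of an assertion.

```java
import static org.junit.Assert.assertEquals;
import org.junit.Test;

// Hypothetical unit under test (UUT), defined here only to keep the example self-contained.
class Calculator {
    int divide(int a, int b) {
        if (b == 0) throw new IllegalArgumentException("division by zero");
        return a / b;
    }
}

// Test class illustrating the reported naming convention: the test names contain the
// full UUT method name ("divide") plus the tested use case.
public class CalculatorTest {

    @Test
    public void testDivideReturnsQuotientForValidInput() {
        assertEquals(2, new Calculator().divide(4, 2));   // assertion-based check
    }

    @Test(expected = IllegalArgumentException.class)
    public void testDivideThrowsForZeroDivisor() {
        new Calculator().divide(4, 0);                    // expected-exception style check
    }
}
```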

    Automating Test Case Identification in Open Source Projects on GitHub

    Software testing is one of the key Quality Assurance (QA) components. Many researchers deal with the testing process in terms of tester motivation and how tests should or should not be written. However, such recommendations do not reveal how tests are actually written in real projects. In this paper the following was investigated: (i) the denotation of the word "test" in different natural languages; (ii) whether the word "test" correlates with the presence of test cases; and (iii) which testing frameworks are used most. The analysis was performed on 38 GitHub open source repositories thoroughly selected from a set of 4.3M GitHub projects. We analyzed 20,340 test cases in 803 classes manually and 170k classes using an automated approach. The results show that: (i) there is a weak correlation (r = 0.655) between the word "test" and the presence of test cases in a class; (ii) the proposed algorithm using static file analysis correctly detected 95% of test cases; (iii) 15% of the analyzed classes use a main() function, which represents regular Java programs that test the production code without using any third-party framework. The identification of such tests is very complex due to implementation diversity. The results may be leveraged to more quickly identify and locate test cases in a repository, to understand practices in customized testing solutions, and to mine tests to improve program comprehension in the future.
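    The paper's exact detection algorithm is not given in the abstract; the following is only a minimal sketch of static file analysis in the spirit described above, assuming a .java file is flagged when it contains a @Test annotation or a main() method (a candidate framework-less test). The class name and regular expressions are illustrative assumptions.

```java
import java.io.IOException;
import java.nio.file.*;
import java.util.regex.Pattern;
import java.util.stream.Stream;

// Sketch of static-file-based test detection: flag .java files that contain a @Test
// annotation or a main() method, which may indicate a framework-less test program.
public class TestCaseScanner {

    private static final Pattern TEST_ANNOTATION = Pattern.compile("@Test\\b");
    private static final Pattern MAIN_METHOD =
            Pattern.compile("public\\s+static\\s+void\\s+main\\s*\\(");

    public static void main(String[] args) throws IOException {
        Path root = Paths.get(args.length > 0 ? args[0] : ".");
        try (Stream<Path> files = Files.walk(root)) {
            files.filter(p -> p.toString().endsWith(".java"))
                 .forEach(TestCaseScanner::classify);
        }
    }

    private static void classify(Path file) {
        try {
            String source = Files.readString(file);
            boolean hasTestAnnotation = TEST_ANNOTATION.matcher(source).find();
            boolean hasMain = MAIN_METHOD.matcher(source).find();
            if (hasTestAnnotation || hasMain) {
                System.out.printf("%s annotationTests=%b mainMethod=%b%n",
                        file, hasTestAnnotation, hasMain);
            }
        } catch (IOException e) {
            System.err.println("Could not read " + file + ": " + e.getMessage());
        }
    }
}
```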

    A Fine-grained Data Set and Analysis of Tangling in Bug Fixing Commits

    Context: Tangled commits are changes to software that address multiple concerns at once. For researchers interested in bugs, tangled commits mean that they actually study not only bugs but also other concerns irrelevant to the study of bugs. Objective: We want to improve our understanding of the prevalence of tangling and of the types of changes that are tangled within bug fixing commits. Methods: We use a crowdsourcing approach to manual labeling to validate which changes contribute to bug fixes for each line in bug fixing commits. Each line is labeled by four participants. If at least three participants agree on the same label, we have consensus. Results: We estimate that between 17% and 32% of all changes in bug fixing commits modify the source code to fix the underlying problem. However, when we only consider changes to the production code files, this ratio increases to 66% to 87%. We find that about 11% of lines are hard to label, leading to active disagreements between participants. Due to confirmed tangling and the uncertainty in our data, we estimate that 3% to 47% of the data is noisy without manual untangling, depending on the use case. Conclusion: Tangled commits have a high prevalence in bug fixes and can lead to a large amount of noise in the data. Prior research indicates that this noise may alter results. As researchers, we should be skeptical and assume that unvalidated data is likely very noisy until proven otherwise. (Accepted at Empirical Software Engineering.)
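    A minimal sketch of the consensus rule described above: each changed line receives four labels, and a label is accepted as consensus only if at least three participants agree. The concrete label names (bugfix, refactoring, test) are illustrative assumptions, not the study's exact label set.

```java
import java.util.*;

// Consensus rule sketch: four labels per line, consensus requires at least three
// identical labels; otherwise the line counts as an (active) disagreement.
public class ConsensusLabeler {

    static Optional<String> consensus(List<String> labels) {
        Map<String, Long> counts = new HashMap<>();
        for (String label : labels) counts.merge(label, 1L, Long::sum);
        return counts.entrySet().stream()
                .filter(e -> e.getValue() >= 3)      // at least 3 of 4 participants agree
                .map(Map.Entry::getKey)
                .findFirst();
    }

    public static void main(String[] args) {
        System.out.println(consensus(List.of("bugfix", "bugfix", "bugfix", "refactoring"))); // Optional[bugfix]
        System.out.println(consensus(List.of("bugfix", "bugfix", "test", "refactoring")));   // Optional.empty
    }
}
```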

    Automated testing environment and assessment of assignments for Android MOOC

    This paper describes the design of a testing environment for the large-scale assessment of assignments in Android application programming courses. The specific testing methods and tool suggestions were continuously consulted with Wirecard, a company dedicated to the development of mobile applications. The paper also analyzes the most common mistakes of students and suggests ways to uncover them through tests. Based on these, tests are created, the performance of emulator tests and real-device tests is compared, and the proposed tools are partially tested retrospectively on assignments from a previous run of a particular Android application programming course. From these partial results, the paper suggests changes to the course in relation to the testing environment and deploys the environment in the background of the course alongside the manual evaluation. It describes the testing experience, analyzes the results, and suggests changes for the future.

    Unit Under Test Identification Using Natural Language Processing Techniques

    Unit under test (UUT) identification is often difficult due to test smells, such as testing multiple UUTs in one test. Because tests best reflect the current product specification, they can be used to comprehend parts of the production code and the relationships between them. Because tests and UUTs share a similar vocabulary, five Natural Language Processing (NLP) techniques were applied to the source code of five popular GitHub projects in this paper. The collected results were compared with manually identified UUTs. The tf-idf model achieved the best accuracy: 22% for the correct UUT and 57% with a tolerance of up to the fifth position, compared to manual identification. These results were obtained after preprocessing the input documents with Java keyword removal and word splitting. The tf-idf model also achieved the best training time, and an index search takes less than 1 s per request, so it could be used in an Integrated Development Environment (IDE) as a support tool in the future. We also found that, among the preprocessing steps, word splitting improves accuracy the most, while removing Java keywords brings only a small improvement to the tf-idf results. Removing comments only slightly worsens the accuracy of the NLP models. Word splitting also provided the best speed, with an average preprocessing time of 0.3 s for all documents in a project.
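    The following is a minimal sketch, not the paper's implementation, of the two preprocessing steps mentioned above as applied to source-code identifiers before building an NLP model such as tf-idf: splitting camelCase and underscore identifiers into words, and removing Java keywords (only a small subset of the keyword list is shown here).

```java
import java.util.*;
import java.util.stream.Collectors;

// Preprocessing sketch: split identifiers into words and drop Java keywords before
// feeding the resulting tokens to an NLP model such as tf-idf.
public class SourcePreprocessor {

    // Abbreviated keyword list, for illustration only.
    private static final Set<String> JAVA_KEYWORDS = Set.of(
            "public", "private", "protected", "class", "void", "static", "final",
            "return", "new", "if", "else", "for", "while", "int", "boolean");

    static List<String> preprocess(String identifier) {
        // Split on underscores and camelCase boundaries,
        // e.g. "parsePaymentAmount" -> ["parse", "payment", "amount"].
        String[] parts = identifier.split("_|(?<=[a-z])(?=[A-Z])|(?<=[A-Z])(?=[A-Z][a-z])");
        return Arrays.stream(parts)
                .map(String::toLowerCase)
                .filter(word -> !word.isEmpty() && !JAVA_KEYWORDS.contains(word))
                .collect(Collectors.toList());
    }

    public static void main(String[] args) {
        System.out.println(preprocess("testParsePaymentAmountReturnsZero"));
        // -> [test, parse, payment, amount, returns, zero]
    }
}
```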

    Automating Test Case Identification in Java Open Source Projects on GitHub

    Software testing is one of the key Quality Assurance (QA) components. Many researchers deal with the testing process in terms of tester motivation and how tests should or should not be written. However, such recommendations do not reveal how tests are written in real projects. In this paper, the following was investigated: (i) the denotation of the word "test" in different natural languages; (ii) whether the number of occurrences of the word "test" correlates with the number of test cases; and (iii) which testing frameworks are used most. The analysis was performed on 38 GitHub open source repositories thoroughly selected from a set of 4.3M GitHub projects. We analyzed 20,340 test cases in 803 classes manually and 170k classes using an automated approach. The results show that: (i) there is a weak correlation (r = 0.655) between the number of occurrences of the word "test" and the number of test cases in a class; (ii) the proposed algorithm using static file analysis correctly detected 97% of test cases; (iii) 15% of the analyzed classes use a main() function, which represents regular Java programs that test the production code without using any third-party framework. The identification of such tests is very complex due to implementation diversity. The results may be leveraged to more quickly identify and locate test cases in a repository, to understand practices in customized testing solutions, and to mine tests to improve program comprehension in the future.

    Large-Scale Dataset of Local Java Software Build Results

    When a person decides to inspect or modify a third-party software project, the first necessary step is its successful compilation from source code using a build system. However, such attempts often end in failure. In this data descriptor paper, we provide a dataset of build results of open source Java software systems. We tried to automatically build a large number of Java projects from GitHub using their Maven, Gradle, and Ant build scripts in a Docker container simulating a standard programmer’s environment. The dataset consists of the output of two executions: 7264 build logs from a study executed in 2016 and 7233 logs from the 2020 execution. In addition to the logs, we collected exit codes, file counts, and various project metadata. The proportion of failed builds in our dataset is 38% in the 2016 execution and 59% in the 2020 execution. The published data can be helpful for multiple purposes, such as correlation analysis of the factors affecting build success, build failure prediction, and research in the area of build breakage repair.
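    A minimal sketch of how a single build attempt of the kind recorded in the dataset could be executed and logged. This is not the authors' harness (they ran the builds inside a Docker container); it assumes Maven is on the PATH and that the project directory contains a pom.xml.

```java
import java.io.File;
import java.io.IOException;
import java.nio.file.*;

// Run one Maven build, redirect its output to a log file, and keep the exit code,
// mirroring the kind of data (build logs and exit codes) collected in the dataset.
public class BuildRunner {

    public static void main(String[] args) throws IOException, InterruptedException {
        File projectDir = new File(args.length > 0 ? args[0] : ".");
        Path logFile = Paths.get(projectDir.getName() + "-build.log");

        Process build = new ProcessBuilder("mvn", "-B", "clean", "package")
                .directory(projectDir)
                .redirectErrorStream(true)             // merge stderr into stdout
                .redirectOutput(logFile.toFile())      // keep the full build log
                .start();

        int exitCode = build.waitFor();                // 0 means a successful build
        System.out.println("exit code: " + exitCode + ", log: " + logFile);
    }
}
```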

    Empirical Study of Test Case and Test Framework Presence in Public Projects on GitHub

    Automated tests are often considered an indicator of project quality. In this paper, we performed a large-scale analysis of 6.3M public GitHub projects that use Java as the primary programming language. We created an overview of the occurrence of tests in publicly available GitHub projects and of the test frameworks used in them. The results showed that 52% of the projects contain at least one test case. However, there is a large number of example tests that do not represent relevant production code testing. It was also found that there is only a poor correlation between the number of occurrences of the word “test” in different parts of a project (e.g., file paths, file names, file content) and the number of test cases, the creation date, the date of the last commit, the number of commits, or the number of watchers. The testing framework analysis confirmed that JUnit is the most used testing framework, with a 48% share. TestNG, considered the second most popular Java unit testing framework, occurred in only 3% of the projects.
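    The abstract does not describe the detection mechanism, so the following is only a minimal sketch of how a class could be attributed to JUnit or TestNG based on its import statements; the class name and heuristics are illustrative assumptions, not the study's implementation.

```java
import java.io.IOException;
import java.nio.file.*;

// Attribute a Java class to a testing framework by inspecting its imports.
// "import org.junit." also covers JUnit 5 (org.junit.jupiter) packages.
public class FrameworkDetector {

    static String detectFramework(Path javaFile) throws IOException {
        String source = Files.readString(javaFile);
        if (source.contains("import org.junit.")) return "JUnit";
        if (source.contains("import org.testng.")) return "TestNG";
        return "none/other";
    }

    public static void main(String[] args) throws IOException {
        for (String arg : args) {
            Path file = Paths.get(arg);
            System.out.println(file + " -> " + detectFramework(file));
        }
    }
}
```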

    Domain Usability Evaluation

    Contemporary software systems focus on usability and accessibility from the point of view of effectiveness and ergonomics. However, the correct usage of the domain dictionary and the description of domain relations and properties via their user interfaces are often neglected. We use the term domain usability (DU) to describe the aspects of a user interface related to terminology and the domain. Our experience has shown that poor domain usability reduces the memorability and effectiveness of user interfaces. To address this problem, we describe a method called ADUE (Automatic Domain Usability Evaluation) for the automated evaluation of selected DU properties of existing user interfaces. As prerequisites to the method, metrics for the formal evaluation of domain usability, a form stereotype recognition algorithm, and a general application terms filtering algorithm have been proposed. We executed ADUE on several real-world Java applications and report our findings. We also provide proposals to modify existing manual usability evaluation techniques for the purpose of domain usability evaluation.
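    The abstract names a general application terms filtering algorithm without detailing it; the sketch below is only a hypothetical illustration (not the ADUE implementation) of what such filtering might look like: generic UI vocabulary is removed from extracted widget labels so that only domain-specific terms remain. The stop list and method names are made up for illustration.

```java
import java.util.*;
import java.util.stream.Collectors;

// Hypothetical general-term filter: drop generic UI vocabulary from widget labels,
// keeping domain-specific terms for a domain usability evaluation.
public class GeneralTermFilter {

    // Made-up, abbreviated stop list of general application terms.
    private static final Set<String> GENERAL_TERMS = Set.of(
            "ok", "cancel", "save", "open", "close", "file", "edit", "help");

    static List<String> domainTerms(List<String> uiLabels) {
        return uiLabels.stream()
                .map(String::toLowerCase)
                .filter(label -> !GENERAL_TERMS.contains(label))
                .collect(Collectors.toList());
    }

    public static void main(String[] args) {
        List<String> labels = List.of("Save", "Cancel", "Diagnosis", "Prescription");
        System.out.println(domainTerms(labels));   // [diagnosis, prescription]
    }
}
```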