Search CORE

96 research outputs found

On the Effect of Semantically Enriched Context Models on Software Modularization

Author: Hage Jurriaan
Jansen Slinger
Khadka Ravi
Saeidi Amir
Publication venue: 'Aspect-Oriented Software Association (AOSA)'
Publication date: 04/08/2017
Field of study

Many of the existing approaches for program comprehension rely on the linguistic information found in source code, such as identifier names and comments. Semantic clustering is one such technique for modularization of the system that relies on the informal semantics of the program, encoded in the vocabulary used in the source code. Treating the source code as a collection of tokens loses the semantic information embedded within the identifiers. We try to overcome this problem by introducing context models for source code identifiers to obtain a semantic kernel, which can be used for both deriving the topics that run through the system as well as their clustering. In the first model, we abstract an identifier to its type representation and build on this notion of context to construct contextual vector representation of the source code. The second notion of context is defined based on the flow of data between identifiers to represent a module as a dependency graph where the nodes correspond to identifiers and the edges represent the data dependencies between pairs of identifiers. We have applied our approach to 10 medium-sized open source Java projects, and show that by introducing contexts for identifiers, the quality of the modularization of the software systems is improved. Both of the context models give results that are superior to the plain vector representation of documents. In some cases, the authoritativeness of decompositions is improved by 67%. Furthermore, a more detailed evaluation of our approach on JEdit, an open source editor, demonstrates that inferred topics through performing topic analysis on the contextual representations are more meaningful compared to the plain representation of the documents. The proposed approach in introducing a context model for source code identifiers paves the way for building tools that support developers in program comprehension tasks such as application and domain concept location, software modularization and topic analysis

arXiv.org e-Print Archive

Heriot Watt Pure

Crossref

ZENODO

Utrecht University Repository

FigShare

A search for the Ten Commentments: An exploratory study on automated quality assessment of comments in Java source code

Author: Lung C.
Publication venue
Publication date: 16/05/2022
Field of study

Open University of the Netherlands Research Portal

Feature location in source code: a taxonomy and survey

Author: Agrawal
Antoniol
Binkley
Binkley
Blei
Bohnet
Cleary
Comon
Cornelissen
Cubranic
Deerwester
Eaddy
Edwards
Egyed
Egyed
Egyed
Eisenbarth
Furnas
Greevy
Kleinberg
Ko
Lukins
Petrenko
Poshyvanyk
Rajlich
Robillard
Robillard
Robillard
Robillard
Rohatgi
Sahner
Salton
Sartipi
Schach
Simmons
Somerville
Wilde
Wilde
Zhai
Zhao
Publication venue: 'Wiley'
Publication date
Field of study

Crossref

Configuring and Assembling Information Retrieval based Solutions for Software Engineering Tasks.

Author: Dit Bogdan
Publication venue: W&M ScholarWorks
Publication date: 01/01/2015
Field of study

Information Retrieval (IR) approaches are used to leverage textual or unstructured data generated during the software development process to support various software engineering (SE) tasks (e.g., concept location, traceability link recovery, change impact analysis, etc.). Two of the most important steps for applying IR techniques to support SE tasks are preprocessing the corpus and configuring the IR technique, and these steps can significantly influence the outcome and the amount of effort developers have to spend for these maintenance tasks. We present the use of Genetic Algorithms (GAs) to automatically configure and assemble an IR process to support SE tasks. The approach named IR-GA determines the (near) optimal solution to be used for each step of the IR process without requiring any training. We applied IR-GA on three different SE tasks and the results of the study indicate that IR-GA outperforms approaches previously used in the literature, and that it does not significantly differ from an ideal upper bound that could be achieved by a supervised approach and a combinatorial approach

Boise State University - ScholarWorks

College of William & Mary: W&M Publish

Analyzing requirements and traceability information to improve bug localization

Author: Git
Marcus A.
Rath Michael
von Knethen A.
Voorhees Ellen M.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/05/2018
Field of study

Crossref

Institutional Knowledge at Singapore Management University

Software Feature Location in Practice: Debugging Aircraft Simulation Systems

Author: Hoseini Salman
Publication venue
Publication date: 07/12/2013
Field of study

In this thesis, we report on a study that we have conducted at CAE, one of the largest civil aircraft simulation companies in the world, in which we have developed a feature location approach to help software engineers debug simulation scenarios. A simulation scenario consists of a set of software components, configured in a certain way. A simulation fails when it does not behave as intended. This is typically a sign of a configuration problem. To detect configuration errors, we propose FELODE (Feature Location for Debugging), an approach that uses a single trace combined with user queries. When applied to CAE systems, FELODE achieves in average a precision of 50% and a recall of up to 100%

Concordia University Research Repository