4 research outputs found

Generic code clone detection model for Java applications

Author: Mubarak-Ali Al-Fahim
Sulaiman Shahida
Publication venue: 'IOP Publishing'
Publication date: 01/01/2020

Code clone is a common term for code that is repeated multiple times in a program. Clones are classified as Type 1, Type 2, Type 3 and Type 4. Various code clone detection approaches and models have been used to detect code clones. However, a major challenge in detecting code clones with these models is their lack of generality in detecting all clone types. To address this problem, a Generic Code Clone Detection (GCCD) model is proposed that consists of five processes: Preprocessing, Transformation, Parameterization, Categorization and Match Detection. First, the pre-processing process produces source units through the application of five combinatorial rules. The transformation process then produces transformed source units based on the letter-to-number substitution concept. Next, the parameterization process produces the parameters used in the categorization and match detection processes. The categorization process then groups the source units into pools. Finally, the match detection process uses a hybrid of exact matching and Euclidean distance to detect the clones. Based on these processes, a prototype of the GCCD was developed using NetBeans 8.0. The model was compared with the Generic Pipeline Model (GPM). The comparison showed that the GCCD was able to detect clone pairs of Type-1 through Type-4, while the GPM was able to detect clone pairs of Type-1 only. Furthermore, the GCCD prototype was empirically tested with Bellon's benchmark data and was able to detect clones in Java applications with up to 203,000 lines of code. In conclusion, the GCCD model overcomes the lack of generality in detecting all code clone types by detecting Type 1, Type 2, Type 3 and Type 4 clones.
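
The abstract names a letter-to-number substitution step and a hybrid of exact matching with Euclidean distance, but gives no details of either. The following is only a minimal sketch of how such a pipeline might look, not the GCCD implementation. The a=1 ... z=26 mapping, the 26-bin letter histogram used as "parameters", the threshold value, and all function names are assumptions made for illustration.

```python
import math
from collections import Counter


def letter_to_number(unit):
    """Assumed transformation: map each ASCII letter to its alphabet
    position (a=1 ... z=26) and drop every other character."""
    return [ord(ch) - ord("a") + 1 for ch in unit.lower() if "a" <= ch <= "z"]


def parameters(numbers):
    """Assumed parameterization: a 26-bin histogram of the letter codes."""
    counts = Counter(numbers)
    return [counts.get(code, 0) for code in range(1, 27)]


def euclidean(p, q):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(p, q)))


def is_clone(unit_a, unit_b, threshold=2.0):
    """Hybrid check: exact match on the transformed units first,
    then a distance test on the parameter vectors.
    The threshold is a hypothetical value, not taken from the paper."""
    codes_a = letter_to_number(unit_a)
    codes_b = letter_to_number(unit_b)
    if codes_a == codes_b:
        return True  # exact match after transformation
    return euclidean(parameters(codes_a), parameters(codes_b)) <= threshold


if __name__ == "__main__":
    print(is_clone("int x = 1;", "int  x=1;"))   # differs only in whitespace
    print(is_clone("int x = 1;", "int y = 1;"))  # differs in one identifier
    print(is_clone("int x = 1;", "while (queue.isEmpty()) return;"))
```

In this sketch the exact-match branch catches units that differ only in layout, while the distance branch tolerates small lexical changes such as a renamed variable. A histogram ignores token order, so a real detector would need richer parameters than the ones assumed here.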

Universiti Teknologi Malaysia Institutional Repository

A systematic literature review on source code similarity measurement and clone detection: techniques, applications, and challenges

Author: Ekhtiarzadeh Masoud
Parsa Saeed
Ramezani Mohammad
Roy Chanchal
Zakeri-Nasrabadi Morteza
Publication date: 28/06/2023

Measuring and evaluating source code similarity is a fundamental software engineering activity that embraces a broad range of applications, including but not limited to code recommendation, duplicate code, plagiarism, malware, and smell detection. This paper presents a systematic literature review and meta-analysis on code similarity measurement and evaluation techniques to shed light on the existing approaches and their characteristics in different applications. We initially found over 10,000 articles by querying four digital libraries and ended up with 136 primary studies in the field. The studies were classified according to their methodology, programming languages, datasets, tools, and applications. A deep investigation reveals 80 software tools, working with eight different techniques on five application domains. Nearly 49% of the tools work on Java programs and 37% support C and C++, while there is no support for many programming languages. A noteworthy point was the existence of 12 datasets related to source code similarity measurement and duplicate codes, of which only eight datasets were publicly accessible. The lack of reliable datasets, empirical evaluations, hybrid methods, and focus on multi-paradigm languages are the main challenges in the field. Emerging applications of code similarity measurement concentrate on the development phase in addition to maintenance. Comment: 49 pages, 10 figures, 6 tables

arXiv.org e-Print Archive

Method-level code clone detection through LWH (Light Weight Hybrid) approach

Author: A Leitao
A Leitner
A Marcus
B Al-Batran
BS Baker
C Kapser
C Liu
C Wohlin
CJ Kapser
CK Roy
E Fenton
GMK Selim
H Basit
H Petersen
J Krinke
J Mayland
J Pate
JR Cordy
K Greenan
K Hotta
K Moller
L Moonen
M Fowler
M Funaro
M Gabel
M Lee
M Zibran
R Adamov
R Komondoor
R Koschke
R Wettel
RK Yin
S Bellon
S Ducasse
S Thummalapenta
T Kamiya
W Evans
WS Evans
Y Ueda
Z Li
Publication venue: 'Springer Science and Business Media LLC'

Crossref

FORMALIZATION AND DETECTION OF COLLABORATIVE PATTERNS IN SOFTWARE

Author: KULDEEP KUMAR
Publication date: 09/06/2015

Ph.D, DOCTOR OF PHILOSOPHY

ScholarBank@NUS