Comparative Evaluation of String Metrics for Context Ontology Database

Abstract

Static Context Code Coverage Program (SCCCP) is a program developed to calculate the coverage of context code in a Java file of an Android application. The database built for SCCCP includes records on location and speech context that are specific to Android. A string matching algorithm is needed because strings from the source code and the database must be checked for similarity before context coverage can be calculated. Therefore, three string metrics were analyzed in order to choose the most suitable one for SCCCP. In this paper, the results obtained from the Jaro-Winkler, Levenshtein, and Strike a Match string metrics are analyzed on the task of matching source code against database records, as well as on other pairs of strings. Some issues encountered during our source code matching experiment are also discussed. The findings indicate that the Strike a Match algorithm is the best option, since it gives the highest accuracy of the three.
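To make the comparison concrete, the following is a minimal sketch of a Strike a Match style similarity, which scores two strings by the adjacent letter pairs they share: twice the number of shared pairs divided by the total number of pairs in both strings. The class and method names (StrikeAMatch, wordLetterPairs, similarity) are assumptions made for illustration and are not taken from SCCCP; the choices to uppercase the input, split on whitespace, and return 0 when neither string yields a pair are likewise assumptions.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

// Illustrative sketch only; names and normalization choices are assumptions.
public class StrikeAMatch {

    // Collect the adjacent letter pairs of every whitespace-separated word.
    static List<String> wordLetterPairs(String text) {
        List<String> pairs = new ArrayList<>();
        for (String word : text.trim().toUpperCase(Locale.ROOT).split("\\s+")) {
            for (int i = 0; i + 1 < word.length(); i++) {
                pairs.add(word.substring(i, i + 2));
            }
        }
        return pairs;
    }

    // Similarity in [0, 1]: 2 * |shared pairs| / (|pairs of a| + |pairs of b|).
    static double similarity(String a, String b) {
        List<String> pairsA = wordLetterPairs(a);
        List<String> pairsB = wordLetterPairs(b);
        int total = pairsA.size() + pairsB.size();
        if (total == 0) {
            return 0.0; // assumption: no pairs at all counts as no similarity
        }
        int shared = 0;
        for (String pair : pairsA) {
            // Remove each matched pair so a repeated pair is not counted twice.
            if (pairsB.remove(pair)) {
                shared++;
            }
        }
        return 2.0 * shared / total;
    }

    public static void main(String[] args) {
        // "France" and "French" share the pairs FR and NC out of 5 + 5 pairs.
        System.out.println(similarity("France", "French")); // prints 0.4
    }
}
```

Because the score depends on shared letter pairs rather than on character positions, it is less sensitive to word order and inserted tokens than an edit distance such as Levenshtein.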
