Identifying composite crosscutting concerns through semi-supervised learning

By Jianlin Zhu, Jin Huang, Daicui Zhou, Federico Carminati, Guoping Zhang and Qiang He


Aspect mining improves the modularity of legacy software systems by identifying their underlying crosscutting concerns (CCs). However, a realistic CC is a composite one consisting of CC seeds and their related program elements, which makes identifying it a great challenge. In this paper, inspired by state-of-the-art information retrieval techniques, we model this problem as a semi-supervised learning problem. First, a link analysis technique is adopted to generate CC seeds. Second, we construct a coupling graph, which indicates the relationships between CC seeds. Then, we adopt a community detection technique to generate groups of CC seeds as constraints for semi-supervised learning, which guide the clustering process. Furthermore, we propose a semi-supervised graph clustering approach, named constrained authority-shift clustering, to identify composite CCs. Two measurements, namely similarity and connectivity, are defined, and a seeded graph is generated for clustering program elements. We evaluate constrained authority-shift clustering on numerous software systems, including a large-scale distributed software system. The experimental results demonstrate that our semi-supervised learning approach is more effective in detecting composite CCs.
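
The first three steps of the pipeline (seed generation, coupling graph, seed grouping) can be illustrated with a minimal sketch. The abstract does not say which link analysis or community detection algorithm is used, so this sketch assumes PageRank for the former and plain connected components as a stand-in for the latter. The call graph `CALLS`, the "called by the same element" notion of coupling, and all function names are hypothetical, and the constrained authority-shift clustering step itself is not attempted here.

```python
from collections import defaultdict

# Hypothetical call graph: caller -> list of callees (not taken from the paper).
CALLS = {
    "Order.save": ["Tx.begin", "Tx.commit"],
    "Order.cancel": ["Tx.begin", "Tx.commit"],
    "User.login": ["Auth.check", "Audit.record"],
    "Report.render": ["Auth.check", "Audit.record"],
}


def pagerank(graph, damping=0.85, iterations=50):
    """Power-iteration PageRank over a dict {node: [successors]}."""
    nodes = set(graph) | {v for succs in graph.values() for v in succs}
    n = len(nodes)
    rank = {v: 1.0 / n for v in nodes}
    for _ in range(iterations):
        new_rank = {v: (1.0 - damping) / n for v in nodes}
        for u in nodes:
            succs = graph.get(u, [])
            # Nodes without outgoing calls spread their rank over all nodes.
            targets = succs if succs else nodes
            share = damping * rank[u] / len(targets)
            for v in targets:
                new_rank[v] += share
        rank = new_rank
    return rank


def top_seeds(graph, k):
    """Assumed seed rule: the k highest-ranked elements become CC seeds."""
    rank = pagerank(graph)
    return sorted(rank, key=lambda v: (-rank[v], v))[:k]


def coupling_graph(graph, seeds):
    """Assumed coupling: two seeds are linked if one element calls both."""
    seed_set = set(seeds)
    edges = defaultdict(set)
    for callees in graph.values():
        called = sorted(seed_set.intersection(callees))
        for i, a in enumerate(called):
            for b in called[i + 1:]:
                edges[a].add(b)
                edges[b].add(a)
    return {s: edges[s] for s in seeds}


def seed_groups(coupling):
    """Connected components, used here in place of community detection."""
    seen, groups = set(), []
    for start in sorted(coupling):
        if start in seen:
            continue
        stack, group = [start], set()
        while stack:
            node = stack.pop()
            if node not in group:
                group.add(node)
                stack.extend(coupling[node] - group)
        seen |= group
        groups.append(sorted(group))
    return groups


if __name__ == "__main__":
    seeds = top_seeds(CALLS, 4)
    print(seed_groups(coupling_graph(CALLS, seeds)))
```

In this toy graph the transaction-related and the authentication-related elements end up in separate groups; in a semi-supervised setting, groups of this kind would serve as the constraints that guide the later clustering of the remaining program elements.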

Topics: Aspect mining, Composite crosscutting concerns, Link analysis, Semi-supervised learning
Publisher: John Wiley & Sons
Year: 2014
DOI identifier: 10.1002/spe.2234