Search CORE

6 research outputs found

Cross-Language Learning for Program Classification using Bilateral Tree-Based Convolutional Neural Networks

Author: Bui Nghi D. Q.
Jiang Lingxiao
Yu Yijun
Publication venue
Publication date: 29/11/2017
Field of study

Towards the vision of translating code that implements an algorithm from one programming language into another, this paper proposes an approach for automated program classification using bilateral tree-based convolutional neural networks (BiTBCNNs). It is layered on top of two tree-based convolutional neural networks (TBCNNs), each of which recognizes the algorithm of code written in an individual programming language. The combination layer of the networks recognizes the similarities and differences among code in different programming languages. The BiTBCNNs are trained using the source code in different languages but known to implement the same algorithms and/or functionalities. For a preliminary evaluation, we use 3591 Java and 3534 C++ code snippets from 6 algorithms we crawled systematically from GitHub. We obtained over 90% accuracy in the cross-language binary classification task to tell whether any given two code snippets implement a same algorithm. Also, for the algorithm classification task, i.e., to predict which one of the six algorithm labels is implemented by an arbitrary C++ code snippet, we achieved over 80% precision

arXiv.org e-Print Archive

Open Research Online (The Open University)

Cross-language learning for program classification using bilateral tree-based convolutional neural networks

Author: BUI Duy Quoc Nghi
JIANG Lingxiao
YU Yijun
Publication venue: AAAI Press
Publication date: 01/02/2018
Field of study

Institutional Knowledge at Singapore Management University

Deep Learning Applied to Code Analysis

Author: Genin Simon
Publication venue
Publication date: 03/09/2019
Field of study

Repository of the University of Namur

Mining Fix Patterns for FindBugs Violations

Author: Bissyandé Tegawendé F.
Kim Dongsun
Liu Kui
Traon Yves Le
Yoo Shin
Publication venue
Publication date: 01/01/2018
Field of study

In this paper, we first collect and track a large number of fixed and unfixed violations across revisions of software. The empirical analyses reveal that there are discrepancies in the distributions of violations that are detected and those that are fixed, in terms of occurrences, spread and categories, which can provide insights into prioritizing violations. To automatically identify patterns in violations and their fixes, we propose an approach that utilizes convolutional neural networks to learn features and clustering to regroup similar instances. We then evaluate the usefulness of the identified fix patterns by applying them to unfixed violations. The results show that developers will accept and merge a majority (69/116) of fixes generated from the inferred fix patterns. It is also noteworthy that the yielded patterns are applicable to four real bugs in the Defects4J major benchmark for software testing and automated repair.Comment: Accepted for IEEE Transactions on Software Engineerin

arXiv.org e-Print Archive

Open Repository and Bibliography - Luxembourg