Application of seq2seq models on code correction

Chin, Sang; Huang, Shan; Zhou, Xiao

Application of seq2seq models on code correction

Authors: Sang Chin
Shan Huang
Xiao Zhou
Publication date: 28 January 2020
Publisher: 'Frontiers Media SA'
Doi

Abstract

We apply various seq2seq models on programming language correction tasks on Juliet Test Suite for C/C++ and Java of Software Assurance Reference Datasets and achieve 75% (for C/C++) and 56% (for Java) repair rates on these tasks. We introduce pyramid encoder in these seq2seq models, which significantly increases the computational efficiency and memory efficiency, while achieving similar repair rate to their nonpyramid counterparts. We successfully carry out error type classification task on ITC benchmark examples (with only 685 code instances) using transfer learning with models pretrained on Juliet Test Suite, pointing out a novel way of processing small programming language datasets.Published versio

Similar works

Full text

Open in the Core reader

Download PDF

Available Versions

Boston University Institutional Repository (OpenBU)

oai:open.bu.edu:2144/43314

Last time updated on 10/12/2021

Boston University Institutional Repository (OpenBU)

oai:open.bu.edu:2144/43315

Last time updated on 10/12/2021