Experiments to investigate the utility of nearest neighbour metrics based on linguistically informed features for detecting textual plagiarism

Almquist, Per; Karlgren, Jussi



Authors: Per Almquist, Jussi Karlgren
Publication date: 1 January 2011

Abstract

Plagiarism detection is a challenge for linguistic models: most currently implemented models use simple occurrence statistics for linguistic items. In this paper we report two experiments related to plagiarism detection in which we use a model of distributional semantics and of sentence stylistics to assess, sentence by sentence, the likelihood that a text is partly plagiarised. The results of the comparison are displayed for visual inspection by a plagiarism assessor.