Deep Investigation of Cross-Language Plagiarism Detection Methods

Agnes, Frederic; Besacier, Laurent; Ferrero, Jeremy; Schwab, Didier

Authors: Frederic Agnes, Laurent Besacier, Jeremy Ferrero, Didier Schwab
Publication date: 24 May 2017

Abstract

This paper presents an in-depth investigation of cross-language plagiarism detection methods on a recently introduced open dataset, which contains parallel and comparable collections of documents with varied characteristics (different genres, languages, and text sizes). We evaluate these methods on 6 language pairs at 2 granularities of text units in order to draw robust conclusions about the best methods, and we analyze in depth how results correlate across document styles and languages.

Comment: Accepted to BUCC (10th Workshop on Building and Using Comparable Corpora), co-located with ACL 201

Available Versions

Hal - Université Grenoble Alpes

oai:HAL:hal-01531346v1

Last time updated on 10/08/2017

Archive Ouverte en Sciences de l'Information et de la Communication

oai:HAL:hal-01531346v1

Last time updated on 12/10/2017