Search CORE

5 research outputs found

Nomenclature and Contemporary Affirmation of the Unsupervised Learning in Text and Document Mining

Author: Annaluri Sreenivasa Rao
Prof. S. Ramakrishna
Publication venue: Global Journals Inc. (US)
Publication date: 21/02/2015
Field of study

Document clustering is primarily a method applied for an uncomplicated, document search, analysis and review of content or is a process of automatic classification of documents of similar type categorized to relevant clusters, in a clustering hierarchy. In this paper a review of the related work in the field of document clustering from the simple techniques of word and phrase to the present complex techniques of statistical analysis, machine learning etc are illustrated with their implications for future research work

Global Journal of Computer Science and Technology (GJCST)

The role of expressivity and productivity in (re)shaping the constructional network : a corpus-based study into synchronic and diachronic variation in the intensifying fake reflexive resultative construction in 19th to 21st Century Dutch

Author: Gyselinck Emmeline
Publication venue: Ghent University. Faculty of Arts and Philosophy
Publication date: 01/01/2018
Field of study

Ghent University Academic Bibliography

Detecting plagiarism in the forensic linguistics turn

Author: Sousa Silva Rui
Publication venue
Publication date
Field of study

This study investigates plagiarism detection, with an application in forensic contexts. Two types of data were collected for the purposes of this study. Data in the form of written texts were obtained from two Portuguese Universities and from a Portuguese newspaper. These data are analysed linguistically to identify instances of verbatim, morpho-syntactical, lexical and discursive overlap. Data in the form of survey were obtained from two higher education institutions in Portugal, and another two in the United Kingdom. These data are analysed using a 2 by 2 between-groups Univariate Analysis of Variance (ANOVA), to reveal cross-cultural divergences in the perceptions of plagiarism. The study discusses the legal and social circumstances that may contribute to adopting a punitive approach to plagiarism, or, conversely, reject the punishment. The research adopts a critical approach to plagiarism detection. On the one hand, it describes the linguistic strategies adopted by plagiarists when borrowing from other sources, and, on the other hand, it discusses the relationship between these instances of plagiarism and the context in which they appear. A focus of this study is whether plagiarism involves an intention to deceive, and, in this case, whether forensic linguistic evidence can provide clues to this intentionality. It also evaluates current computational approaches to plagiarism detection, and identifies strategies that these systems fail to detect. Specifically, a method is proposed to translingual plagiarism. The findings indicate that, although cross-cultural aspects influence the different perceptions of plagiarism, a distinction needs to be made between intentional and unintentional plagiarism. The linguistic analysis demonstrates that linguistic elements can contribute to finding clues for the plagiarist’s intentionality. Furthermore, the findings show that translingual plagiarism can be detected by using the method proposed, and that plagiarism detection software can be improved using existing computer tools

Aston Publications Explorer

Framing dance writing : a corpus linguistics approach

Author: Wiesner Susan L
Publication venue
Publication date: 01/01/2007
Field of study

EThOS - Electronic Theses Online ServiceGBUnited Kingdo

Surrey Research Insight

OpenGrey Repository