18 research outputs found
Chat mining for gender prediction
The aim of this paper is to investigate the feasibility of predicting the gender of a text document's author using linguistic evidence. For this purpose, term- and style-based classification techniques are evaluated over a large collection of chat messages. Prediction accuracies up to 84.2% are achieved, illustrating the applicability of these techniques to gender prediction. Moreover, the reverse problem is exploited, and the effect of gender on the writing style is discussed. © Springer-Verlag Berlin Heidelberg 2006
A statistical investigation into the provenance of De Doctrina Christiana, attributed to John Milton
SIGLEAvailable from British Library Document Supply Centre-DSC:DXN014527 / BLDSC - British Library Document Supply CentreGBUnited Kingdo
How variable may a constant be? Measures of lexical richness in perspective
Item does not contain fulltext30 p
Enhancing LNRE models with partition-based adjustment
Item does not contain fulltex
A mixture model for a uni-modal word frequency distribution
Item does not contain fulltex
Sample-size invariance of LNRE model parameters: Problems and opportunities
Item does not contain fulltext10 p
The subjects as a simple random effect fallacy: Subject variability and morphological family effects in the mental lexicon
This is a methodological study addressing the appropriateness of standard by–subject and by–item averaging procedures for the analysis of repeated–measures designs. By means of a reanalysis of published data (Schreuder & Baayen, 1997), using random regression models, we present a proof of existence of systematic variability between participants that is ignored in the standard psycholinguistic analytical procedures. By applying linear mixed effects modeling (Pinheiro & Bates, 2000), we call attention to the potential lack of power of the by-subject and by-item analyses, which in this case study fail to reveal the coexistence of a facilitatory family size effect and an inhibitory family frequency effect in visual and auditory lexical processing
Outside the caves of shadows: using syntactic annotation to enhance authorship attribution
Item does not contain fulltex
An experiment in authorship attribution
Item does not contain fulltextJADT 2002, 13 maart 20028 p
New machine learning methods demonstrate the theorized existence of a human stylome
Contains fulltext :
42498.pdf (publisher's version ) (Closed access)14 p