1 research outputs found

    TOPIC MODELING FOR EMAIL SUBJECT LINE ANALYSIS

    Get PDF
    Email processing is an emerging area in natural language processing and machine learning. Archivists often must make judgements about the relevance and record status of email messages. This study is an attempt to streamline that process by testing subject line and message body analysis using topic modeling. Specifically, using the Enron Corpus and Latent Dirichlet Allocation, this study investigates the extent to which email subject lines can be used to predict the content of email messages to support efficient archival processing.Master of Science in Information Scienc
    corecore