2 research outputs found

Righteousness in Early Christian Literature : Distant Reading and Textual Networks

Authors: Glomb Tomáš, Kaše Vojtěch, Nikki Nina
Publication date: 01/01/2022

Peer reviewed

University of West Bohemia Digital Library

Helsingin yliopiston digitaalinen arkisto

DSpace at University of West Bohemia

Deriving word association networks from text corpora

Authors: Bruza Peter, Galea David
Publication venue: Sun SITE Central Europe (CEUR)
Publication date: 01/01/2015

This article presents and evaluates a model for automatically deriving word association networks from text corpora. Two aspects were evaluated: the degree to which corpus-based word association networks (CANs) approximate human word association networks with respect to (1) their ability to quantitatively predict word associations and (2) their structural network characteristics. Word association networks are the basis of the human mental lexicon. However, extracting such networks from human subjects is laborious and time-consuming, and thus necessarily limited relative to the breadth of human vocabulary. Automatic derivation of word associations from text corpora would address these limitations. In both evaluations, corpus-based processing provided vector representations for words. These representations were then used to derive CANs using two measures: (1) the well-known cosine metric, which is symmetric, and (2) a new asymmetric measure computed from orthogonal vector projections. For both evaluations, the full set of 4068 free association networks (FANs) from the University of South Florida word association norms was used as baseline human data. Two corpus-based models were benchmarked for comparison: a latent topic model and latent semantic analysis (LSA). We observed that CANs constructed using the asymmetric measure were slightly less effective than the topic model in quantitatively predicting free associates, and slightly better than LSA. The structural network analysis revealed that CANs do approximate the FANs to an encouraging degree.

Queensland University of Technology ePrints Archive
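
The abstract above contrasts a symmetric cosine measure with an asymmetric measure based on orthogonal vector projections. The listing does not give the paper's exact formula, so the following is only a minimal sketch under stated assumptions: `projection_strength` is a hypothetical stand-in (the coefficient of the orthogonal projection of one word vector onto another), and the toy vectors are invented for illustration, not taken from any corpus.

```python
import math


def dot(u, v):
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(u, v))


def cosine(u, v):
    """Cosine similarity; symmetric, so cosine(u, v) == cosine(v, u)."""
    return dot(u, v) / (math.sqrt(dot(u, u)) * math.sqrt(dot(v, v)))


def projection_strength(u, v):
    """Hypothetical asymmetric measure (an assumption, not the paper's formula):
    the coefficient c such that c * v is the orthogonal projection of u onto v.
    In general projection_strength(u, v) != projection_strength(v, u)."""
    return dot(u, v) / dot(v, v)


# Toy word vectors, invented for illustration only.
vectors = {
    "dog": [2.0, 1.0, 0.0],
    "animal": [4.0, 2.0, 2.0],
}

dog, animal = vectors["dog"], vectors["animal"]

sym_ab = cosine(dog, animal)
sym_ba = cosine(animal, dog)
asym_ab = projection_strength(dog, animal)
asym_ba = projection_strength(animal, dog)

print("cosine(dog, animal):", sym_ab)
print("cosine(animal, dog):", sym_ba)
print("projection(dog -> animal):", asym_ab)
print("projection(animal -> dog):", asym_ba)
```

The cosine value is the same in both directions, while the projection coefficient depends on which word is treated as the cue. That directional property is what makes a projection-style measure a candidate for modelling free association, where the strength from cue to associate typically differs from the reverse.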