research

Data reduction typology and the bimodal distribution bias

Abstract

Confronting low data reduction typologies, as established by using data from parallel texts, with the high data reduction typologies of WALS reveals a systematic bias of WALS typologies toward highly bimodal distribution. Properties with a distribution supporting a discrete feature analysis in many languages are likelier to be represented in WALS and to be represented accurately. This bias has important consequences when WALS typologies are interpreted theoretically or further processed statisticall

    Similar works