Sparse Overcomplete Word Vector Representations

Dyer, Chris; Faruqui, Manaal; Smith, Noah; Tsvetkov, Yulia; Yogatama, Dani

research

Sparse Overcomplete Word Vector Representations

Authors: Chris Dyer
Manaal Faruqui
Noah Smith
Yulia Tsvetkov
Dani Yogatama
Publication date: 1 January 2015
Publisher
Doi

Abstract

Current distributed representations of words show little resemblance to theories of lexical semantics. The former are dense and uninterpretable, the latter largely based on familiar, discrete classes (e.g., supersenses) and relations (e.g., synonymy and hypernymy). We propose methods that transform word vectors into sparse (and optionally binary) vectors. The resulting representations are more similar to the interpretable features typically used in NLP, though they are discovered automatically from raw corpora. Because the vectors are highly sparse, they are computationally easy to work with. Most importantly, we find that they outperform the original vectors on benchmark tasks.Comment: Proceedings of ACL 201

Similar works

Full text

Available Versions

Crossref

info:doi/10.3115%2Fv1%2Fp15-11...

Last time updated on 19/03/2019