Learning Word Representations with Hierarchical Sparse Coding

Dyer, Chris; Faruqui, Manaal; Smith, Noah A.; Yogatama, Dani

Authors: Chris Dyer, Manaal Faruqui, Noah A. Smith, Dani Yogatama
Publication date: 6 November 2014

Abstract

We propose a new method for learning word representations that applies hierarchical regularization to sparse coding, inspired by the linguistic study of word meanings. We present an efficient learning algorithm based on stochastic proximal methods that is significantly faster than previous approaches, making it possible to perform hierarchical sparse coding on a corpus of billions of word tokens. Experiments on various benchmark tasks (word similarity ranking, analogies, sentence completion, and sentiment analysis) demonstrate that the method outperforms or is competitive with state-of-the-art methods. Our word representations are available at http://www.ark.cs.cmu.edu/dyogatam/wordvecs/.
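
The abstract does not spell out the regularizer or the update rule. As a rough illustration of what a proximal step for hierarchical sparsity can look like, the sketch below assumes a tree-structured group lasso penalty, in which each group is a node of the tree together with all of its descendants, and applies group soft-thresholding from the leaves up to the root. The tree layout, the penalty weight `lam`, and every function name here are hypothetical choices made for this example and are not taken from the paper.

```python
import math


def group_soft_threshold(a, idx, lam):
    """Shrink the sub-vector a[idx] toward zero by lam in l2 norm (in place)."""
    norm = math.sqrt(sum(a[i] * a[i] for i in idx))
    scale = max(0.0, 1.0 - lam / norm) if norm > 0.0 else 0.0
    for i in idx:
        a[i] *= scale


def descendants(children, v):
    """Return v followed by every node below it in the tree."""
    out = [v]
    for c in children.get(v, []):
        out.extend(descendants(children, c))
    return out


def postorder(children, root):
    """Visit children before their parent, so leaf groups come first."""
    order = []
    for c in children.get(root, []):
        order.extend(postorder(children, c))
    order.append(root)
    return order


def prox_tree_group_lasso(a, children, roots, lam):
    """Apply the group shrinkage leaf-to-root; the input list is not modified."""
    a = list(a)
    for r in roots:
        for v in postorder(children, r):
            group_soft_threshold(a, descendants(children, v), lam)
    return a


def proximal_gradient_step(a, grad, step, children, roots, lam):
    """One (stochastic) proximal step: gradient move, then the proximal map."""
    moved = [ai - step * gi for ai, gi in zip(a, grad)]
    return prox_tree_group_lasso(moved, children, roots, step * lam)


# Hypothetical 4-dimensional code with tree 0 -> {1, 2}, 1 -> {3}.
children = {0: [1, 2], 1: [3]}
code = [3.0, 0.1, 4.0, 0.05]
sparse_code = prox_tree_group_lasso(code, children, roots=[0], lam=0.5)
# Dimensions 1 and 3 are driven to zero; dimensions 0 and 2 are only shrunk.
```

Because each group contains a node and everything beneath it, zeroing a parent dimension also zeroes its descendants, which is the hierarchical behavior this kind of penalty is meant to encourage. In a stochastic setting, `grad` would be a gradient of the reconstruction loss estimated from a sampled subset of the data.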
