Distributed representation of multi-sense words: A loss-driven approach

A Banerjee; IS Dhillon; J Duchi; Rami Al-Rfou

research

Distributed representation of multi-sense words: A loss-driven approach

Authors: A Banerjee
IS Dhillon
J Duchi
Rami Al-Rfou
Publication date: 14 April 2019
Publisher: 'Springer Science and Business Media LLC'
Doi

Abstract

Word2Vec's Skip Gram model is the current state-of-the-art approach for estimating the distributed representation of words. However, it assumes a single vector per word, which is not well-suited for representing words that have multiple senses. This work presents LDMI, a new model for estimating distributional representations of words. LDMI relies on the idea that, if a word carries multiple senses, then having a different representation for each of its senses should lead to a lower loss associated with predicting its co-occurring words, as opposed to the case when a single vector representation is used for all the senses. After identifying the multi-sense words, LDMI clusters the occurrences of these words to assign a sense to each occurrence. Experiments on the contextual word similarity task show that LDMI leads to better performance than competing approaches.Comment: PAKDD 2018 Best paper award runner-u

Similar works

Full text

Available Versions

Crossref

info:doi/10.1007%2F978-3-319-9...

Last time updated on 10/08/2021