Real-valued syntactic word vectors

Authors: Ali Basirat, Joakim Nivre
Publication venue: 'Informa UK Limited'
Publication date: 01/01/2020

We introduce a word embedding method that generates a set of real-valued word vectors from a distributional semantic space. The semantic space is built from a set of context units (words), which are selected by an entropy-based feature selection approach according to the certainty involved in their contextual environments. We show that the most predictive context of a target word is its preceding word. We also introduce an adaptive transformation function that reshapes the data distribution to make it suitable for dimensionality reduction techniques. The final low-dimensional word vectors are formed by the singular vectors of a matrix of transformed data. We show that the resulting word vectors are as good as other sets of word vectors generated with popular word embedding methods.
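
The entropy-based selection of context units mentioned in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the function names, the `min_count` threshold, and the rule "prefer context words with the lowest entropy over the words that follow them" are all assumptions made here for illustration, and the paper's actual criterion may differ. The sketch treats each word as a candidate context for the word that follows it, in line with the abstract's observation that the preceding word is the most predictive context.

```python
import math
from collections import Counter, defaultdict


def context_entropies(tokens):
    """Entropy (in bits) of the next-word distribution for each preceding word."""
    following = defaultdict(Counter)
    for prev, cur in zip(tokens, tokens[1:]):
        following[prev][cur] += 1

    entropies = {}
    for prev, counts in following.items():
        total = sum(counts.values())
        entropies[prev] = -sum(
            (c / total) * math.log2(c / total) for c in counts.values()
        )
    return entropies


def select_contexts(tokens, k, min_count=2):
    """Pick the k candidate context words with the lowest entropy.

    Assumption: low entropy is read as high certainty. `min_count` is a
    hypothetical filter that drops words seen too rarely as a preceding word.
    Ties are broken alphabetically so the result is deterministic.
    """
    freq = Counter(tokens[:-1])  # how often each word acts as a preceding word
    ent = context_entropies(tokens)
    candidates = [w for w in ent if freq[w] >= min_count]
    return sorted(candidates, key=lambda w: (ent[w], w))[:k]


# Toy corpus, invented for this example.
tokens = "the cat sat on the mat and the dog sat on the rug".split()
entropies = context_entropies(tokens)
selected = select_contexts(tokens, k=2)
```

In this toy corpus "the" is followed by four different words, so it has high entropy, while "on" and "sat" are each always followed by the same word and are selected first. A real corpus would need smoothing and a more careful frequency cutoff.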

Publikationer från Uppsala Universitet

Copenhagen University Research Information System

Digitala Vetenskapliga Arkivet - Academic Archive On-line

Real-valued Syntactic Word Vectors (RSV) for Greedy Neural Dependency Parsing

Authors: Ali Basirat, Joakim Nivre
Publication venue: 'Linkoping University Electronic Press'
Publication date: 01/01/2017

We show that a set of real-valued word vectors formed by the right singular vectors of a transformed co-occurrence matrix is meaningful for determining different types of dependency relations between words. Our experimental results on the task of dependency parsing confirm the superiority of these word vectors over other sets of word vectors generated by popular word embedding methods. We also study the effect of using these vectors on the accuracy of dependency parsing in different languages, compared with using more complex parsing architectures.
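
The construction described in the abstract can be written out as a short worked equation. The notation is an assumption introduced here, not taken from the paper: $C$ is the co-occurrence matrix with one row per context unit and one column per word, $f$ is the transformation applied to it, and $k$ is the target dimensionality. Whether the vectors are additionally scaled is not stated in the abstract.

```latex
% Assumed notation: C is m x n (m context units, n words), f is the transformation.
\begin{align}
  f(C) &= U \Sigma V^{\top},
    \qquad U \in \mathbb{R}^{m \times m},\;
           \Sigma \in \mathbb{R}^{m \times n},\;
           V \in \mathbb{R}^{n \times n}, \\
  \mathbf{w}_j &= \bigl( V_{j1}, V_{j2}, \dots, V_{jk} \bigr)^{\top},
    \qquad j = 1, \dots, n .
\end{align}
```

Under this orientation the columns of $V$ are the right singular vectors, and the vector $\mathbf{w}_j$ for word $j$ collects its coordinates along the $k$ right singular vectors with the largest singular values.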

Publikationer från Uppsala Universitet

Digitala Vetenskapliga Arkivet - Academic Archive On-line