5,489 research outputs found
Polysemy Detection in Distributed Representation of Word Sense
In this paper, we propose a statistical test to determine whether a given
word is used as a polysemic word or not. The statistic of the word in this test
roughly corresponds to the fluctuation in the senses of the neighboring words a
nd the word itself. Even though the sense of a word corresponds to a single
vector, we discuss how polysemy of the words affects the position of vectors.
Finally, we also explain the method to detect this effect.Comment: The 9th International Conference on Knowledge and Smart Technology
(KST-2017
Distributed representation of multi-sense words: A loss-driven approach
Word2Vec's Skip Gram model is the current state-of-the-art approach for
estimating the distributed representation of words. However, it assumes a
single vector per word, which is not well-suited for representing words that
have multiple senses. This work presents LDMI, a new model for estimating
distributional representations of words. LDMI relies on the idea that, if a
word carries multiple senses, then having a different representation for each
of its senses should lead to a lower loss associated with predicting its
co-occurring words, as opposed to the case when a single vector representation
is used for all the senses. After identifying the multi-sense words, LDMI
clusters the occurrences of these words to assign a sense to each occurrence.
Experiments on the contextual word similarity task show that LDMI leads to
better performance than competing approaches.Comment: PAKDD 2018 Best paper award runner-u
Distributed Representation of Geometrically Correlated Images with Compressed Linear Measurements
This paper addresses the problem of distributed coding of images whose
correlation is driven by the motion of objects or positioning of the vision
sensors. It concentrates on the problem where images are encoded with
compressed linear measurements. We propose a geometry-based correlation model
in order to describe the common information in pairs of images. We assume that
the constitutive components of natural images can be captured by visual
features that undergo local transformations (e.g., translation) in different
images. We first identify prominent visual features by computing a sparse
approximation of a reference image with a dictionary of geometric basis
functions. We then pose a regularized optimization problem to estimate the
corresponding features in correlated images given by quantized linear
measurements. The estimated features have to comply with the compressed
information and to represent consistent transformation between images. The
correlation model is given by the relative geometric transformations between
corresponding features. We then propose an efficient joint decoding algorithm
that estimates the compressed images such that they stay consistent with both
the quantized measurements and the correlation model. Experimental results show
that the proposed algorithm effectively estimates the correlation between
images in multi-view datasets. In addition, the proposed algorithm provides
effective decoding performance that compares advantageously to independent
coding solutions as well as state-of-the-art distributed coding schemes based
on disparity learning
Automatic estimation of harmonic tension by distributed representation of chords
The buildup and release of a sense of tension is one of the most essential
aspects of the process of listening to music. A veridical computational model
of perceived musical tension would be an important ingredient for many music
informatics applications. The present paper presents a new approach to
modelling harmonic tension based on a distributed representation of chords. The
starting hypothesis is that harmonic tension as perceived by human listeners is
related, among other things, to the expectedness of harmonic units (chords) in
their local harmonic context. We train a word2vec-type neural network to learn
a vector space that captures contextual similarity and expectedness, and define
a quantitative measure of harmonic tension on top of this. To assess the
veridicality of the model, we compare its outputs on a number of well-defined
chord classes and cadential contexts to results from pertinent empirical
studies in music psychology. Statistical analysis shows that the model's
predictions conform very well with empirical evidence obtained from human
listeners.Comment: 12 pages, 4 figures. To appear in Proceedings of the 13th
International Symposium on Computer Music Multidisciplinary Research (CMMR),
Porto, Portuga
A Novel Distributed Representation of News (DRNews) for Stock Market Predictions
In this study, a novel Distributed Representation of News (DRNews) model is
developed and applied in deep learning-based stock market predictions. With the
merit of integrating contextual information and cross-documental knowledge, the
DRNews model creates news vectors that describe both the semantic information
and potential linkages among news events through an attributed news network.
Two stock market prediction tasks, namely the short-term stock movement
prediction and stock crises early warning, are implemented in the framework of
the attention-based Long Short Term-Memory (LSTM) network. It is suggested that
DRNews substantially enhances the results of both tasks comparing with five
baselines of news embedding models. Further, the attention mechanism suggests
that short-term stock trend and stock market crises both receive influences
from daily news with the former demonstrates more critical responses on the
information related to the stock market {\em per se}, whilst the latter draws
more concerns on the banking sector and economic policies.Comment: 25 page
- …