Search CORE

2,239 research outputs found

Enhancing Domain Word Embedding via Latent Semantic Imputation

Author: Lai Siwei
Lin Frank
Mikolov Tomas
van der Maaten Laurens
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 21/05/2019
Field of study

We present a novel method named Latent Semantic Imputation (LSI) to transfer external knowledge into semantic space for enhancing word embedding. The method integrates graph theory to extract the latent manifold structure of the entities in the affinity space and leverages non-negative least squares with standard simplex constraints and power iteration method to derive spectral embeddings. It provides an effective and efficient approach to combining entity representations defined in different Euclidean spaces. Specifically, our approach generates and imputes reliable embedding vectors for low-frequency words in the semantic space and benefits downstream language tasks that depend on word embedding. We conduct comprehensive experiments on a carefully designed classification problem and language modeling and demonstrate the superiority of the enhanced embedding via LSI over several well-known benchmark embeddings. We also confirm the consistency of the results under different parameter settings of our method.Comment: ACM SIGKDD 201

arXiv.org e-Print Archive

Crossref

Data-driven Efficient Solvers and Predictions of Conformational Transitions for Langevin Dynamics on Manifold in High Dimensions

Author: Gao Yuan
Liu Jian-Guo
Wu Nan
Publication venue
Publication date: 22/05/2020
Field of study

We work on dynamic problems with collected data

\{\mathsf{x}_i\}

that distributed on a manifold

\mathcal{M}\subset\mathbb{R}^p

. Through the diffusion map, we first learn the reaction coordinates

\{\mathsf{y}_i\}\subset \mathcal{N}

where

\mathcal{N}

is a manifold isometrically embedded into an Euclidean space

\mathbb{R}^\ell

for

\ell \ll p

. The reaction coordinates enable us to obtain an efficient approximation for the dynamics described by a Fokker-Planck equation on the manifold

\mathcal{N}

. By using the reaction coordinates, we propose an implementable, unconditionally stable, data-driven upwind scheme which automatically incorporates the manifold structure of

\mathcal{N}

. Furthermore, we provide a weighted

L^2

convergence analysis of the upwind scheme to the Fokker-Planck equation. The proposed upwind scheme leads to a Markov chain with transition probability between the nearest neighbor points. We can benefit from such property to directly conduct manifold-related computations such as finding the optimal coarse-grained network and the minimal energy path that represents chemical reactions or conformational changes. To establish the Fokker-Planck equation, we need to acquire information about the equilibrium potential of the physical system on

\mathcal{N}

. Hence, we apply a Gaussian Process regression algorithm to generate equilibrium potential for a new physical system with new parameters. Combining with the proposed upwind scheme, we can calculate the trajectory of the Fokker-Planck equation on

\mathcal{N}

based on the generated equilibrium potential. Finally, we develop an algorithm to pullback the trajectory to the original high dimensional space as a generative data for the new physical system.Comment: 59 pages, 16 figure

arXiv.org e-Print Archive

Zero-Shot Learning on Semantic Class Prototype Graph

Author: Fu Z
Gong S
Kodirov E
Xiang T
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 02/08/2017
Field of study

Queen Mary Research Online

Learning Edge Representations via Low-Rank Asymmetric Projections

Author: Bruna Joan
Cao Shaosheng
Chen Haochen
Dai Hanjun
Gori M.
Ioffe Sergey
Li Y.
Luo Y.
Mikolov T.
Niepert M.
Pan S.
Wang H.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 13/09/2017
Field of study

We propose a new method for embedding graphs while preserving directed edge information. Learning such continuous-space vector representations (or embeddings) of nodes in a graph is an important first step for using network information (from social networks, user-item graphs, knowledge bases, etc.) in many machine learning tasks. Unlike previous work, we (1) explicitly model an edge as a function of node embeddings, and we (2) propose a novel objective, the "graph likelihood", which contrasts information from sampled random walks with non-existent edges. Individually, both of these contributions improve the learned representations, especially when there are memory constraints on the total size of the embeddings. When combined, our contributions enable us to significantly improve the state-of-the-art by learning more concise representations that better preserve the graph structure. We evaluate our method on a variety of link-prediction task including social networks, collaboration networks, and protein interactions, showing that our proposed method learn representations with error reductions of up to 76% and 55%, on directed and undirected graphs. In addition, we show that the representations learned by our method are quite space efficient, producing embeddings which have higher structure-preserving accuracy but are 10 times smaller

arXiv.org e-Print Archive

Crossref

Simultaneous Learning of Nonlinear Manifold and Dynamical Models for High-dimensional Time Series

Author: Li Rui
Sclaroff Stan
Tian Tai-Peng
Publication venue: Boston University Computer Science Department
Publication date: 01/01/2007
Field of study

The goal of this work is to learn a parsimonious and informative representation for high-dimensional time series. Conceptually, this comprises two distinct yet tightly coupled tasks: learning a low-dimensional manifold and modeling the dynamical process. These two tasks have a complementary relationship as the temporal constraints provide valuable neighborhood information for dimensionality reduction and conversely, the low-dimensional space allows dynamics to be learnt efficiently. Solving these two tasks simultaneously allows important information to be exchanged mutually. If nonlinear models are required to capture the rich complexity of time series, then the learning problem becomes harder as the nonlinearities in both tasks are coupled. The proposed solution approximates the nonlinear manifold and dynamics using piecewise linear models. The interactions among the linear models are captured in a graphical model. By exploiting the model structure, efficient inference and learning algorithms are obtained without oversimplifying the model of the underlying dynamical process. Evaluation of the proposed framework with competing approaches is conducted in three sets of experiments: dimensionality reduction and reconstruction using synthetic time series, video synthesis using a dynamic texture database, and human motion synthesis, classification and tracking on a benchmark data set. In all experiments, the proposed approach provides superior performance.National Science Foundation (IIS 0308213, IIS 0329009, CNS 0202067

CiteSeerX

Boston University Institutional Repository (OpenBU)