Modeling Social Media Content with Word Vectors for Recommendation

Author: DM Blei
F Wang
G Chen
J Nocedal
SM Kywe
T Mikolov
Y Koren
Y Shi
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/12/2015

Crossref

Institutional Knowledge at Singapore Management University

Ask the GRU: Multi-Task Learning for Deep Text Recommendations

Author: Basu Chumki
Dai Andrew M
den Oord Aaron Van
Gopalan Prem K
He R.
Jozefowicz Rafal
Melville Prem
Mikolov Tomas
Mnih Andriy
Rendle Steffen
Sutskever Ilya
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 09/09/2016

In a variety of application domains the content to be recommended to users is associated with text. This includes research papers, movies with associated plot summaries, news articles, blog posts, etc. Recommendation approaches based on latent factor models can be extended naturally to leverage text by employing an explicit mapping from text to factors. This enables recommendations for new, unseen content, and may generalize better, since the factors for all items are produced by a compactly-parametrized model. Previous work has used topic models or averages of word embeddings for this mapping. In this paper we present a method leveraging deep recurrent neural networks to encode the text sequence into a latent vector, specifically gated recurrent units (GRUs) trained end-to-end on the collaborative filtering task. For the task of scientific paper recommendation, this yields models with significantly higher accuracy. In cold-start scenarios, we beat the previous state-of-the-art methods, all of which ignore word order. Performance is further improved by multi-task learning, where the text encoder network is trained for a combination of content recommendation and item metadata prediction. This regularizes the collaborative filtering model, ameliorating the problem of sparsity of the observed rating matrix. Comment: 8 page
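The idea of an explicit mapping from text to factors can be illustrated with a minimal sketch. It uses the simpler averaged-word-embedding mapping that the abstract attributes to previous work, not the GRU encoder the paper proposes. The names `encode_text`, `score`, and `WORD_VECS`, and all the toy vectors, are hypothetical and chosen only for illustration.

```python
from typing import Dict, List

def encode_text(tokens: List[str], word_vecs: Dict[str, List[float]]) -> List[float]:
    """Map item text to latent factors by averaging the embeddings of known tokens."""
    dim = len(next(iter(word_vecs.values())))
    known = [word_vecs[t] for t in tokens if t in word_vecs]
    if not known:
        # No known words: fall back to a zero vector (an assumption of this sketch).
        return [0.0] * dim
    return [sum(v[i] for v in known) / len(known) for i in range(dim)]

def score(user_factors: List[float], item_factors: List[float]) -> float:
    """Latent factor model score: dot product of user and item factors."""
    return sum(u * v for u, v in zip(user_factors, item_factors))

# Toy 2-dimensional embeddings (hypothetical values).
WORD_VECS = {
    "neural": [1.0, 0.0],
    "network": [0.8, 0.2],
    "protein": [0.0, 1.0],
    "folding": [0.1, 0.9],
}

# A user whose learned factors favor the first latent dimension.
user = [1.0, 0.0]

# Two new items with no rating history: factors come from text alone.
ml_paper = encode_text(["neural", "network", "the"], WORD_VECS)
bio_paper = encode_text(["protein", "folding"], WORD_VECS)

print(score(user, ml_paper), score(user, bio_paper))
```

Because item factors are computed from text rather than learned per item, both unseen papers can be scored immediately, which is the cold-start property the abstract describes. An order-aware encoder such as a GRU would take the place of `encode_text` in this sketch.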

arXiv.org e-Print Archive

Crossref

DocTag2Vec: An Embedding Based Multi-label Learning Approach for Document Tagging

Author: Chen Sheng
Mehdad Yashar
Pappu Aasish
Soni Akshay
Publication date: 01/01/2017

Tagging news articles or blog posts with relevant tags from a collection of predefined ones is coined as document tagging in this work. Accurate tagging of articles can benefit several downstream applications such as recommendation and search. In this work, we propose a novel yet simple approach called DocTag2Vec to accomplish this task. We substantially extend Word2Vec and Doc2Vec, two popular models for learning distributed representations of words and documents. In DocTag2Vec, we simultaneously learn the representation of words, documents, and tags in a joint vector space during training, and employ the simple k-nearest neighbor search to predict tags for unseen documents. In contrast to previous multi-label learning methods, DocTag2Vec directly deals with raw text instead of a provided feature vector, and in addition, enjoys advantages like the learning of tag representation, and the ability to handle newly created tags. To demonstrate the effectiveness of our approach, we conduct experiments on several datasets and show promising results against state-of-the-art methods. Comment: 10 page
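The prediction step described in the abstract, a k-nearest neighbor search in a joint vector space, might look roughly like the sketch below. It assumes the document and tag embeddings have already been learned, and it assumes cosine similarity as the distance measure; the abstract specifies neither detail. `predict_tags`, `TAG_VECS`, and the toy vectors are hypothetical.

```python
import math
from typing import Dict, List

def cosine(a: List[float], b: List[float]) -> float:
    """Cosine similarity; returns 0.0 if either vector has zero norm."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)

def predict_tags(doc_vec: List[float], tag_vecs: Dict[str, List[float]], k: int = 2) -> List[str]:
    """Return the k tags whose embeddings are most similar to the document embedding."""
    ranked = sorted(tag_vecs, key=lambda tag: cosine(doc_vec, tag_vecs[tag]), reverse=True)
    return ranked[:k]

# Toy tag embeddings in a shared 3-dimensional space (hypothetical values).
TAG_VECS = {
    "sports": [1.0, 0.0, 0.0],
    "politics": [0.0, 1.0, 0.0],
    "finance": [0.0, 0.7, 0.7],
}

# Embedding of an unseen document, assumed to be inferred by the trained model.
doc = [0.1, 0.9, 0.1]

print(predict_tags(doc, TAG_VECS, k=2))
```

In this sketch, handling a newly created tag amounts to adding one more entry to `TAG_VECS`; the search itself does not change.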

arXiv.org e-Print Archive

Crossref