3,965 research outputs found

    Spartan Daily, March 3, 2008

    Volume 130, Issue 22

    Spartan Daily May 2, 2011

    Volume 136, Issue 47

    TransNets: Learning to Transform for Recommendation

    Recently, deep learning methods have been shown to improve the performance of recommender systems over traditional methods, especially when review text is available. For example, a recent model, DeepCoNN, uses neural nets to learn one latent representation for the text of all reviews written by a target user, and a second latent representation for the text of all reviews for a target item, and then combines these latent representations to obtain state-of-the-art performance on recommendation tasks. We show that (unsurprisingly) much of the predictive value of review text comes from reviews of the target user for the target item. We then introduce a way in which this information can be used in recommendation, even when the target user's review for the target item is not available. Our model, called TransNets, extends the DeepCoNN model by introducing an additional latent layer representing the target user-target item pair. We then regularize this layer, at training time, to be similar to another latent representation of the target user's review of the target item. We show that TransNets and extensions of it improve substantially over the previous state-of-the-art. Comment: Accepted for publication in the 11th ACM Conference on Recommender Systems (RecSys 2017).
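    The following is a minimal PyTorch sketch of the TransNets idea described above, not the authors' implementation: the text encoders, layer sizes, and loss are placeholder assumptions. It shows the key mechanism, a transform layer over the user-item pair whose output is regularized at training time toward the latent representation of the target review, which is only available during training.

# Minimal TransNets-style sketch (architecture details are assumptions).
import torch
import torch.nn as nn

class SimpleTextEncoder(nn.Module):
    """Placeholder encoder: mean-pooled word embeddings standing in for the
    CNN text processors used in DeepCoNN/TransNets."""
    def __init__(self, vocab_size=10000, embed_dim=64, latent_dim=50):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, embed_dim)
        self.proj = nn.Linear(embed_dim, latent_dim)

    def forward(self, token_ids):  # token_ids: (batch, seq_len)
        return torch.tanh(self.proj(self.embed(token_ids).mean(dim=1)))

class TransNetSketch(nn.Module):
    def __init__(self, latent_dim=50):
        super().__init__()
        self.user_enc = SimpleTextEncoder(latent_dim=latent_dim)    # all reviews by the user
        self.item_enc = SimpleTextEncoder(latent_dim=latent_dim)    # all reviews of the item
        self.review_enc = SimpleTextEncoder(latent_dim=latent_dim)  # target review (train time only)
        self.transform = nn.Linear(2 * latent_dim, latent_dim)      # user-item pair -> pseudo-review
        self.rating_head = nn.Linear(latent_dim, 1)

    def forward(self, user_text, item_text, target_review=None):
        # Latent layer for the user-item pair, built without the target review.
        z = torch.tanh(self.transform(
            torch.cat([self.user_enc(user_text), self.item_enc(item_text)], dim=-1)))
        pred_rating = self.rating_head(z).squeeze(-1)
        reg_loss = None
        if target_review is not None:
            # Training-time regularizer: push z toward the encoding of the real review.
            z_review = self.review_enc(target_review).detach()
            reg_loss = nn.functional.mse_loss(z, z_review)
        return pred_rating, reg_loss

    At test time the target review is unavailable, so only the transform path is used; the regularization simply encourages that path to behave like an encoder of the missing review.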

    Vec2Vec: A Compact Neural Network Approach for Transforming Text Embeddings with High Fidelity

    Vector embeddings have become ubiquitous tools for many language-related tasks. A leading embedding model is OpenAI's text-ada-002, which can embed approximately 6,000 words into a 1,536-dimensional vector. While powerful, text-ada-002 is not open source and is only available via API. We trained a simple neural network to convert open-source 768-dimensional MPNet embeddings into text-ada-002 embeddings. We compiled a subset of 50,000 online food reviews, calculated MPNet and text-ada-002 embeddings for each review, and trained a simple neural network for 75 epochs. The neural network was designed to predict the corresponding text-ada-002 embedding for a given MPNet embedding. Our model achieved an average cosine similarity of 0.932 on 10,000 unseen reviews in our held-out test dataset. We manually assessed the quality of our predicted embeddings for vector search over text-ada-002-embedded reviews. While not as good as real text-ada-002 embeddings, the predicted embeddings were able to retrieve highly relevant reviews. Our final model, Vec2Vec, is lightweight (<80 MB) and fast. Future steps include training a neural network with a more sophisticated architecture on a larger dataset of paired embeddings to achieve greater performance. The ability to convert between and align embedding spaces may be helpful for interoperability, limiting dependence on proprietary models, protecting data privacy, reducing costs, and enabling offline operation. Comment: 14 pages, 6 figures, 5 tables.
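    Below is a minimal PyTorch sketch of the Vec2Vec training setup described above; the hidden-layer size, optimizer, and cosine-similarity loss are assumptions not stated in the abstract. It maps 768-dimensional MPNet embeddings to 1,536-dimensional text-ada-002 embeddings with a small feed-forward network trained on paired embeddings.

# Minimal Vec2Vec-style sketch (layer sizes, optimizer, and loss are assumptions).
import torch
import torch.nn as nn

class Vec2VecSketch(nn.Module):
    def __init__(self, in_dim=768, out_dim=1536, hidden_dim=1024):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(in_dim, hidden_dim),
            nn.ReLU(),
            nn.Linear(hidden_dim, out_dim),
        )

    def forward(self, mpnet_emb):   # mpnet_emb: (batch, 768)
        return self.net(mpnet_emb)  # predicted text-ada-002 embedding: (batch, 1536)

def train(model, paired_batches, epochs=75, lr=1e-3):
    """Train on an iterable of (MPNet, text-ada-002) tensor batch pairs
    by maximizing cosine similarity between predicted and true embeddings."""
    opt = torch.optim.Adam(model.parameters(), lr=lr)
    for _ in range(epochs):
        for mpnet_batch, ada_batch in paired_batches:
            pred = model(mpnet_batch)
            loss = 1.0 - nn.functional.cosine_similarity(pred, ada_batch, dim=-1).mean()
            opt.zero_grad()
            loss.backward()
            opt.step()

    Evaluation on a held-out set would then measure cosine similarity between predicted and true text-ada-002 embeddings, the metric the abstract reports (0.932 on 10,000 unseen reviews).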