The goal of sequential recommendation (SR) is to predict the items a user is
potentially interested in based on her/his historical interaction sequences. Most
existing sequential recommenders are developed based on ID features, which,
despite their widespread use, often underperform with sparse IDs and struggle
with the cold-start problem. Moreover, inconsistent ID mappings hinder the
model's transferability, isolating similar recommendation domains that could
have been co-optimized. This paper aims to address these issues by exploring
the potential of multi-modal information in learning robust and generalizable
sequence representations. We propose MISSRec, a multi-modal pre-training and
transfer learning framework for SR. On the user side, we design a
Transformer-based encoder-decoder model, where the contextual encoder learns to
capture the sequence-level multi-modal synergy while a novel interest-aware
decoder is developed to model item-modality-interest relations for better
sequence representation. On the candidate item side, we adopt a dynamic fusion
module to produce user-adaptive item representations, providing more precise
matching between users and items. We pre-train the model with contrastive
learning objectives and fine-tune it in an efficient manner. Extensive
experiments demonstrate the effectiveness and flexibility of MISSRec, promising
a practical solution for real-world recommendation scenarios.
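
To make the user-item matching described above concrete, the following is a minimal PyTorch-style sketch, not the authors' implementation, of a user-adaptive dynamic fusion step over candidate-item modality features combined with an in-batch contrastive (InfoNCE-style) objective. All module and function names here are illustrative assumptions; MISSRec's actual fusion mechanism and pre-training objectives may differ in detail.

```python
# Hypothetical sketch: user-adaptive fusion of item modality features plus an
# in-batch contrastive loss. Not the authors' code; names are illustrative.
import torch
import torch.nn as nn
import torch.nn.functional as F

class DynamicFusion(nn.Module):
    """Fuse per-item text/image features with weights conditioned on the user embedding."""
    def __init__(self, dim: int):
        super().__init__()
        self.gate = nn.Linear(2 * dim, 2)  # one fusion weight per modality

    def forward(self, user_emb, item_text, item_image):
        # user_emb: (B, D); item_text / item_image: (B, N, D) for N candidate items
        u = user_emb.unsqueeze(1).expand(-1, item_text.size(1), -1)
        # modality weights conditioned on the (user, candidate item) pair
        w = F.softmax(self.gate(torch.cat([u, item_text + item_image], dim=-1)), dim=-1)
        return w[..., :1] * item_text + w[..., 1:] * item_image  # (B, N, D)

def info_nce(user_emb, pos_item_emb, temperature: float = 0.07):
    """In-batch contrastive loss: the matching row is the positive, other rows are negatives."""
    u = F.normalize(user_emb, dim=-1)
    v = F.normalize(pos_item_emb, dim=-1)
    logits = u @ v.t() / temperature               # (B, B) similarity matrix
    labels = torch.arange(u.size(0), device=u.device)
    return F.cross_entropy(logits, labels)

if __name__ == "__main__":
    B, N, D = 4, 8, 64
    fusion = DynamicFusion(D)
    user = torch.randn(B, D)   # stand-in for the encoder-decoder's sequence representation
    fused = fusion(user, torch.randn(B, N, D), torch.randn(B, N, D))
    loss = info_nce(user, fused[:, 0])  # treat each user's first candidate as the positive
    print(fused.shape, loss.item())
```

In this sketch the user embedding would come from the Transformer-based encoder-decoder described above; in-batch negatives are one common choice for contrastive pre-training and are used here only for illustration.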