61 research outputs found
A Model-Agnostic Framework for Recommendation via Interest-aware Item Embeddings
Item representation holds significant importance in recommendation systems,
which encompasses domains such as news, retail, and videos. Retrieval and
ranking models utilise item representation to capture the user-item
relationship based on user behaviours. While existing representation learning
methods primarily focus on optimising item-based mechanisms, such as attention
and sequential modelling. However, these methods lack a modelling mechanism to
directly reflect user interests within the learned item representations.
Consequently, these methods may be less effective in capturing user interests
indirectly. To address this challenge, we propose a novel Interest-aware
Capsule network (IaCN) recommendation model, a model-agnostic framework that
directly learns interest-oriented item representations. IaCN serves as an
auxiliary task, enabling the joint learning of both item-based and
interest-based representations. This framework adopts existing recommendation
models without requiring substantial redesign. We evaluate the proposed
approach on benchmark datasets, exploring various scenarios involving different
deep neural networks, behaviour sequence lengths, and joint learning ratios of
interest-oriented item representations. Experimental results demonstrate
significant performance enhancements across diverse recommendation models,
validating the effectiveness of our approach.Comment: Accepted Paper under LBR track in the Seventeenth ACM Conference on
Recommender Systems (RecSys) 202
AdaptDHM: Adaptive Distribution Hierarchical Model for Multi-Domain CTR Prediction
Large-scale commercial platforms usually involve numerous business domains
for diverse business strategies and expect their recommendation systems to
provide click-through rate (CTR) predictions for multiple domains
simultaneously. Existing promising and widely-used multi-domain models discover
domain relationships by explicitly constructing domain-specific networks, but
the computation and memory boost significantly with the increase of domains. To
reduce computational complexity, manually grouping domains with particular
business strategies is common in industrial applications. However, this
pre-defined data partitioning way heavily relies on prior knowledge, and it may
neglect the underlying data distribution of each domain, hence limiting the
model's representation capability. Regarding the above issues, we propose an
elegant and flexible multi-distribution modeling paradigm, named Adaptive
Distribution Hierarchical Model (AdaptDHM), which is an end-to-end optimization
hierarchical structure consisting of a clustering process and classification
process. Specifically, we design a distribution adaptation module with a
customized dynamic routing mechanism. Instead of introducing prior knowledge
for pre-defined data allocation, this routing algorithm adaptively provides a
distribution coefficient for each sample to determine which cluster it belongs
to. Each cluster corresponds to a particular distribution so that the model can
sufficiently capture the commonalities and distinctions between these distinct
clusters. Extensive experiments on both public and large-scale Alibaba
industrial datasets verify the effectiveness and efficiency of AdaptDHM: Our
model achieves impressive prediction accuracy and its time cost during the
training stage is more than 50% less than that of other models
Rethinking Multi-Interest Learning for Candidate Matching in Recommender Systems
Existing research efforts for multi-interest candidate matching in
recommender systems mainly focus on improving model architecture or
incorporating additional information, neglecting the importance of training
schemes. This work revisits the training framework and uncovers two major
problems hindering the expressiveness of learned multi-interest
representations. First, the current training objective (i.e., uniformly sampled
softmax) fails to effectively train discriminative representations in a
multi-interest learning scenario due to the severe increase in easy negative
samples. Second, a routing collapse problem is observed where each learned
interest may collapse to express information only from a single item, resulting
in information loss. To address these issues, we propose the REMI framework,
consisting of an Interest-aware Hard Negative mining strategy (IHN) and a
Routing Regularization (RR) method. IHN emphasizes interest-aware hard
negatives by proposing an ideal sampling distribution and developing a
Monte-Carlo strategy for efficient approximation. RR prevents routing collapse
by introducing a novel regularization term on the item-to-interest routing
matrices. These two components enhance the learned multi-interest
representations from both the optimization objective and the composition
information. REMI is a general framework that can be readily applied to various
existing multi-interest candidate matching methods. Experiments on three
real-world datasets show our method can significantly improve state-of-the-art
methods with easy implementation and negligible computational overhead. The
source code will be released.Comment: RecSys 202
- …