318 research outputs found
Whole-Chain Recommendations
With the recent prevalence of Reinforcement Learning (RL), there have been
tremendous interests in developing RL-based recommender systems. In practical
recommendation sessions, users will sequentially access multiple scenarios,
such as the entrance pages and the item detail pages, and each scenario has its
specific characteristics. However, the majority of existing RL-based
recommender systems focus on optimizing one strategy for all scenarios or
separately optimizing each strategy, which could lead to sub-optimal overall
performance. In this paper, we study the recommendation problem with multiple
(consecutive) scenarios, i.e., whole-chain recommendations. We propose a
multi-agent RL-based approach (DeepChain), which can capture the sequential
correlation among different scenarios and jointly optimize multiple
recommendation strategies. To be specific, all recommender agents (RAs) share
the same memory of users' historical behaviors, and they work collaboratively
to maximize the overall reward of a session. Note that optimizing multiple
recommendation strategies jointly faces two challenges in the existing
model-free RL model - (i) it requires huge amounts of user behavior data, and
(ii) the distribution of reward (users' feedback) are extremely unbalanced. In
this paper, we introduce model-based RL techniques to reduce the training data
requirement and execute more accurate strategy updates. The experimental
results based on a real e-commerce platform demonstrate the effectiveness of
the proposed framework.Comment: 29th ACM International Conference on Information and Knowledge
Managemen
Recommended from our members
A words-of-interest model of sketch representation for image retrieval
In this paper we propose a method for sketch-based image retrieval. Sketch is a magical medium which is capable of conveying semantic messages for user. It’s in accordance with user’s cognitive psychology to retrieve images with sketch. In order to narrow down the semantic gap between the user and the images in database, we preprocess all the images into sketches by the coherent line drawing algorithm. During the process of sketches extraction, saliency maps are used to filter out the redundant background information, while preserve the important semantic information. We use a variant of Words-of-Interest model to retrieve relevant images for the user according to the query. Words-of-Interest (WoI) model is based on Bag-ofvisual Words (BoW) model, which has been proven successfully for information retrieval. Bag-of-Words ignores the spatial relationships among visual words, which are important for sketch representation. Our method takes advantage of the spatial information of the query to select words of interest. Experimental results demonstrate that our sketch-based retrieval method achieves a good tradeoff between retrieval accuracy and semantic representation of users’ query
Disentangled Contrastive Collaborative Filtering
Recent studies show that graph neural networks (GNNs) are prevalent to model
high-order relationships for collaborative filtering (CF). Towards this
research line, graph contrastive learning (GCL) has exhibited powerful
performance in addressing the supervision label shortage issue by learning
augmented user and item representations. While many of them show their
effectiveness, two key questions still remain unexplored: i) Most existing
GCL-based CF models are still limited by ignoring the fact that user-item
interaction behaviors are often driven by diverse latent intent factors (e.g.,
shopping for family party, preferred color or brand of products); ii) Their
introduced non-adaptive augmentation techniques are vulnerable to noisy
information, which raises concerns about the model's robustness and the risk of
incorporating misleading self-supervised signals. In light of these
limitations, we propose a Disentangled Contrastive Collaborative Filtering
framework (DCCF) to realize intent disentanglement with self-supervised
augmentation in an adaptive fashion. With the learned disentangled
representations with global context, our DCCF is able to not only distill
finer-grained latent factors from the entangled self-supervision signals but
also alleviate the augmentation-induced noise. Finally, the cross-view
contrastive learning task is introduced to enable adaptive augmentation with
our parameterized interaction mask generator. Experiments on various public
datasets demonstrate the superiority of our method compared to existing
solutions. Our model implementation is released at the link
https://github.com/HKUDS/DCCF.Comment: Published as a SIGIR'23 full pape
Recommendations with Negative Feedback via Pairwise Deep Reinforcement Learning
Recommender systems play a crucial role in mitigating the problem of
information overload by suggesting users' personalized items or services. The
vast majority of traditional recommender systems consider the recommendation
procedure as a static process and make recommendations following a fixed
strategy. In this paper, we propose a novel recommender system with the
capability of continuously improving its strategies during the interactions
with users. We model the sequential interactions between users and a
recommender system as a Markov Decision Process (MDP) and leverage
Reinforcement Learning (RL) to automatically learn the optimal strategies via
recommending trial-and-error items and receiving reinforcements of these items
from users' feedback. Users' feedback can be positive and negative and both
types of feedback have great potentials to boost recommendations. However, the
number of negative feedback is much larger than that of positive one; thus
incorporating them simultaneously is challenging since positive feedback could
be buried by negative one. In this paper, we develop a novel approach to
incorporate them into the proposed deep recommender system (DEERS) framework.
The experimental results based on real-world e-commerce data demonstrate the
effectiveness of the proposed framework. Further experiments have been
conducted to understand the importance of both positive and negative feedback
in recommendations.Comment: arXiv admin note: substantial text overlap with arXiv:1801.0020
Nanoporous platinum electrode grown on anodic aluminum oxide membrane: Fabrication, characterization, electrocatalytic activity toward reactive oxygen and nitrogen species
A new type of nanoelectrode, nanoporous platinum (NPt) electrode was prepared on aluminum oxide membrane by thermal evaporation deposition. The morphology, conductivity and electrocatalytic activity of NPt electrode were characterized and compared with those of nanofilm-Pt electrode through scanning electron microscopy (SEM), electrochemical impedance spectroscopy (EIS) and cyclic voltammetry (CV) techniques, respectively. SEM images showed that "nanocavities" observed in NPt electrode were actually 2-dimensional enclosures by linked nanoparticles. It was different from the conventional arrays of "nanocavities" formed on homogeneous metal films. EIS data indicated that NPt electrode possesses higher conductivity. Compared with that on nanofilm-Pt electrode (14.05 Omega.cm(2)), the impedance spectrum on NPt electrode exhibits a semicircle portion with much smaller diameters (1.24 Omega.cm(2) for NPt-100, 1.48 Omega.cm(2) for NPt-200). Meanwhile, the response sensitivity of NPt electrode to O-2 is 0.85 mA cm(-2), which is larger than that of nanofilm-Pt electrode (0.54 mA cm(-2)). The largest catalytic current for nitric oxide (NO) was obtained in buffer with pH value of 9.4 while for Angeli's salt (AS) was obtained in buffer with pH value of 5.4. Additionally, electrocatalytic mechanisms of NPt electrode toward NO and AS were proposed, which indicating it depended on pH value of buffer solution. (C) 2018 Elsevier B.V. All rights reserved
- …