318 research outputs found

    Whole-Chain Recommendations

    Full text link
    With the recent prevalence of Reinforcement Learning (RL), there have been tremendous interests in developing RL-based recommender systems. In practical recommendation sessions, users will sequentially access multiple scenarios, such as the entrance pages and the item detail pages, and each scenario has its specific characteristics. However, the majority of existing RL-based recommender systems focus on optimizing one strategy for all scenarios or separately optimizing each strategy, which could lead to sub-optimal overall performance. In this paper, we study the recommendation problem with multiple (consecutive) scenarios, i.e., whole-chain recommendations. We propose a multi-agent RL-based approach (DeepChain), which can capture the sequential correlation among different scenarios and jointly optimize multiple recommendation strategies. To be specific, all recommender agents (RAs) share the same memory of users' historical behaviors, and they work collaboratively to maximize the overall reward of a session. Note that optimizing multiple recommendation strategies jointly faces two challenges in the existing model-free RL model - (i) it requires huge amounts of user behavior data, and (ii) the distribution of reward (users' feedback) are extremely unbalanced. In this paper, we introduce model-based RL techniques to reduce the training data requirement and execute more accurate strategy updates. The experimental results based on a real e-commerce platform demonstrate the effectiveness of the proposed framework.Comment: 29th ACM International Conference on Information and Knowledge Managemen

    Disentangled Contrastive Collaborative Filtering

    Full text link
    Recent studies show that graph neural networks (GNNs) are prevalent to model high-order relationships for collaborative filtering (CF). Towards this research line, graph contrastive learning (GCL) has exhibited powerful performance in addressing the supervision label shortage issue by learning augmented user and item representations. While many of them show their effectiveness, two key questions still remain unexplored: i) Most existing GCL-based CF models are still limited by ignoring the fact that user-item interaction behaviors are often driven by diverse latent intent factors (e.g., shopping for family party, preferred color or brand of products); ii) Their introduced non-adaptive augmentation techniques are vulnerable to noisy information, which raises concerns about the model's robustness and the risk of incorporating misleading self-supervised signals. In light of these limitations, we propose a Disentangled Contrastive Collaborative Filtering framework (DCCF) to realize intent disentanglement with self-supervised augmentation in an adaptive fashion. With the learned disentangled representations with global context, our DCCF is able to not only distill finer-grained latent factors from the entangled self-supervision signals but also alleviate the augmentation-induced noise. Finally, the cross-view contrastive learning task is introduced to enable adaptive augmentation with our parameterized interaction mask generator. Experiments on various public datasets demonstrate the superiority of our method compared to existing solutions. Our model implementation is released at the link https://github.com/HKUDS/DCCF.Comment: Published as a SIGIR'23 full pape

    Recommendations with Negative Feedback via Pairwise Deep Reinforcement Learning

    Full text link
    Recommender systems play a crucial role in mitigating the problem of information overload by suggesting users' personalized items or services. The vast majority of traditional recommender systems consider the recommendation procedure as a static process and make recommendations following a fixed strategy. In this paper, we propose a novel recommender system with the capability of continuously improving its strategies during the interactions with users. We model the sequential interactions between users and a recommender system as a Markov Decision Process (MDP) and leverage Reinforcement Learning (RL) to automatically learn the optimal strategies via recommending trial-and-error items and receiving reinforcements of these items from users' feedback. Users' feedback can be positive and negative and both types of feedback have great potentials to boost recommendations. However, the number of negative feedback is much larger than that of positive one; thus incorporating them simultaneously is challenging since positive feedback could be buried by negative one. In this paper, we develop a novel approach to incorporate them into the proposed deep recommender system (DEERS) framework. The experimental results based on real-world e-commerce data demonstrate the effectiveness of the proposed framework. Further experiments have been conducted to understand the importance of both positive and negative feedback in recommendations.Comment: arXiv admin note: substantial text overlap with arXiv:1801.0020

    Nanoporous platinum electrode grown on anodic aluminum oxide membrane: Fabrication, characterization, electrocatalytic activity toward reactive oxygen and nitrogen species

    Get PDF
    A new type of nanoelectrode, nanoporous platinum (NPt) electrode was prepared on aluminum oxide membrane by thermal evaporation deposition. The morphology, conductivity and electrocatalytic activity of NPt electrode were characterized and compared with those of nanofilm-Pt electrode through scanning electron microscopy (SEM), electrochemical impedance spectroscopy (EIS) and cyclic voltammetry (CV) techniques, respectively. SEM images showed that "nanocavities" observed in NPt electrode were actually 2-dimensional enclosures by linked nanoparticles. It was different from the conventional arrays of "nanocavities" formed on homogeneous metal films. EIS data indicated that NPt electrode possesses higher conductivity. Compared with that on nanofilm-Pt electrode (14.05 Omega.cm(2)), the impedance spectrum on NPt electrode exhibits a semicircle portion with much smaller diameters (1.24 Omega.cm(2) for NPt-100, 1.48 Omega.cm(2) for NPt-200). Meanwhile, the response sensitivity of NPt electrode to O-2 is 0.85 mA cm(-2), which is larger than that of nanofilm-Pt electrode (0.54 mA cm(-2)). The largest catalytic current for nitric oxide (NO) was obtained in buffer with pH value of 9.4 while for Angeli's salt (AS) was obtained in buffer with pH value of 5.4. Additionally, electrocatalytic mechanisms of NPt electrode toward NO and AS were proposed, which indicating it depended on pH value of buffer solution. (C) 2018 Elsevier B.V. All rights reserved
    • …
    corecore