26 research outputs found

    Electromagnetic emissions from near-horizon region of an extreme Kerr-Taub-Nut black hole

    Full text link
    We have studied electromagnetic line emissions from near-horizon region in the extremal Kerr-Taub-NUT black hole spacetime and then probe the effects of NUT charge on the electromagnetic line emissions. Due to the presence of the NUT charge, the equatorial plane is no more a symmetry plane of the KTN spacetime, which leads to that the dependence of electromagnetic line emission on the NUT charge for the observer in the Southern Hemisphere differs from that in the Northern one. Our result indicate that the electromagnetic line emission in the Kerr-Taub-NUT black hole case is brighter than that in the case of Kerr black hole for the observer in the equatorial plane or in the Southern Hemisphere, but it becomes more faint as the observer's position deviates far from the equatorial plane in the Northern one. Moreover, we also probe effects of redshift factor on electromagnetic emission from near-horizon region in the extremal Kerr-Taub-NUT black hole spacetime.Comment: 18 pages, 7 figure

    Whole-Chain Recommendations

    Full text link
    With the recent prevalence of Reinforcement Learning (RL), there have been tremendous interests in developing RL-based recommender systems. In practical recommendation sessions, users will sequentially access multiple scenarios, such as the entrance pages and the item detail pages, and each scenario has its specific characteristics. However, the majority of existing RL-based recommender systems focus on optimizing one strategy for all scenarios or separately optimizing each strategy, which could lead to sub-optimal overall performance. In this paper, we study the recommendation problem with multiple (consecutive) scenarios, i.e., whole-chain recommendations. We propose a multi-agent RL-based approach (DeepChain), which can capture the sequential correlation among different scenarios and jointly optimize multiple recommendation strategies. To be specific, all recommender agents (RAs) share the same memory of users' historical behaviors, and they work collaboratively to maximize the overall reward of a session. Note that optimizing multiple recommendation strategies jointly faces two challenges in the existing model-free RL model - (i) it requires huge amounts of user behavior data, and (ii) the distribution of reward (users' feedback) are extremely unbalanced. In this paper, we introduce model-based RL techniques to reduce the training data requirement and execute more accurate strategy updates. The experimental results based on a real e-commerce platform demonstrate the effectiveness of the proposed framework.Comment: 29th ACM International Conference on Information and Knowledge Managemen

    Effective metric of spinless binaries with radiation-reaction effect up to fourth Post-Minkowskian order in effective-one-body theory

    Full text link
    By means of the scattering angles, we obtain an effective metric of spinless binaries with radiation-reaction effects up to fourth post-Minkowskian order, which is the foundation of the effective-one-body theory. We note that there are freedoms for the parameters of the effective metric because one equation corresponds to two parameters for each post-Minkowskian order. Accordingly, in order to construct a self-consistent effective-one-body theory in which the Hamiltonian, radiation-reaction forces and waveforms for the ``plus" and ``cross" modes of the gravitational wave should be based on the same physical model, we can fix these freedoms by requiring the null tetrad component of the gravitationally perturbed Weyl tensor Ψ4B\Psi_4^B to be decoupled in the effective spacetime.Comment: 17 page

    Recommendations with Negative Feedback via Pairwise Deep Reinforcement Learning

    Full text link
    Recommender systems play a crucial role in mitigating the problem of information overload by suggesting users' personalized items or services. The vast majority of traditional recommender systems consider the recommendation procedure as a static process and make recommendations following a fixed strategy. In this paper, we propose a novel recommender system with the capability of continuously improving its strategies during the interactions with users. We model the sequential interactions between users and a recommender system as a Markov Decision Process (MDP) and leverage Reinforcement Learning (RL) to automatically learn the optimal strategies via recommending trial-and-error items and receiving reinforcements of these items from users' feedback. Users' feedback can be positive and negative and both types of feedback have great potentials to boost recommendations. However, the number of negative feedback is much larger than that of positive one; thus incorporating them simultaneously is challenging since positive feedback could be buried by negative one. In this paper, we develop a novel approach to incorporate them into the proposed deep recommender system (DEERS) framework. The experimental results based on real-world e-commerce data demonstrate the effectiveness of the proposed framework. Further experiments have been conducted to understand the importance of both positive and negative feedback in recommendations.Comment: arXiv admin note: substantial text overlap with arXiv:1801.0020
    corecore