Search CORE

12,625 research outputs found

Beam energy dependence of strange hadron production from STAR at RHIC

Author: Zhang Xiaoping
Publication venue: 'Elsevier BV'
Publication date: 01/05/2013
Field of study

We present STAR measurements of K^{0}_{S}, \phi, \Lambda, \Xi, and \Omega at mid-rapidity from Au+Au collisions at \sqrt{s_{NN}} = 7.7, 11.5, 19.6, 27, and 39 GeV from the Beam Energy Scan (BES) program at the BNL Relativistic Heavy Ion Collider (RHIC). Nuclear modification factors and baryon-to-meson ratios are measured to understand recombination and parton energy loss mechanisms. Implications on partonic versus hadronic dynamics at low beam energies are discussed.Comment: 4 pages, 2 figures, Quark Matter 2012 proceeding

arXiv.org e-Print Archive

Crossref

Goal-oriented Dialogue Policy Learning from Failures

Author: Chen Xiaoping
Lu Keting
Zhang Shiqi
Publication venue
Publication date: 22/11/2018
Field of study

Reinforcement learning methods have been used for learning dialogue policies. However, learning an effective dialogue policy frequently requires prohibitively many conversations. This is partly because of the sparse rewards in dialogues, and the very few successful dialogues in early learning phase. Hindsight experience replay (HER) enables learning from failures, but the vanilla HER is inapplicable to dialogue learning due to the implicit goals. In this work, we develop two complex HER methods providing different trade-offs between complexity and performance, and, for the first time, enabled HER-based dialogue policy learning. Experiments using a realistic user simulator show that our HER methods perform better than existing experience replay methods (as applied to deep Q-networks) in learning rate

arXiv.org e-Print Archive

Association for the Advancement of Artificial Intelligence: AAAI Publications

A data-driven game theoretic strategy for developers in software crowdsourcing: a case study

Author: Fan Xiaoping
Liao Zhifang
Zeng Zhi
Zhang Yan
Publication venue: 'MDPI AG'
Publication date: 01/02/2019
Field of study

Crowdsourcing has the advantages of being cost-effective and saving time, which is a typical embodiment of collective wisdom and community workers’ collaborative development. However, this development paradigm of software crowdsourcing has not been used widely. A very important reason is that requesters have limited knowledge about crowd workers’ professional skills and qualities. Another reason is that the crowd workers in the competition cannot get the appropriate reward, which affects their motivation. To solve this problem, this paper proposes a method of maximizing reward based on the crowdsourcing ability of workers, they can choose tasks according to their own abilities to obtain appropriate bonuses. Our method includes two steps: Firstly, it puts forward a method to evaluate the crowd workers’ ability, then it analyzes the intensity of competition for tasks at Topcoder.com—an open community crowdsourcing platform—on the basis of the workers’ crowdsourcing ability; secondly, it follows dynamic programming ideas and builds game models under complete information in different cases, offering a strategy of reward maximization for workers by solving a mixed-strategy Nash equilibrium. This paper employs crowdsourcing data from Topcoder.com to carry out experiments. The experimental results show that the distribution of workers’ crowdsourcing ability is uneven, and to some extent it can show the activity degree of crowdsourcing tasks. Meanwhile, according to the strategy of reward maximization, a crowd worker can get the theoretically maximum reward

Multidisciplinary Digital Publishing Institute

Crossref

Directory of Open Access Journals

ResearchOnline@GCU