Search CORE

23,298 research outputs found

Deep Reinforcement Learning with Surrogate Agent-Environment Interface

Author: Jing Yu
Wang Song
Publication venue
Publication date: 10/11/2017
Field of study

In this paper, we propose surrogate agent-environment interface (SAEI) in reinforcement learning. We also state that learning based on probability surrogate agent-environment interface provides optimal policy of task agent-environment interface. We introduce surrogate probability action and develop the probability surrogate action deterministic policy gradient (PSADPG) algorithm based on SAEI. This algorithm enables continuous control of discrete action. The experiments show PSADPG achieves the performance of DQN in certain tasks with the stochastic optimal policy nature in the initial training stage

arXiv.org e-Print Archive

Does Amati Relation Depend on Luminosity of GRB's Host Galaxies?

Author: Deng Jing-song
Qiu Yu-lei
Wang Jing
Publication venue: 'IOP Publishing'
Publication date: 28/10/2007
Field of study

In order to test systematic of the Amati relation, the 24 long-duration GRBs with firmly determined

E_{\gamma,\mathrm{iso}}

and

E_{\mathrm p}

are separated into two sub-groups according to B-band luminosity of their host galaxies. The Amati relations in the two subgroups are found to be in agreement with each other within uncertainties. Taking into account of the well established luminosity - metallicity relation of galaxies, no strong evolution of the Amati relation with GRB's environment metallicity is implied in this study.Comment: 7 pages, 3 figures and 1 table, accepted by ChJA

arXiv.org e-Print Archive

Electrically Induced Photonic and Acoustic Quantum Effect From Liquid Metal Droplets in Aqueous Solution

Author: Liu Jing
Wang Qian
Yu Yang
Publication venue
Publication date: 08/05/2018
Field of study

So far, several macroscopic quantum phenomena have been discovered in the Josephson junction. Through introducing such a structure with a liquid membrane sandwiched between two liquid metal electrodes, we had ever observed a lighting and sound phenomenon which was explained before as discharge plasma. In fact, such an effect also belongs to a quantum process. It is based on this conceiving, we proposed here that an electrically controllable method can thus be established to generate and manipulate as much photonic quantum as desired. We attributed such electrically induced lighting among liquid metal droplets immersed inside aqueous solution as photonic quantum effect. Our experiments clarified that a small electrical voltage would be strong enough to trigger blue-violet light and sound inside the aqueous solution system. Meanwhile, thermal heat is released, and chemical reaction occurs over the solution. From an alternative way which differs from former effort in interpreting such effect as discharge plasma, we treated this process as a quantum one and derived new conceptual equations to theoretically quantify this phenomenon in light of quantum mechanics principle. It can be anticipated that given specific designing, such spontaneously generated tremendous quantum can be manipulated to entangle together which would possibly help mold functional elements for developing future quantum computing or communication system. With superior adaptability than that of the conventional rigid junction, the present electro-photonic quantum generation system made of liquid metal droplets structure could work in solution, room temperature situation and is easy to be adjusted. It suggests a macroscopic way to innovate the classical strategies and technologies in generating quantum as frequently adopted in classical quantum engineering area.Comment: 13 pages, 6 figure

arXiv.org e-Print Archive

Sparse Coding and Counting for Robust Visual Tracking

Author: Cai Yu
Liu Risheng
Su Zhixun
Wang Jing
Wang Yiyang
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 28/05/2016
Field of study

In this paper, we propose a novel sparse coding and counting method under Bayesian framwork for visual tracking. In contrast to existing methods, the proposed method employs the combination of L0 and L1 norm to regularize the linear coefficients of incrementally updated linear basis. The sparsity constraint enables the tracker to effectively handle difficult challenges, such as occlusion or image corruption. To achieve realtime processing, we propose a fast and efficient numerical algorithm for solving the proposed model. Although it is an NP-hard problem, the proposed accelerated proximal gradient (APG) approach is guaranteed to converge to a solution quickly. Besides, we provide a closed solution of combining L0 and L1 regularized representation to obtain better sparsity. Experimental results on challenging video sequences demonstrate that the proposed method achieves state-of-the-art results both in accuracy and speed

arXiv.org e-Print Archive

Learning with Differential Privacy: Stability, Learnability and the Sufficiency and Necessity of ERM Principle

Author: Fienberg Stephen E.
Lei Jing
Wang Yu-Xiang
Publication venue
Publication date: 27/04/2016
Field of study

While machine learning has proven to be a powerful data-driven solution to many real-life problems, its use in sensitive domains has been limited due to privacy concerns. A popular approach known as **differential privacy** offers provable privacy guarantees, but it is often observed in practice that it could substantially hamper learning accuracy. In this paper we study the learnability (whether a problem can be learned by any algorithm) under Vapnik's general learning setting with differential privacy constraint, and reveal some intricate relationships between privacy, stability and learnability. In particular, we show that a problem is privately learnable **if an only if** there is a private algorithm that asymptotically minimizes the empirical risk (AERM). In contrast, for non-private learning AERM alone is not sufficient for learnability. This result suggests that when searching for private learning algorithms, we can restrict the search to algorithms that are AERM. In light of this, we propose a conceptual procedure that always finds a universally consistent algorithm whenever the problem is learnable under privacy constraint. We also propose a generic and practical algorithm and show that under very general conditions it privately learns a wide class of learning problems. Lastly, we extend some of the results to the more practical

(\epsilon,\delta)

-differential privacy and establish the existence of a phase-transition on the class of problems that are approximately privately learnable with respect to how small

\delta

needs to be.Comment: to appear, Journal of Machine Learning Research, 201

arXiv.org e-Print Archive

Influence of the Nucleon Hard Partons Distribution on J/\Psi Suppression in a GMC Framework

Author: Hou Zhao-Yu
Sun Xian-Jing
Wang Hongmin
Publication venue: 'IOP Publishing'
Publication date: 23/05/2013
Field of study

In a Glauber Monte Carlo framework, taking account of the transverse spatial distribution of hard partons in the nucleon, we analyse the nuclear modification factor

R_{dAu}

for

J/\psi

in d+Au collisions with the EPS09 shadowing parametrization. After the influence of nucleon hard partons distribution is considered, a clearly upward correction is revealed for the dependence of

R_{dAu}

N_{coll}

in peripheral d+Au collisions, however, an unconspicuous correction is shown for the results versus

p_{T}

. The theoretical results are in good agreement with the experimental data from PHENIX.Comment: 7pages,2figure

arXiv.org e-Print Archive

Hadron Multiplicities in p+p and p+Pb Collisions

Author: Hou Zhao-Yu
Sun Xian-Jing
Wang Hongmin
Publication venue
Publication date: 26/01/2014
Field of study

Experiments at the Large Hadron Collider (LHC) have measured multiplicity distributions in p+p and p+Pb collisions at a new domain of collision energy. Based on considering an energy-dependent broadening of the nucleon's density distribution, charged hadron multiplicities are studied with the phenomenological saturation model and the evolution equation dependent saturation model. By assuming the saturation scale have a small dependence on the 3-dimensional root mean square (rms) radius at different energy, the theoretical results are in good agreement with the experimental data from CMS and ALICE collaboration. Then, the predictive results in p+p collisions at

\sqrt{s}=

14 TeV of the LHC are also given

arXiv.org e-Print Archive

Energy Dependent Growth of Nucleon and Inclusive Charged Hadron Distributions

Author: Hou Zhao-Yu
Sun Xian-Jing
Wang Hongmin
Publication venue: 'IOP Publishing'
Publication date: 06/01/2015
Field of study

In the Color Glass Condensate formalism, charged hadron p_{T} distributions in p+p collisions are studied by considering an energy-dependent broadening of nucleon's density distribution. Then, in the Glasma flux tube picture, the n-particle multiplicity distributions at different pseudo-rapidity ranges are investigated. Both of the theoretical results show good agreement with the recent experimental data from ALICE and CMS at \sqrt{s}=0.9, 2.36, 7 TeV. The predictive results for p_{T} and multiplicity distributions in p+p and p+Pb collisions at the Large Hadron Collider are also given in this paper.Comment: 11 pages, 4 figure

arXiv.org e-Print Archive

A Co-Matching Model for Multi-choice Reading Comprehension

Author: Chang Shiyu
Jiang Jing
Wang Shuohang
Yu Mo
Publication venue
Publication date: 11/06/2018
Field of study

Multi-choice reading comprehension is a challenging task, which involves the matching between a passage and a question-answer pair. This paper proposes a new co-matching approach to this problem, which jointly models whether a passage can match both a question and a candidate answer. Experimental results on the RACE dataset demonstrate that our approach achieves state-of-the-art performance.Comment: 6, accepted ACL 201

arXiv.org e-Print Archive

On-Average KL-Privacy and its equivalence to Generalization for Max-Entropy Mechanisms

Author: Fienberg Stephen E.
Lei Jing
Wang Yu-Xiang
Publication venue
Publication date: 08/05/2016
Field of study

We define On-Average KL-Privacy and present its properties and connections to differential privacy, generalization and information-theoretic quantities including max-information and mutual information. The new definition significantly weakens differential privacy, while preserving its minimalistic design features such as composition over small group and multiple queries as well as closeness to post-processing. Moreover, we show that On-Average KL-Privacy is **equivalent** to generalization for a large class of commonly-used tools in statistics and machine learning that samples from Gibbs distributions---a class of distributions that arises naturally from the maximum entropy principle. In addition, a byproduct of our analysis yields a lower bound for generalization error in terms of mutual information which reveals an interesting interplay with known upper bounds that use the same quantity

arXiv.org e-Print Archive