704 research outputs found
A Study of AI Population Dynamics with Million-agent Reinforcement Learning
We conduct an empirical study on discovering the ordered collective dynamics
obtained by a population of intelligence agents, driven by million-agent
reinforcement learning. Our intention is to put intelligent agents into a
simulated natural context and verify if the principles developed in the real
world could also be used in understanding an artificially-created intelligent
population. To achieve this, we simulate a large-scale predator-prey world,
where the laws of the world are designed by only the findings or logical
equivalence that have been discovered in nature. We endow the agents with the
intelligence based on deep reinforcement learning (DRL). In order to scale the
population size up to millions agents, a large-scale DRL training platform with
redesigned experience buffer is proposed. Our results show that the population
dynamics of AI agents, driven only by each agent's individual self-interest,
reveals an ordered pattern that is similar to the Lotka-Volterra model studied
in population biology. We further discover the emergent behaviors of collective
adaptations in studying how the agents' grouping behaviors will change with the
environmental resources. Both of the two findings could be explained by the
self-organization theory in nature.Comment: Full version of the paper presented at AAMAS 2018 (International
Conference on Autonomous Agents and Multiagent Systems
INTUITIVE DECISION THEORY ANALYSIS AND THE EVALUATION MODEL
Intuitive decision-making studies the decision-makerâs decision-making behavior from the perspective of image thinking, which it poses a challenge to the classic decision-making hypothesis pursuing âoptimal decisionâ because the outcomes of intuitive decision-making are difficulty to measure and its process isnât easy to describe and control. Therefore it has not drawn the expertsâ attention. This paper tries to establish an evaluation model of the intuitive decision-making as to giving a direction and inspiration of the quantization of intuitive decision-making, based on the systematic analysis of the existing domestic and international theory of intuitive decision-making. Key words: Intuitive decision-making, Thinking in images, The evaluation mode
Integrable Open Spin Chains from Flavored ABJM Theory
We compute the two-loop anomalous dimension matrix in the scalar sector of
planar flavored ABJM theory. Using coordinate Bethe ansatz, we
obtain the reflection matrix and confirm that the boundary Yang-Baxter
equations are satisfied. This establishes the integrability of this theory in
the scalar sector at the two-loop order.Comment: v2, 25 pages, 2 figures, minor corrections, references adde
Laser Intensity Noise Suppression for Preparing Audio-Frequency 795 nm Squeezed Vacuum State of Light at Rubidium D1 Line
Laser intensity noise suppression has essential effects on preparation and
characterization of the audio-frequency squeezed vacuum state of light based on
a sub-threshold optical parametric oscillator (OPO).We have implemented two
feedback loops by using relevant acousto-optical modulators (AOM) to stabilize
the intensity of 795-nm near infrared (NIR) fundamental laser and 397.5-nm
ultraviolet (UV) laser generated by cavity-enhanced frequency doubling.Typical
peak-to-peak laser intensity fluctuation with a bandwidth of kHz in a
half hour has been improved from to for 795-nm NIR
laser beam, and from to for 397.5-nm UV laser beam,
respectively. The squeezing level of the squeezed vacuum state at 795 nm
prepared by the sub-threshold OPO with a PPKTP crystal has been improved from
-3.3 to -4.0 dB around 39 kHz of audio analysis frequency range.Comment: 5 pages, 4 figure
- âŠ