3,854 research outputs found

    A Study of AI Population Dynamics with Million-agent Reinforcement Learning

    Get PDF
    We conduct an empirical study on discovering the ordered collective dynamics obtained by a population of intelligence agents, driven by million-agent reinforcement learning. Our intention is to put intelligent agents into a simulated natural context and verify if the principles developed in the real world could also be used in understanding an artificially-created intelligent population. To achieve this, we simulate a large-scale predator-prey world, where the laws of the world are designed by only the findings or logical equivalence that have been discovered in nature. We endow the agents with the intelligence based on deep reinforcement learning (DRL). In order to scale the population size up to millions agents, a large-scale DRL training platform with redesigned experience buffer is proposed. Our results show that the population dynamics of AI agents, driven only by each agent's individual self-interest, reveals an ordered pattern that is similar to the Lotka-Volterra model studied in population biology. We further discover the emergent behaviors of collective adaptations in studying how the agents' grouping behaviors will change with the environmental resources. Both of the two findings could be explained by the self-organization theory in nature.Comment: Full version of the paper presented at AAMAS 2018 (International Conference on Autonomous Agents and Multiagent Systems

    Measuring Value Understanding in Language Models through Discriminator-Critique Gap

    Full text link
    Recent advancements in Large Language Models (LLMs) have heightened concerns about their potential misalignment with human values. However, evaluating their grasp of these values is complex due to their intricate and adaptable nature. We argue that truly understanding values in LLMs requires considering both "know what" and "know why". To this end, we present the Value Understanding Measurement (VUM) framework that quantitatively assesses both "know what" and "know why" by measuring the discriminator-critique gap related to human values. Using the Schwartz Value Survey, we specify our evaluation values and develop a thousand-level dialogue dataset with GPT-4. Our assessment looks at both the value alignment of LLM's outputs compared to baseline answers and how LLM responses align with reasons for value recognition versus GPT-4's annotations. We evaluate five representative LLMs and provide strong evidence that the scaling law significantly impacts "know what" but not much on "know why", which has consistently maintained a high level. This may further suggest that LLMs might craft plausible explanations based on the provided context without truly understanding their inherent value, indicating potential risks

    Predication Method for China’s Civil Aviation Fuel Consumption

    Get PDF
    With the China’s civil aviation industry gradual market-oriented and the rapid development of China’s economy, China’s civil aviation transportation fuel consumption has grown significantly in nearly past three decades. Therefore, it’s a very important strategic significance of the prediction of China’s civil aviation transportation fuel consumption. In this paper, gray system and neural network approach, combined with China’s civil aviation industry 1980-2010 total traffic volume of the data, we establish gray system GM (1,1) model and BP neural network model for civil aviation transport volume. Training and simulation of the back propagation neutral network model and the gray system GM(1,1) used MATLAB. BP neural network modeling takes into account in three factors: the number of aircraft aviation industry, flight hours and total turnover. The fitting precision of the gray system GM(1,1) model is 64.2% while the fitting precision of the back propagation neutral network model is 90.16%. Thus, the back propagation neutral network model is better for estimating Civil aviation fuel consumption

    ab-plane tilt angles in REBCO conductors

    Full text link
    Critical current (Ic) of REBCO tapes is strongly aniso-tropic with respect to the orientation of the magnetic field. Usually, Ic is at maximum when the ab-plane of the REBCO crystal is parallel to the magnetic field. In commercial REBCO tapes, it is commonly assumed that the ab-plane is coincide with the tape plane. While in fact, the ab-plane is near but slightly tilted from the tape plane in the transverse direction. To accurately measure Ic as a function of the field angle {\theta} , which is defined as the angle between ab-plane and the magnetic field direction, and to design and fabricate REBCO mag-net coils based on the measured Ic(angle), it is important to measure the tilt angle. In this work, we used x-ray diffraction (XRD) to measure the tilt angles at room temperature for a large number of REBCO conductors made by SuperPower Inc. Transmission electron mi-croscopy (TEM) was also used to investigate the origin of this tilt. The measured data are presented, and the measurement uncer-tainty is discussed.Comment: 4 pages, 7 figure
    • …
    corecore