398 research outputs found
A Study of AI Population Dynamics with Million-agent Reinforcement Learning
We conduct an empirical study on discovering the ordered collective dynamics
obtained by a population of intelligence agents, driven by million-agent
reinforcement learning. Our intention is to put intelligent agents into a
simulated natural context and verify if the principles developed in the real
world could also be used in understanding an artificially-created intelligent
population. To achieve this, we simulate a large-scale predator-prey world,
where the laws of the world are designed by only the findings or logical
equivalence that have been discovered in nature. We endow the agents with the
intelligence based on deep reinforcement learning (DRL). In order to scale the
population size up to millions agents, a large-scale DRL training platform with
redesigned experience buffer is proposed. Our results show that the population
dynamics of AI agents, driven only by each agent's individual self-interest,
reveals an ordered pattern that is similar to the Lotka-Volterra model studied
in population biology. We further discover the emergent behaviors of collective
adaptations in studying how the agents' grouping behaviors will change with the
environmental resources. Both of the two findings could be explained by the
self-organization theory in nature.Comment: Full version of the paper presented at AAMAS 2018 (International
Conference on Autonomous Agents and Multiagent Systems
An embedded improved soil berm in an excavation - mechanisms and capacity
Ph.DDOCTOR OF PHILOSOPH
Measuring Value Understanding in Language Models through Discriminator-Critique Gap
Recent advancements in Large Language Models (LLMs) have heightened concerns
about their potential misalignment with human values. However, evaluating their
grasp of these values is complex due to their intricate and adaptable nature.
We argue that truly understanding values in LLMs requires considering both
"know what" and "know why". To this end, we present the Value Understanding
Measurement (VUM) framework that quantitatively assesses both "know what" and
"know why" by measuring the discriminator-critique gap related to human values.
Using the Schwartz Value Survey, we specify our evaluation values and develop a
thousand-level dialogue dataset with GPT-4. Our assessment looks at both the
value alignment of LLM's outputs compared to baseline answers and how LLM
responses align with reasons for value recognition versus GPT-4's annotations.
We evaluate five representative LLMs and provide strong evidence that the
scaling law significantly impacts "know what" but not much on "know why", which
has consistently maintained a high level. This may further suggest that LLMs
might craft plausible explanations based on the provided context without truly
understanding their inherent value, indicating potential risks
Optimizing Gear Shifting Strategy for Off-Road Vehicle with Dynamic Programming
Gear shifting strategy of vehicle is important aid for the acquisition of dynamic performance and high economy. A dynamic programming (DP) algorithm is used to optimize the gear shifting schedule for off-road vehicle by using an objective function that weighs fuel use and trip time. The optimization is accomplished through discrete dynamic programming and a trade-off between trip time and fuel consumption is analyzed. By using concave and convex surface road as road profile, an optimal gear shifting strategy is used to control the longitudinal behavior of the vehicle. Simulation results show that the trip time can be reduced by powerful gear shifting strategy and fuel consumption can achieve high economy with economical gear shifting strategy in different initial conditions and route cases
Interference-aware coordinated power allocation in autonomous Wi-Fi environment
Self-managed access points (APs) with growing intelligence can optimize their own performances but pose potential negative impacts on others without energy ef ciency. In this paper, we focus on modeling the coordinated interaction among interest-independent and self-con gured APs, and conduct the power allocation case study in the autonomous Wi-Fi scenario. Speci cally, we build a `coordination Wi-Fi platform (CWP), a public platform for APs interacting with each other. OpenWrt-based APs in the physical world are mapped to virtual agents (VAs) in CWP, which communicate with each other through a standard request-reply process de ned as AP talk protocol (ATP).With ATP, an active interference measurement methodology is proposed re ecting both in-range interference and hidden terminal interference, and the Nash bargaining-based power control is further formulated for interference reductions. CWP is deployed in a real of ce environment, where coordination interactions between VAs can bring a maximum 40-Mb/s throughput improvement with the Nash bargaining-based power control in the multi-AP experiments
Photomolecular Effect: Visible Light Interaction with Air-Water Interface
Although water is almost transparent to visible light, we demonstrate that
the air-water interface interacts strongly with visible light via what we
hypothesize as the photomolecular effect. In this effect, transverse-magnetic
polarized photons cleave off water clusters from the air-water interface. We
use over 10 different experiments to demonstrate the existence of this effect
and its dependence on the wavelength, incident angle and polarization of
visible light. We further demonstrate that visible light heats up thin fogs,
suggesting that this process can impact weather, climate, and the earth's water
cycle. Our study suggests that the photomolecular effect should happen widely
in nature, from clouds to fogs, ocean to soil surfaces, and plant
transpiration, and can also lead to new applications in energy and clear water
Research on Influence Factors of Crowdfunding
Crowdfunding - as a booming innovative internet-based financial model is one of the solution to SMEs and entrepreneurs to develop new products in a difficult financing situation. Factors were extracted by studying the process of crowdfunding, combined with relevant literature. Applying the method of optimal scaling regression, this paper researched into the influential factors affecting crowdfunding project financing, based on a survey about 314 projects funded in crowdfunding website in 2013. It was found out that the economic, customer participation, trust, information quality and social network have positive effects on crowdfunding project financing, customer participation making the most influence. Corresponding conclusions and suggestions were put forward to help financing individuals or groups to improve their performance in crowdfunding. Key words: Crowdfunding; Financing; Optimal scaling; Influence factors; Customer participatio
- …