Search CORE

398 research outputs found

A Study of AI Population Dynamics with Million-agent Reinforcement Learning

Author: Bai Yiwei
Wang Jun
Wen Ying
Yang Yaodong
Yu Lantao
Yu Yong
Zhang Weinan
Publication venue
Publication date: 14/05/2018
Field of study

We conduct an empirical study on discovering the ordered collective dynamics obtained by a population of intelligence agents, driven by million-agent reinforcement learning. Our intention is to put intelligent agents into a simulated natural context and verify if the principles developed in the real world could also be used in understanding an artificially-created intelligent population. To achieve this, we simulate a large-scale predator-prey world, where the laws of the world are designed by only the findings or logical equivalence that have been discovered in nature. We endow the agents with the intelligence based on deep reinforcement learning (DRL). In order to scale the population size up to millions agents, a large-scale DRL training platform with redesigned experience buffer is proposed. Our results show that the population dynamics of AI agents, driven only by each agent's individual self-interest, reveals an ordered pattern that is similar to the Lotka-Volterra model studied in population biology. We further discover the emergent behaviors of collective adaptations in studying how the agents' grouping behaviors will change with the environmental resources. Both of the two findings could be explained by the self-organization theory in nature.Comment: Full version of the paper presented at AAMAS 2018 (International Conference on Autonomous Agents and Multiagent Systems

arXiv.org e-Print Archive

UCL Discovery

An embedded improved soil berm in an excavation - mechanisms and capacity

Author: ZHANG YAODONG
Publication venue
Publication date: 19/11/2005
Field of study

Ph.DDOCTOR OF PHILOSOPH

ScholarBank@NUS

Measuring Value Understanding in Language Models through Discriminator-Critique Gap

Author: Bai Fengshuo
Gao Jun
Yang Yaodong
Zhang Zhaowei
Publication venue
Publication date: 18/10/2023
Field of study

Recent advancements in Large Language Models (LLMs) have heightened concerns about their potential misalignment with human values. However, evaluating their grasp of these values is complex due to their intricate and adaptable nature. We argue that truly understanding values in LLMs requires considering both "know what" and "know why". To this end, we present the Value Understanding Measurement (VUM) framework that quantitatively assesses both "know what" and "know why" by measuring the discriminator-critique gap related to human values. Using the Schwartz Value Survey, we specify our evaluation values and develop a thousand-level dialogue dataset with GPT-4. Our assessment looks at both the value alignment of LLM's outputs compared to baseline answers and how LLM responses align with reasons for value recognition versus GPT-4's annotations. We evaluate five representative LLMs and provide strong evidence that the scaling law significantly impacts "know what" but not much on "know why", which has consistently maintained a high level. This may further suggest that LLMs might craft plausible explanations based on the provided context without truly understanding their inherent value, indicating potential risks

arXiv.org e-Print Archive

Optimizing Gear Shifting Strategy for Off-Road Vehicle with Dynamic Programming

Author: Wenming Zhang
Xinxin Zhao
Yali Feng
Yaodong Yang
Publication venue: 'Hindawi Limited'
Publication date: 01/01/2014
Field of study

Gear shifting strategy of vehicle is important aid for the acquisition of dynamic performance and high economy. A dynamic programming (DP) algorithm is used to optimize the gear shifting schedule for off-road vehicle by using an objective function that weighs fuel use and trip time. The optimization is accomplished through discrete dynamic programming and a trade-off between trip time and fuel consumption is analyzed. By using concave and convex surface road as road profile, an optimal gear shifting strategy is used to control the longitudinal behavior of the vehicle. Simulation results show that the trip time can be reduced by powerful gear shifting strategy and fuel consumption can achieve high economy with economical gear shifting strategy in different initial conditions and route cases

Crossref

Directory of Open Access Journals

Interference-aware coordinated power allocation in autonomous Wi-Fi environment

Author: Han Zhu
Jiang Chunxiao
Yu Shui
Yuan Jian
Zhang Yaodong
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2016
Field of study

Self-managed access points (APs) with growing intelligence can optimize their own performances but pose potential negative impacts on others without energy ef ciency. In this paper, we focus on modeling the coordinated interaction among interest-independent and self-con gured APs, and conduct the power allocation case study in the autonomous Wi-Fi scenario. Speci cally, we build a `coordination Wi-Fi platform (CWP), a public platform for APs interacting with each other. OpenWrt-based APs in the physical world are mapped to virtual agents (VAs) in CWP, which communicate with each other through a standard request-reply process de ned as AP talk protocol (ATP).With ATP, an active interference measurement methodology is proposed re ecting both in-range interference and hidden terminal interference, and the Nash bargaining-based power control is further formulated for interference reductions. CWP is deployed in a real of ce environment, where coordination interactions between VAs can bring a maximum 40-Mb/s throughput improvement with the Nash bargaining-based power control in the multi-AP experiments

Deakin Research Online

University of Houston Institutional Repository (UHIR)

Photomolecular Effect: Visible Light Interaction with Air-Water Interface

Author: Chen Gang
Lv Guangxin
Tu Yaodong
Zhang James H.
Publication venue
Publication date: 28/10/2023
Field of study

Although water is almost transparent to visible light, we demonstrate that the air-water interface interacts strongly with visible light via what we hypothesize as the photomolecular effect. In this effect, transverse-magnetic polarized photons cleave off water clusters from the air-water interface. We use over 10 different experiments to demonstrate the existence of this effect and its dependence on the wavelength, incident angle and polarization of visible light. We further demonstrate that visible light heats up thin fogs, suggesting that this process can impact weather, climate, and the earth's water cycle. Our study suggests that the photomolecular effect should happen widely in nature, from clouds to fogs, ocean to soil surfaces, and plant transpiration, and can also lead to new applications in energy and clear water

arXiv.org e-Print Archive

Research on Influence Factors of Crowdfunding

Author: YAO Huili
ZHANG Yaodong
Publication venue: Canadian Research & Development Center of Sciences and Cultures
Publication date: 30/11/2014
Field of study

Crowdfunding - as a booming innovative internet-based financial model is one of the solution to SMEs and entrepreneurs to develop new products in a difficult financing situation. Factors were extracted by studying the process of crowdfunding, combined with relevant literature. Applying the method of optimal scaling regression, this paper researched into the influential factors affecting crowdfunding project financing, based on a survey about 314 projects funded in crowdfunding website in 2013. It was found out that the economic, customer participation, trust, information quality and social network have positive effects on crowdfunding project financing, customer participation making the most influence. Corresponding conclusions and suggestions were put forward to help financing individuals or groups to improve their performance in crowdfunding. Key words: Crowdfunding; Financing; Optimal scaling; Influence factors; Customer participatio

CSCanada.net: E-Journals (Canadian Academy of Oriental and Occidental Culture, Canadian Research & Development Center of Sciences and Cultures)