Search CORE

123,178 research outputs found

A multi-agent architecture for dynamic scheduling of steel hot rolling

Author: Cowling P.
Ouelhadj Djamila
Petrovic S.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/10/2003
Field of study

Portsmouth University Research Portal (Pure)

A generic agent-based framework for cooperative search using pattern matching and reinforcement learning

Author: Beullens P.
Martin Simon
Ouelhadj Djamila
Ozcan E.
Publication venue
Publication date: 04/12/2011
Field of study

Portsmouth University Research Portal (Pure)

Global adaptation in networks of selfish components: emergent associative memory at the system scale

Author: Branchtein M. C.
C. L. Buckley
Hinton G. E.
Hinton G. E.
Hopfield J. J.
Kirkpatrick S.
Pavlicev M.
Richard A. Watson
Rob Mills
Publication venue: 'MIT Press - Journals'
Publication date: 01/07/2011
Field of study

In some circumstances complex adaptive systems composed of numerous self-interested agents can self-organise into structures that enhance global adaptation, efficiency or function. However, the general conditions for such an outcome are poorly understood and present a fundamental open question for domains as varied as ecology, sociology, economics, organismic biology and technological infrastructure design. In contrast, sufficient conditions for artificial neural networks to form structures that perform collective computational processes such as associative memory/recall, classification, generalisation and optimisation, are well-understood. Such global functions within a single agent or organism are not wholly surprising since the mechanisms (e.g. Hebbian learning) that create these neural organisations may be selected for this purpose, but agents in a multi-agent system have no obvious reason to adhere to such a structuring protocol or produce such global behaviours when acting from individual self-interest. However, Hebbian learning is actually a very simple and fully-distributed habituation or positive feedback principle. Here we show that when self-interested agents can modify how they are affected by other agents (e.g. when they can influence which other agents they interact with) then, in adapting these inter-agent relationships to maximise their own utility, they will necessarily alter them in a manner homologous with Hebbian learning. Multi-agent systems with adaptable relationships will thereby exhibit the same system-level behaviours as neural networks under Hebbian learning. For example, improved global efficiency in multi-agent systems can be explained by the inherent ability of associative memory to generalise by idealising stored patterns and/or creating new combinations of sub-patterns. Thus distributed multi-agent systems can spontaneously exhibit adaptive global behaviours in the same sense, and by the same mechanism, as the organisational principles familiar in connectionist models of organismic learning

Southampton (e-Prints Soton)

Crossref

Decentralized Cooperative Planning for Automated Vehicles with Hierarchical Monte Carlo Tree Search

Author: Kurzer Karl
Zhou Chenyang
Zöllner J. Marius
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 25/07/2018
Field of study

Today's automated vehicles lack the ability to cooperate implicitly with others. This work presents a Monte Carlo Tree Search (MCTS) based approach for decentralized cooperative planning using macro-actions for automated vehicles in heterogeneous environments. Based on cooperative modeling of other agents and Decoupled-UCT (a variant of MCTS), the algorithm evaluates the state-action-values of each agent in a cooperative and decentralized manner, explicitly modeling the interdependence of actions between traffic participants. Macro-actions allow for temporal extension over multiple time steps and increase the effective search depth requiring fewer iterations to plan over longer horizons. Without predefined policies for macro-actions, the algorithm simultaneously learns policies over and within macro-actions. The proposed method is evaluated under several conflict scenarios, showing that the algorithm can achieve effective cooperative planning with learned macro-actions in heterogeneous environments

arXiv.org e-Print Archive

Crossref