16,047 research outputs found

Recommended from our members

A Boolean complete neural model of adaptive behavior

Author: Hampson Steve
Kibler Dennis
Publication venue: eScholarship, University of California
Publication date: 01/01/1982

A multi-layered neural assembly is developed which has the capability of learning arbitrary Boolean functions. Though the model neuron is more powerful than those previously considered, assemblies of neurons are needed to detect non-linearly separable patterns. Algorithms for learning at the neuron and assembly level are described. The model permits multiple output systems to share a common memory. Learned evaluation allows sequences of actions to be organized. Computer simulations demonstrate the capabilities of the model.

eScholarship - University of California
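
The Hampson and Kibler abstract above notes that assemblies of neurons are needed for non-linearly separable patterns. A minimal sketch of why, using XOR as the standard example: no single linear threshold unit computes XOR, but a small two-layer assembly does. The unit here is a plain linear threshold unit with hand-set weights, not the more powerful model neuron or the learning algorithms described in the paper; the names `threshold_unit` and `xor_assembly` are hypothetical.

```python
def threshold_unit(weights, bias, inputs):
    """Plain linear threshold unit (an assumption; not the paper's neuron model)."""
    activation = sum(w * x for w, x in zip(weights, inputs)) + bias
    return int(activation > 0)


def xor_assembly(x1, x2):
    """Two-layer assembly with hand-set weights that computes XOR."""
    h_or = threshold_unit((1, 1), -0.5, (x1, x2))    # fires if either input is on
    h_and = threshold_unit((1, 1), -1.5, (x1, x2))   # fires only if both are on
    # Output fires when OR is on and AND is off.
    return threshold_unit((1, -1), -0.5, (h_or, h_and))


if __name__ == "__main__":
    for a in (0, 1):
        for b in (0, 1):
            print(a, b, xor_assembly(a, b))
```

The hidden units split the input space with two separate lines, and the output unit combines them, which is the sense in which an assembly can detect a pattern that one unit cannot.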

Neural network based architectures for aerospace applications

Author: Ricart Richard

A brief history of the field of neural networks research is given and some simple concepts are described. In addition, some neural network based avionics research and development programs are reviewed. The need for the United States Air Force and NASA to assume a leadership role in supporting this technology is stressed.

NASA Technical Reports Server

MULTIAGENT LEARNING FOR BLACK BOX SYSTEM REWARD FUNCTIONS

Author: Bagnell J. A.
Bilimoria K. D.
Dietterich T. G.
Jefferies P.
Jennings N. R.
McGlohon M.
Parkes D.
Stone P.
Tuyls K.
Whiteson S.
Publication venue: 'World Scientific Pub Co Pte Lt'

Crossref

Quicker Q-Learning in Multi-Agent Systems

Author: Agogino Adrian K.
Tumer Kagan

Multi-agent learning in Markov Decision Problems is challenging because of the presence of two credit assignment problems: 1) How to credit an action taken at time step t for rewards received at t' greater than t; and 2) How to credit an action taken by agent i considering the system reward is a function of the actions of all the agents. The first credit assignment problem is typically addressed with temporal difference methods such as Q-learning or TD(lambda). The second credit assignment problem is typically addressed either by hand-crafting reward functions that assign proper credit to an agent, or by making certain independence assumptions about an agent's state-space and reward function. To address both credit assignment problems simultaneously, we propose Q Updates with Immediate Counterfactual Rewards-learning (QUICR-learning), designed to improve both the convergence properties and performance of Q-learning in large multi-agent problems. Instead of assuming that an agent's value function can be made independent of other agents, this method suppresses the impact of other agents using counterfactual rewards. Results on multi-agent grid-world problems over multiple topologies show that QUICR-learning can achieve up to thirty-fold improvements in performance over both conventional and local Q-learning in the largest tested systems.

NASA Technical Reports Server
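
The QUICR-learning abstract above combines a temporal difference update with counterfactual rewards. A rough sketch of that idea follows, under stated assumptions: the counterfactual reward is taken to be the system reward minus the system reward with the agent moved to an absorbing state, and that value is fed into a standard tabular Q-learning update. The abstract does not give the exact update rule, so this is an illustration rather than the authors' method; `counterfactual_reward`, `q_update`, `system_reward`, and the token grid are all hypothetical.

```python
from collections import defaultdict

TOKENS = {(0, 0), (1, 1)}  # hypothetical grid cells that yield reward


def system_reward(joint_state):
    """Toy system reward: number of distinct token cells occupied by any agent."""
    return len({pos for pos in joint_state.values() if pos in TOKENS})


def counterfactual_reward(reward_fn, joint_state, agent, absorbing_state=None):
    """System reward minus the reward with `agent` removed (assumed form)."""
    without_agent = dict(joint_state)
    without_agent[agent] = absorbing_state
    return reward_fn(joint_state) - reward_fn(without_agent)


def q_update(q, state, action, reward, next_state, actions, alpha=0.1, gamma=0.9):
    """Standard tabular Q-learning step, driven by whatever reward is passed in."""
    best_next = max(q[(next_state, a)] for a in actions)
    q[(state, action)] += alpha * (reward + gamma * best_next - q[(state, action)])
    return q[(state, action)]


if __name__ == "__main__":
    q = defaultdict(float)
    joint = {"a": (0, 0), "b": (1, 1)}
    r = counterfactual_reward(system_reward, joint, "a")
    print(q_update(q, (0, 1), "left", r, (0, 0), ["left", "right"], alpha=0.5))
```

In this toy setup, two agents standing on the same token cell each receive a counterfactual reward of zero, because removing either one leaves the system reward unchanged; that is the sense in which the signal suppresses the impact of other agents.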

Survey of Recent Multi-Agent Reinforcement Learning Algorithms Utilizing Centralized Training

Author: Asher Derrik E.
Basak Anjon
Dorothy Michael
Fernandez Rolando
Sharma Piyush K.
Zaroukian Erin
Publication date: 29/07/2021

Much work has been dedicated to the exploration of Multi-Agent Reinforcement Learning (MARL) paradigms implementing a centralized learning with decentralized execution (CLDE) approach to achieve human-like collaboration in cooperative tasks. Here, we discuss variations of centralized training and describe a recent survey of algorithmic approaches. The goal is to explore how different implementations of information-sharing mechanisms in centralized learning may give rise to distinct group coordinated behaviors in multi-agent systems performing cooperative tasks.

Comment: This article appeared in the news at: https://www.army.mil/article/247261/army_researchers_develop_innovative_framework_for_training_a

arXiv.org e-Print Archive