A Parameterisation of Algorithms for Distributed Constraint Optimisation via Potential Games
This paper introduces a parameterisation of learning algorithms for distributed constraint optimisation problems (DCOPs). This parameterisation encompasses many algorithms developed in both the computer science and game theory literatures. It is built on our insight that, when formulated as noncooperative games, DCOPs form a subset of the class of potential games. This result allows us to prove convergence properties of algorithms developed in the computer science literature using game-theoretic methods. Furthermore, our parameterisation can assist system designers by making clear the pros and cons of the various DCOP algorithm components and the synergies between them.
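The potential-game view in this abstract can be illustrated with a minimal sketch. It is not the paper's parameterisation: it assumes a graph-colouring DCOP in which each agent's utility is the negative number of neighbours sharing its value, and it uses plain sequential best response. The function names (`local_utility`, `potential`, `best_response_dynamics`) are hypothetical. Because a unilateral change alters an agent's utility and the global potential by the same amount, every strictly improving move strictly raises the potential, so the dynamics stop at a Nash equilibrium.

```python
import random

def local_utility(agent, value, assignment, neighbours):
    """Assumed utility: negative count of neighbours that share `value`."""
    return -sum(1 for n in neighbours[agent] if assignment[n] == value)

def potential(assignment, neighbours):
    """Negative number of conflicting edges, each edge counted once."""
    return -sum(1 for a in neighbours for n in neighbours[a]
                if a < n and assignment[a] == assignment[n])

def best_response_dynamics(neighbours, domain, seed=0, max_rounds=100):
    """Agents take turns switching to a strictly better value, if one exists."""
    rng = random.Random(seed)
    assignment = {a: rng.choice(domain) for a in neighbours}
    history = [potential(assignment, neighbours)]
    for _ in range(max_rounds):
        changed = False
        for agent in sorted(neighbours):
            current = local_utility(agent, assignment[agent], assignment, neighbours)
            best = max(domain,
                       key=lambda v: local_utility(agent, v, assignment, neighbours))
            if local_utility(agent, best, assignment, neighbours) > current:
                assignment[agent] = best
                changed = True
                # Record the potential after every improving move.
                history.append(potential(assignment, neighbours))
        if not changed:
            break  # no agent can improve: a Nash equilibrium
    return assignment, history

# Usage: four agents on a ring, two colours.
ring = {0: [1, 3], 1: [0, 2], 2: [1, 3], 3: [0, 2]}
final_assignment, potential_history = best_response_dynamics(ring, [0, 1])
```

The equilibrium reached need not be a global optimum of the constraint problem; the sketch only shows that the potential increases along the path.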
An agent-driven semantical identifier using radial basis neural networks and reinforcement learning
Due to the wide availability of documents in digital form, and the increased
possibility of deception arising from the nature of digital documents and the way they
are spread, the authorship attribution problem has steadily grown in
relevance. Nowadays, authorship attribution, for both information retrieval and
analysis, has gained great importance in the context of security, trust and
copyright preservation. This work proposes an innovative multi-agent driven
machine learning technique that has been developed for authorship attribution.
By means of preprocessing for word grouping and time-period-related analysis
of the common lexicon, we determine a bias reference level for the recurrence
frequency of the words within analysed texts, and then train a Radial Basis
Neural Networks (RBPNN)-based classifier to identify the correct author. The
main advantage of the proposed approach lies in the generality of the semantic
analysis, which can be applied to different contexts and lexical domains,
without requiring any modification. Moreover, the proposed system is able to
incorporate an external input, meant to tune the classifier, and then
self-adjust by means of continuous learning reinforcement.
Comment: Published in: Proceedings of the XV Workshop "Dagli Oggetti agli
Agenti" (WOA 2014), Catania, Italy, September 25-26, 2014
Probably Approximately Correct Nash Equilibrium Learning
We consider a multi-agent noncooperative game with agents' objective
functions being affected by uncertainty. Following a data driven paradigm, we
represent uncertainty by means of scenarios and seek a robust Nash equilibrium
solution. We treat the Nash equilibrium computation problem within the realm of
probably approximately correct (PAC) learning. Building upon recent
developments in scenario-based optimization, we accompany the computed Nash
equilibrium with a priori and a posteriori probabilistic robustness
certificates, providing confidence that the computed equilibrium remains
unaffected (in probabilistic terms) when a new uncertainty realization is
encountered. For a wide class of games, we also show that the computation of
the so-called compression set - a key concept in scenario-based optimization -
can be directly obtained as a byproduct of the proposed solution methodology.
Finally, we illustrate how to overcome differentiability issues, arising due to
the introduction of scenarios, and compute a Nash equilibrium solution in a
decentralized manner. We demonstrate the efficacy of the proposed approach on
an electric vehicle charging control problem.
Comment: Preprint submitted to IEEE Transactions on Automatic Control
Probabilistic and Distributed Control of a Large-Scale Swarm of Autonomous Agents
We present a novel method for guiding a large-scale swarm of autonomous
agents into a desired formation shape in a distributed and scalable manner. Our
Probabilistic Swarm Guidance using Inhomogeneous Markov Chains (PSG-IMC)
algorithm adopts an Eulerian framework, where the physical space is partitioned
into bins and the swarm's density distribution over each bin is controlled.
Each agent determines its bin transition probabilities using a
time-inhomogeneous Markov chain. These time-varying Markov matrices are
constructed by each agent in real-time using the feedback from the current
swarm distribution, which is estimated in a distributed manner. The PSG-IMC
algorithm minimizes the expected cost of the transitions per time instant,
required to achieve and maintain the desired formation shape, even when agents
are added to or removed from the swarm. The algorithm scales well with a large
number of agents and complex formation shapes, and can also be adapted for area
exploration applications. We demonstrate the effectiveness of this proposed
swarm guidance algorithm by using results of numerical simulations and hardware
experiments with multiple quadrotors.
Comment: Submitted to IEEE Transactions on Robotics
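The bin-based (Eulerian) idea in this abstract can be sketched in a much simplified form. The sketch below is not the PSG-IMC algorithm: it assumes a fixed, time-homogeneous Markov matrix built with the standard Metropolis-Hastings rule, with no feedback from the current swarm distribution. The names `metropolis_matrix` and `step`, the four-bin line, and the target density are all hypothetical. It shows only that agents hopping independently between adjacent bins with suitable probabilities drive the swarm density towards a desired shape.

```python
def metropolis_matrix(target, adjacency):
    """Row-stochastic matrix whose stationary distribution is `target`.

    Proposal: uniform over adjacent bins. Acceptance: Metropolis-Hastings.
    Assumes every target entry is positive and adjacency is symmetric.
    """
    n = len(target)
    P = [[0.0] * n for _ in range(n)]
    for i in range(n):
        deg_i = len(adjacency[i])
        for j in adjacency[i]:
            deg_j = len(adjacency[j])
            accept = min(1.0, (target[j] * deg_i) / (target[i] * deg_j))
            P[i][j] = accept / deg_i
        # Remaining probability mass: the agent stays in its bin.
        P[i][i] = 1.0 - sum(P[i])
    return P

def step(density, P):
    """Propagate the swarm density over bins by one transition."""
    n = len(density)
    return [sum(density[i] * P[i][j] for i in range(n)) for j in range(n)]

# Usage: four bins on a line, all agents starting in bin 0.
adjacency = {0: [1], 1: [0, 2], 2: [1, 3], 3: [2]}
target = [0.1, 0.2, 0.3, 0.4]
P = metropolis_matrix(target, adjacency)
density = [1.0, 0.0, 0.0, 0.0]
for _ in range(500):
    density = step(density, P)
```

In this simplified setting each agent would sample its next bin from the row of `P` for its current bin; the time-varying, feedback-driven matrices described in the abstract are beyond what this sketch covers.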
Coverage and Field Estimation on Bounded Domains by Diffusive Swarms
In this paper, we consider stochastic coverage of bounded domains by a
diffusing swarm of robots that take local measurements of an underlying scalar
field. We introduce three control methodologies with diffusion, advection, and
reaction as independent control inputs. We analyze the diffusion-based control
strategy using standard operator semigroup-theoretic arguments. We show that
the diffusion coefficient can be chosen to be dependent only on the robots'
local measurements to ensure that the swarm density converges to a function
proportional to the scalar field. The boundedness of the domain precludes the
need to impose assumptions on decaying properties of the scalar field at
infinity. Moreover, exponential convergence of the swarm density to the
equilibrium follows from properties of the spectrum of the semigroup generator.
In addition, we use the proposed coverage method to construct a
time-inhomogeneous diffusion process and apply the observability of the heat
equation to reconstruct the scalar field over the entire domain from
observations of the robots' random motion over a small subset of the domain. We
verify our results through simulations of the coverage scenario on a 2D domain
and the field estimation scenario on a 1D domain.
Comment: To appear in the proceedings of the 55th IEEE Conference on Decision
and Control (CDC 2016)