21,690 research outputs found
Markov Decision Processes with Applications in Wireless Sensor Networks: A Survey
Wireless sensor networks (WSNs) consist of autonomous and resource-limited
devices. The devices cooperate to monitor one or more physical phenomena within
an area of interest. WSNs operate as stochastic systems because of randomness
in the monitored environments. For long service time and low maintenance cost,
WSNs require adaptive and robust methods to address data exchange, topology
formulation, resource and power optimization, sensing coverage and object
detection, and security challenges. In these problems, sensor nodes are to make
optimized decisions from a set of accessible strategies to achieve design
goals. This survey reviews numerous applications of the Markov decision process
(MDP) framework, a powerful decision-making tool to develop adaptive algorithms
and protocols for WSNs. Furthermore, various solution methods are discussed and
compared to serve as a guide for using MDPs in WSNs
Renyi Entropy based Target Tracking in Mobile Sensor Networks
This paper proposes an entropy based target tracking approach for mobile sensor networks. The proposed tracking algorithm runs a target state estimation stage and a motion control stage alternatively. A distributed particle filter is developed to estimate the target position in the first stage. This distributed particle filter does not require to transmit the weighted particles from one sensor node to another. Instead, a Gaussian mixture model is formulated to approximate the posterior distribution represented by the weighted particles via an EM algorithm. The EM algorithm is developed in a distributed form to compute the parameters of Gaussian mixture model via local communication, which leads to the distributed implementation of the particle filter. A flocking controller is developed to control the mobile sensor nodes to track the target in the second stage. The flocking control algorithm includes three components. Collision avoidance component is based on the design of a separation potential function. Alignment component is based on a consensus algorithm. Navigation component is based on the minimization of an quadratic Renyi entropy. The quadratic Renyi entropy of Gaussian mixture model has an analytical expression so that its optimization is feasible in mobile sensor networks. The proposed active tracking algorithm is tested in simulation. © 2011 IFAC
Bibliographic Review on Distributed Kalman Filtering
In recent years, a compelling need has arisen to understand the effects of distributed information structures on estimation and filtering. In this paper, a bibliographical review on distributed Kalman filtering (DKF) is provided.\ud
The paper contains a classification of different approaches and methods involved to DKF. The applications of DKF are also discussed and explained separately. A comparison of different approaches is briefly carried out. Focuses on the contemporary research are also addressed with emphasis on the practical applications of the techniques. An exhaustive list of publications, linked directly or indirectly to DKF in the open literature, is compiled to provide an overall picture of different developing aspects of this area
Towards Optimally Decentralized Multi-Robot Collision Avoidance via Deep Reinforcement Learning
Developing a safe and efficient collision avoidance policy for multiple
robots is challenging in the decentralized scenarios where each robot generate
its paths without observing other robots' states and intents. While other
distributed multi-robot collision avoidance systems exist, they often require
extracting agent-level features to plan a local collision-free action, which
can be computationally prohibitive and not robust. More importantly, in
practice the performance of these methods are much lower than their centralized
counterparts.
We present a decentralized sensor-level collision avoidance policy for
multi-robot systems, which directly maps raw sensor measurements to an agent's
steering commands in terms of movement velocity. As a first step toward
reducing the performance gap between decentralized and centralized methods, we
present a multi-scenario multi-stage training framework to find an optimal
policy which is trained over a large number of robots on rich, complex
environments simultaneously using a policy gradient based reinforcement
learning algorithm. We validate the learned sensor-level collision avoidance
policy in a variety of simulated scenarios with thorough performance
evaluations and show that the final learned policy is able to find time
efficient, collision-free paths for a large-scale robot system. We also
demonstrate that the learned policy can be well generalized to new scenarios
that do not appear in the entire training period, including navigating a
heterogeneous group of robots and a large-scale scenario with 100 robots.
Videos are available at https://sites.google.com/view/drlmac
Distributed Bayesian Filtering using Logarithmic Opinion Pool for Dynamic Sensor Networks
The discrete-time Distributed Bayesian Filtering (DBF) algorithm is presented
for the problem of tracking a target dynamic model using a time-varying network
of heterogeneous sensing agents. In the DBF algorithm, the sensing agents
combine their normalized likelihood functions in a distributed manner using the
logarithmic opinion pool and the dynamic average consensus algorithm. We show
that each agent's estimated likelihood function globally exponentially
converges to an error ball centered on the joint likelihood function of the
centralized multi-sensor Bayesian filtering algorithm. We rigorously
characterize the convergence, stability, and robustness properties of the DBF
algorithm. Moreover, we provide an explicit bound on the time step size of the
DBF algorithm that depends on the time-scale of the target dynamics, the
desired convergence error bound, and the modeling and communication error
bounds. Furthermore, the DBF algorithm for linear-Gaussian models is cast into
a modified form of the Kalman information filter. The performance and robust
properties of the DBF algorithm are validated using numerical simulations
- …