4,719 research outputs found

    On the genericity properties in networked estimation: Topology design and sensor placement

    Full text link
    In this paper, we consider networked estimation of linear, discrete-time dynamical systems monitored by a network of agents. In order to minimize the power requirement at the (possibly, battery-operated) agents, we require that the agents can exchange information with their neighbors only \emph{once per dynamical system time-step}; in contrast to consensus-based estimation where the agents exchange information until they reach a consensus. It can be verified that with this restriction on information exchange, measurement fusion alone results in an unbounded estimation error at every such agent that does not have an observable set of measurements in its neighborhood. To over come this challenge, state-estimate fusion has been proposed to recover the system observability. However, we show that adding state-estimate fusion may not recover observability when the system matrix is structured-rank (SS-rank) deficient. In this context, we characterize the state-estimate fusion and measurement fusion under both full SS-rank and SS-rank deficient system matrices.Comment: submitted for IEEE journal publicatio

    State Omniscience for Cooperative Local Catalog Maintenance of Close Proximity Satellite Systems

    Get PDF
    Resiliency in multi-agent system navigation is reliant on the inherent ability of the system to withstand, overcome, or recover from adverse conditions and disturbances. In large part, resiliency is achieved through reducing the impact of critical failure points to the success and/or performance of the system. In this view, decentralized multi-agent architectures have become an attractive solution for multi-agent navigation, but decentralized architectures place the burden of information acquisition directly on the agents themselves. In fact, the design of distributed estimators has been a growing interest to enable complex multi-sensor/multi-agent tasks. In such scenarios, it is important that each local estimator converges to the true global system state - a quality known as state omniscience. Most previous related work has focused on the design of such systems under varying assumptions on the graph topology with simplified information fusion schemes. Contrarily, this work introduces characterizations of state omniscience under generalized graph topologies and generalized information fusion schemes. State omniscience is discussed analogously to observability from classical control theory; and a collection of necessary and sufficient conditions for a distributed estimator to be state omniscient are presented. This dissertation discusses this phenomena in terms of an on-orbit scenarios dubbed the local catalog maintenance problem and the cooperative local catalog maintenance problem. The goal of each agent is to maintain their catalog of all bodies (objects and agents) within this neighborhood through onboard angles-only measurements and cooperative communication with the other agents. A central supervisor selects the target body for each agent, a local controller tracks the selected target body for each agent, and a local estimator coalesces both an agent\u27s measurements and state estimates provided by neighboring agents within the communication graph. Numerical results are provided to demonstrate the supervisor\u27s ability to select an appropriate target subject to an uncertainty threshold, the controller\u27s ability to track the selected target, and the estimator\u27s ability to maintain an accurate and precise estimate of each of the bodies in the local environment

    Optimal design of observable multi-agent networks: A structural system approach

    Get PDF
    This paper introduces a method to design observable directed multi-agent networks, that are: 1) either minimal with respect to a communications-related cost function, or 2) idem, under possible failure of direct communication between two agents. An observable multi-agent network is characterized by agents that update their states using a neighboring rule based on directed communication graph topology in order to share information about their states; furthermore, each agent can infer the initial information shared by all the agents. Sufficient conditions to ensure that 1) is satisfied are obtained by reducing the original problem to the travelling salesman problem (TSP). For the case described in 2), sufficient conditions for the existence of a minimal network are shown to be equivalent to the existence of two disjoint solutions to the TSP. The results obtained are illustrated with an example from the area of cooperative path following of multiple networked vehicles by resorting to an approximate solution to the TSP

    Centralised rehearsal of decentralised cooperation: Multi-agent reinforcement learning for the scalable coordination of residential energy flexibility

    Full text link
    This paper investigates how deep multi-agent reinforcement learning can enable the scalable and privacy-preserving coordination of residential energy flexibility. The coordination of distributed resources such as electric vehicles and heating will be critical to the successful integration of large shares of renewable energy in our electricity grid and, thus, to help mitigate climate change. The pre-learning of individual reinforcement learning policies can enable distributed control with no sharing of personal data required during execution. However, previous approaches for multi-agent reinforcement learning-based distributed energy resources coordination impose an ever greater training computational burden as the size of the system increases. We therefore adopt a deep multi-agent actor-critic method which uses a \emph{centralised but factored critic} to rehearse coordination ahead of execution. Results show that coordination is achieved at scale, with minimal information and communication infrastructure requirements, no interference with daily activities, and privacy protection. Significant savings are obtained for energy users, the distribution network and greenhouse gas emissions. Moreover, training times are nearly 40 times shorter than with a previous state-of-the-art reinforcement learning approach without the factored critic for 30 homes
    • …
    corecore