149 research outputs found

    Distributed Extremum Seeking Control for a Variable Refrigerant Flow System

    Get PDF
    The variable refrigerant flow (VRF) technology has facilitated the development of multi-split ductless air conditioning systems, in which multiple indoor units (IDU) are used to regulate the refrigerant flow to achieve individualized zoning control. Model based control for VRF system demands for more modeling efforts in part due to diverse configuration, as well as changes in load and ambient conditions. As a model-free control strategy, Extremum Seeking Control (ESC) has been investigated for VRF systems. Dong et al. (2015) applied the standard centralized ESC scheme to a VRF system that consists of one outdoor unit (ODU) and four IDU’s. Simulation results have indicated the effectiveness of such strategy. As the number of IDU’s increases, the complexity of centralized controllers will increase accordingly. Therefore distributed ESC becomes a natural consideration for VRF systems with large number of IDU’s. In this paper, the Shashahani gradient based distributed ESC scheme proposed by Poveda and Quijano (2013, 2015), is applied to the four-zone VRF system simulated by Dong et al. (2015). In particular, this scheme is enhanced by appending a band-pass filter array at the output to achieve a better “isolation†among individual input channels. A single-input ESC is applied to the ODU, while the distributed ESC is applied to the four IDU’s with each acting as an agent. For each agent, the respective power consumption is used as feedback. The objective is to minimize the total power consumption of all agents. For the ODU ESC, the compressor suction pressure (PCS) set-point is employed as the manipulative input. For the IDU DESC, the evaporator superheat (SH) set-point is used as the manipulative input for each IDU agent. The distributed ESC scheme assumes full information communication among all IDU’s. Simulation study is performed to evaluate the proposed strategy with the Modelica based dynamic simulation model developed by Dong et al. (2015). The ESC is designed under the ambient condition of 35oC and 40 %RH, respectively. The initial temperature of all four IDUs zone is 29oC, and the zone temperature set-point is 26oC. The heat loads for IDU1 through IDU4 are 3000W, 2600W, 2400W and 2000W, respectively. It takes the average total power about 10000 seconds to converge to about 3200W in steady state, with PCS around 13bar, and the SH values of IDU1 through IDU4 at 4.5oC, 4.5oC, 6oC, and 5.5oC, respectively. The total power consumption was decreased from 4500 W to 3200 W, i.e. by 29%. In comparison with the centralized ESC Dong et al. (2015), the steady state error of total power is less than 50w. Work is under way to improve transient and steady-state performance, as well as simulation of other operation modes.  Â

    Learning in Engineered Multi-agent Systems

    Get PDF
    Consider the problem of maximizing the total power produced by a wind farm. Due to aerodynamic interactions between wind turbines, each turbine maximizing its individual power--as is the case in present-day wind farms--does not lead to optimal farm-level power capture. Further, there are no good models to capture the said aerodynamic interactions, rendering model based optimization techniques ineffective. Thus, model-free distributed algorithms are needed that help turbines adapt their power production on-line so as to maximize farm-level power capture. Motivated by such problems, the main focus of this dissertation is a distributed model-free optimization problem in the context of multi-agent systems. The set-up comprises of a fixed number of agents, each of which can pick an action and observe the value of its individual utility function. An individual's utility function may depend on the collective action taken by all agents. The exact functional form (or model) of the agent utility functions, however, are unknown; an agent can only measure the numeric value of its utility. The objective of the multi-agent system is to optimize the welfare function (i.e. sum of the individual utility functions). Such a collaborative task requires communications between agents and we allow for the possibility of such inter-agent communications. We also pay attention to the role played by the pattern of such information exchange on certain aspects of performance. We develop two algorithms to solve this problem. The first one, engineered Interactive Trial and Error Learning (eITEL) algorithm, is based on a line of work in the Learning in Games literature and applies when agent actions are drawn from finite sets. While in a model-free setting, we introduce a novel qualitative graph-theoretic framework to encode known directed interactions of the form "which agents' action affect which others' payoff" (interaction graph). We encode explicit inter-agent communications in a directed graph (communication graph) and, under certain conditions, prove convergence of agent joint action (under eITEL) to the welfare optimizing set. The main condition requires that the union of interaction and communication graphs be strongly connected; thus the algorithm combines an implicit form of communication (via interactions through utility functions) with explicit inter-agent communications to achieve the given collaborative goal. This work has kinship with certain evolutionary computation techniques such as Simulated Annealing; the algorithm steps are carefully designed such that it describes an ergodic Markov chain with a stationary distribution that has support over states where agent joint actions optimize the welfare function. The main analysis tool is perturbed Markov chains and results of broader interest regarding these are derived as well. The other algorithm, Collaborative Extremum Seeking (CES), uses techniques from extremum seeking control to solve the problem when agent actions are drawn from the set of real numbers. In this case, under the assumption of existence of a local minimizer for the welfare function and a connected undirected communication graph between agents, a result regarding convergence of joint action to a small neighborhood of a local optimizer of the welfare function is proved. Since extremum seeking control uses a simultaneous gradient estimation-descent scheme, gradient information available in the continuous action space formulation is exploited by the CES algorithm to yield improved convergence speeds. The effectiveness of this algorithm for the wind farm power maximization problem is evaluated via simulations. Lastly, we turn to a different question regarding role of the information exchange pattern on performance of distributed control systems by means of a case study for the vehicle platooning problem. In the vehicle platoon control problem, the objective is to design distributed control laws for individual vehicles in a platoon (or a road-train) that regulate inter-vehicle distances at a specified safe value while the entire platoon follows a leader-vehicle. While most of the literature on the problem deals with some inadequacy in control performance when the information exchange is of the nearest neighbor-type, we consider an arbitrary graph serving as information exchange pattern and derive a relationship between how a certain indicator of control performance is related to the information pattern. Such analysis helps in understanding qualitative features of the `right' information pattern for this problem

    A Tutorial on Distributed Optimization for Cooperative Robotics: from Setups and Algorithms to Toolboxes and Research Directions

    Full text link
    Several interesting problems in multi-robot systems can be cast in the framework of distributed optimization. Examples include multi-robot task allocation, vehicle routing, target protection and surveillance. While the theoretical analysis of distributed optimization algorithms has received significant attention, its application to cooperative robotics has not been investigated in detail. In this paper, we show how notable scenarios in cooperative robotics can be addressed by suitable distributed optimization setups. Specifically, after a brief introduction on the widely investigated consensus optimization (most suited for data analytics) and on the partition-based setup (matching the graph structure in the optimization), we focus on two distributed settings modeling several scenarios in cooperative robotics, i.e., the so-called constraint-coupled and aggregative optimization frameworks. For each one, we consider use-case applications, and we discuss tailored distributed algorithms with their convergence properties. Then, we revise state-of-the-art toolboxes allowing for the implementation of distributed schemes on real networks of robots without central coordinators. For each use case, we discuss their implementation in these toolboxes and provide simulations and real experiments on networks of heterogeneous robots

    Mobile robotic network deployment for intruder detection and tracking

    Get PDF
    This thesis investigates the problem of intruder detection and tracking using mobile robotic networks. In the first part of the thesis, we consider the problem of seeking an electromagnetic source using a team of robots that measure the local intensity of the emitted signal. We propose a planner for a team of robots based on Particle Swarm Optimization (PSO) which is a population based stochastic optimization technique. An equivalence is established between particles generated in the traditional PSO technique, and the mobile agents in the swarm. Since the positions of the robots are updated using the PSO algorithm, modifications are required to implement the PSO algorithm on real robots to incorporate collision avoidance strategies. The modifications necessary to implement PSO on mobile robots, and strategies to adapt to real environments are presented in this thesis. Our results are also validated on an experimental testbed. In the second part, we present a game theoretic framework for visibility-based target tracking in multi-robot teams. A team of observers (pursuers) and a team of targets (evaders) are present in an environment with obstacles. The objective of the team of observers is to track the team of targets for the maximum possible time. While the objective of the team of targets is to escape (break line-of-sight) in the minimum time. We decompose the problem into two layers. At the upper level, each pursuer is allocated to an evader through a minimum cost allocation strategy based on the risk of each evader, thereby, decomposing the agents into multiple single pursuer-single evader pairs. Two decentralized allocation strategies are proposed and implemented in this thesis. At the lower level, each pursuer computes its strategy based on the results of the single pursuer-single evader target-tracking problem. We initially address this problem in an environment containing a semi-infinite obstacle with one corner. The pursuer\u27s optimal tracking strategy is obtained regardless of the evader\u27s strategy using techniques from optimal control theory and differential games. Next, we extend the result to an environment containing multiple polygonal obstacles. We construct a pursuit field to provide a guiding vector for the pursuer which is a weighted sum of several component vectors. The performance of different combinations of component vectors is investigated. Finally, we extend our work to address the case when the obstacles are not polygonal, and the observers have constraints in motion

    A Distributed Learning Algorithm with Bit-valued Communications for Multi-agent Welfare Optimization

    Get PDF
    A multi-agent system comprising N agents, each picking actions from a finite set and receiving a payoff that depends on the action of the whole, is considered. The exact form of the payoffs are unknown and only their values can be measured by the respective agents. A decentralized algorithm was proposed by Marden et. al. [1] and in the authors’ earlier work [2] that, in this setting, leads to the agents picking welfare optimizing actions under some restrictive assumptions on the payoff structure. This algorithm is modified in this paper to incorporate exchange of certain bit-valued information between the agents over a directed communication graph. The notion of an interaction graph is then introduced to encode known interaction in the system. Restrictions on the payoff structure are eliminated and conditions that guarantee convergence to welfare minimizing actions w.p. 1 are derived under the assumption that the union of the interaction graph and communication graph is strongly connected.Research partially supported by the US Air Force Office of Scientific Research MURI grant FA9550-09-1-0538 and by the National Science Foundation (NSF) grant CNS-1035655

    From Chemical Plants to Clinical Patients: Process Control Applications in Biomedicine

    Get PDF
    In recent times, there has been a convergence and interaction between the age-long principles of chemical process control (hitherto exclusive to the world of man-made industrial chemical process plants) and the life sciences (particularly biomedicine). This review article presents some examples of application areas in biomedicine where process dynamics and control, as a sub-discipline of process engineering, is being utilized to save human lives. It especially focuses on the extension of the chemical engineer’s “process” and “system” to embrace parts of the human body or microbial cells. The aim of the article is to make the reader appreciate how the traditional chemical engineering tools of process dynamics and control can be applied newly to biomedical problems. This is to stir the readers’ mind to explore other exciting ways of applying control engineering knowledge to solve modern healthcare challenges. The review was conducted under three broad application headings: medical device engineering applications, industrial-scale production of therapeutic substances and elucidatory investigations into complex physiologies

    Resource Management in Distributed Camera Systems

    Get PDF
    The aim of this work is to investigate different methods to solve the problem of allocating the correct amount of resources (network bandwidth and storage space) to video camera systems. Here we explore the intersection between two research areas: automatic control and game theory. Camera systems are a good example of the emergence of the Internet of Things (IoT) and its impact on our daily lives and the environment. We aim to improve today’s systems, shift from resources over-provisioning to allocate dynamically resources where they are needed the most. We optimize the storage and bandwidth allocation of camera systems to limit the impact on the environment as well as provide the best visual quality attainable with the resource limitations. This thesis is written as a collection of papers. It begins by introducing the problem with today’s camera systems, and continues with background information about resource allocation, automatic control and game theory. The third chapter de- scribes the models of the considered systems, their limitations and challenges. It then continues by providing more background on the automatic control and game theory techniques used in the proposed solutions. Finally, the proposed solutions are provided in five papers.Paper I proposes an approach to estimate the amount of data needed by surveillance cameras given camera and scenario parameters. This model is used for calculating the quasi Worst-Case Transmission Times of videos over a network. Papers II and III apply control concepts to camera network storage and bandwidth assignment. They provide simple, yet elegant solutions to the allocation of these resources in distributed camera systems. Paper IV com- bines pricing theory with control techniques to force the video quality of cam- era systems to converge to a common value based solely on the compression parameter of the provided videos. Paper V uses the VCG auction mechanism to solve the storage space allocation problem in competitive camera systems. It allows for a better system-wide visual quality than a simple split allocation given the limited system knowledge, trust and resource constraints
    • …
    corecore