
    Desensitization and Deception in Differential Games with Asymmetric Information

    Desensitization addresses safe optimal planning under parametric uncertainties by providing sensitivity function-based risk measures. This paper expands upon the existing work on desensitization to address safe planning for a class of two-player differential games. In the proposed game, parametric uncertainties correspond to variations in a vector of model parameters about its nominal value. The two players in the proposed formulation are assumed to have information about the nominal value of the parameter vector. However, only one of the players is assumed to have complete knowledge of parametric variation, creating a form of information asymmetry in the proposed game. The lack of knowledge regarding the parametric variations is expected to result in state constraint violations for the player with an information disadvantage. In this regard, a desensitized feedback strategy that provides safe trajectories is proposed for the player with incomplete information. The proposed feedback strategy is evaluated in instances involving one pursuer and one evader with an uncertain dynamic obstacle, where the pursuer is assumed to know only the nominal value of the obstacle's speed. At the same time, the evader knows the obstacle's true speed, and also the fact that the pursuer possesses only the nominal value. Subsequently, deceptive strategies are proposed for the evader, who has an information advantage, and these strategies are assessed against the pursuer's desensitized strategy

    A Real-Time Game Theoretic Planner for Autonomous Two-Player Drone Racing

    In this article, we propose an online 3-D planning algorithm for a drone to race competitively against a single adversary drone. The algorithm computes an approximation of the Nash equilibrium in the joint space of trajectories of the two drones at each time step, and proceeds in a receding horizon fashion. The algorithm uses a novel sensitivity term, within an iterative best response computational scheme, to approximate the amount by which the adversary will yield to the ego drone to avoid a collision. This leads to racing trajectories that are more competitive than without the sensitivity term. We prove that the fixed point of this sensitivity enhanced iterative best response satisfies the first-order optimality conditions of a Nash equilibrium. We present results of a simulation study of races with 2-D and 3-D race courses, showing that our game theoretic planner significantly outperforms a model predictive control (MPC) racing algorithm. We also present results of multiple drone racing experiments on a 3-D track in which drones sense each other's relative position with onboard vision. The proposed game theoretic planner again outperforms the MPC opponent in these experiments where drones reach speeds up to 1.25 m/s
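
As a rough illustration of the iterative best response idea mentioned in this abstract, the sketch below runs plain best-response iteration on a toy scalar two-player game. It is not the paper's planner and it omits the sensitivity term entirely. The quadratic costs, the parameters `a`, `b`, `c`, and the function name are all assumptions made for the example.

```python
def iterative_best_response(a, b, c, iters=100, tol=1e-12):
    """Plain iterative best response on a hypothetical toy game.

    Player 1 minimizes J1(x, y) = (x - a)**2 + c * x * y over x.
    Player 2 minimizes J2(x, y) = (y - b)**2 + c * x * y over y.
    Each best response follows from setting the player's own
    first-order condition to zero with the other player held fixed.
    """
    x, y = 0.0, 0.0
    for _ in range(iters):
        x_new = a - 0.5 * c * y      # solves dJ1/dx = 2(x - a) + c*y = 0
        y_new = b - 0.5 * c * x_new  # solves dJ2/dy = 2(y - b) + c*x = 0
        if abs(x_new - x) < tol and abs(y_new - y) < tol:
            return x_new, y_new
        x, y = x_new, y_new
    return x, y


if __name__ == "__main__":
    x_star, y_star = iterative_best_response(a=1.0, b=2.0, c=1.0)
    print(x_star, y_star)
```

At the fixed point both first-order conditions hold at once, which is the sense in which a best-response fixed point satisfies first-order Nash conditions in this toy setting. The iteration contracts here only because the coupling `c` is small relative to the quadratic terms.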

    A model predictive control approach to a class of multiplayer minmax differential games

    In this dissertation, we consider a class of two-team adversarial differential games in which there are multiple mobile dynamic agents on each team. We describe such games in terms of semi-infinite minmax Model Predictive Control (MPC) problems, and present a numerical optimization technique for efficiently solving them. We also describe the implementation of the solution method in both indoor and outdoor robotic testbeds. Our solution method requires one to solve a sequence of Quadratic Programs (QPs), which together efficiently solve the original semi-infinite minmax MPC problem. The solution method separates the problem into two subproblems called the inner and outer subproblems, respectively. The inner subproblem is based on a constrained nonlinear numerical optimization technique called the Phase I-Phase II method, and we develop a customized version of this method. The outer subproblem is about judiciously initializing the inner subproblems to achieve overall convergence; our method guarantees exponential convergence. We focus on a specific semi-infinite minmax MPC problem called the harbor defense problem. First, we present foundational work on this problem in a formulation containing a single defender and single intruder. We next extend the basic formulation to various advanced scenarios that include cases in which there are multiple defenders and intruders, and also ones that include varying assumptions about intruder strategies. Another main contribution is that we implemented our solution method for the harbor defense problem on both real-time indoor and outdoor testbeds, and demonstrated its computational effectiveness. The indoor testbed is a custom-built robotic testbed named HoTDeC (Hovercraft Testbed for Decentralized Control). The outdoor testbed involved full-sized US Naval Academy patrol ships, and the experiment was conducted in Chesapeake Bay in collaboration with the US Naval Academy. The scenario used involved one ship (the intruder) being commanded by a human pilot, and the defender ship being controlled automatically by our semi-infinite minmax MPC algorithm. The results of several experiments are presented. Finally, we present an efficient algorithm for solving a class of matrix games, and show how this approach can be directly used to effectively solve our original continuous space semi-infinite minmax problem using an adaptive approximation

    Collision avoidance strategies for unmanned aerial vehicles in formation flight

    Collision avoidance strategies for multiple UAVs (Unmanned Aerial Vehicles) based on geometry are investigated in this study. The proposed strategies allow a group of UAVs to avoid obstacles and separate if necessary through a simple algorithm with low computation by expanding the collision-cone approach to formation of UAVs. The geometric approach uses line-of-sight vectors and relative velocity vectors where dynamic constraints are included in the formation. Each UAV can determine which plane and direction are available for collision avoidance. An analysis is performed to define an envelope for collision avoidance, where angular rate limits and obstacle detection range limits are considered. Based on the collision avoidance envelope, each UAV in a formation determines whether the formation can be maintained or not while avoiding obstacles. Numerical simulations are performed to demonstrate the performance of the proposed strategies
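
To make the geometric idea in this abstract concrete, here is a minimal 2-D sketch of a generic collision-cone test for a single vehicle and a single circular obstacle. It is not the study's formation algorithm. The function name, the `safe_radius` parameter, and the planar setting are assumptions for illustration.

```python
import math


def in_collision_cone(rel_pos, rel_vel, safe_radius):
    """Return True if the relative velocity points inside the collision cone.

    rel_pos: line-of-sight vector, obstacle position minus vehicle position.
    rel_vel: vehicle velocity minus obstacle velocity.
    safe_radius: assumed radius of the safety disc around the obstacle.
    """
    rx, ry = rel_pos
    vx, vy = rel_vel
    dist = math.hypot(rx, ry)
    if dist <= safe_radius:
        return True  # already inside the safety disc
    speed = math.hypot(vx, vy)
    if speed == 0.0:
        return False  # no relative motion, so no future conflict
    closing = rx * vx + ry * vy  # positive when approaching the obstacle
    if closing <= 0.0:
        return False  # moving away from, or tangentially to, the obstacle
    # Angle between the line of sight and the relative velocity.
    angle = math.acos(min(1.0, closing / (dist * speed)))
    # Half-angle of the cone tangent to the safety disc.
    half_cone = math.asin(safe_radius / dist)
    return angle < half_cone


if __name__ == "__main__":
    print(in_collision_cone((10.0, 0.0), (1.0, 0.0), 1.0))  # head-on approach
    print(in_collision_cone((10.0, 0.0), (1.0, 1.0), 1.0))  # passes well clear
```

The test needs only the line-of-sight vector and the relative velocity, which is why checks of this kind are cheap to compute. The angular rate limits and detection range limits described in the abstract are not modeled here.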

    Simulacrum


    Optimal Collision Avoidance Trajectories for Unmanned/Remotely Piloted Aircraft

    The post-9/11 environment has punctuated the force-multiplying capabilities that Remotely Piloted Aircraft (RPA) provides combatant commanders at all echelons on the battlefield. Not only have unmanned aircraft systems made near-revolutionary impacts on the battlefield, their utility and proliferation in law enforcement, homeland security, humanitarian operations, and commercial applications have likewise increased at a rapid rate. As such, under the Federal Aviation Administration (FAA) Modernization and Reform Act of 2012, the United States Congress tasked the FAA to provide for the safe integration of civil unmanned aircraft systems into the national airspace system (NAS) as soon as practicable, but not later than September 30, 2015. However, a necessary entrance criterion to operate RPAs in the NAS is the ability to Sense and Avoid (SAA) both cooperative and noncooperative air traffic to attain a target level of safety as a traditional manned aircraft platform. The goal of this research effort is twofold: First, develop techniques for calculating optimal avoidance trajectories, and second, develop techniques for estimating an intruder aircraft's trajectory in a stochastic environment. This dissertation describes the optimal control problem associated with SAA and uses a direct orthogonal collocation method to solve this problem and then analyzes these results for different collision avoidance scenarios

    Formal methods paradigms for estimation and machine learning in dynamical systems

    Formal methods are widely used in engineering to determine whether a system exhibits a certain property (verification) or to design controllers that are guaranteed to drive the system to achieve a certain property (synthesis). Most existing techniques require a large amount of accurate information about the system in order to be successful. The methods presented in this work can operate with significantly less prior information. In the domain of formal synthesis for robotics, the assumptions of perfect sensing and perfect knowledge of system dynamics are unrealistic. To address this issue, we present control algorithms that use active estimation and reinforcement learning to mitigate the effects of uncertainty. In the domain of cyber-physical system analysis, we relax the assumption that the system model is known and identify system properties automatically from execution data. First, we address the problem of planning the path of a robot under temporal logic constraints (e.g. "avoid obstacles and periodically visit a recharging station") while simultaneously minimizing the uncertainty about the state of an unknown feature of the environment (e.g. locations of fires after a natural disaster). We present synthesis algorithms and evaluate them via simulation and experiments with aerial robots. Second, we develop a new specification language for tasks that require gathering information about and interacting with a partially observable environment, e.g. "Maintain localization error below a certain level while also avoiding obstacles." Third, we consider learning temporal logic properties of a dynamical system from a finite set of system outputs. For example, given maritime surveillance data we wish to find the specification that corresponds only to those vessels that are deemed law-abiding. Algorithms for performing off-line supervised and unsupervised learning and on-line supervised learning are presented. Finally, we consider the case in which we want to steer a system with unknown dynamics to satisfy a given temporal logic specification. We present a novel reinforcement learning paradigm to solve this problem. Our procedure gives "partial credit" for executions that almost satisfy the specification, which can lead to faster convergence rates and produce better solutions when the specification is not satisfiable

    Triptych: Three Meditations on How Law Rules after Globalization


    Triptych: Three Mediations on How Law Rules after Globalization
