168 research outputs found

    When Deep Learning Meets Polyhedral Theory: A Survey

    Full text link
    In the past decade, deep learning became the prevalent methodology for predictive modeling thanks to the remarkable accuracy of deep neural networks in tasks such as computer vision and natural language processing. Meanwhile, the structure of neural networks converged back to simpler representations based on piecewise constant and piecewise linear functions such as the Rectified Linear Unit (ReLU), which became the most commonly used type of activation function in neural networks. That made certain types of network structure \unicode{x2014}such as the typical fully-connected feedforward neural network\unicode{x2014} amenable to analysis through polyhedral theory and to the application of methodologies such as Linear Programming (LP) and Mixed-Integer Linear Programming (MILP) for a variety of purposes. In this paper, we survey the main topics emerging from this fast-paced area of work, which bring a fresh perspective to understanding neural networks in more detail as well as to applying linear optimization techniques to train, verify, and reduce the size of such networks

    Non-acyclicity of coset lattices and generation of finite groups

    Get PDF

    Learning-based Predictive Control via Real-time Aggregate Flexibility

    Full text link
    Aggregators have emerged as crucial tools for the coordination of distributed, controllable loads. To be used effectively, an aggregator must be able to communicate the available flexibility of the loads they control, as known as the aggregate flexibility to a system operator. However, most of existing aggregate flexibility measures often are slow-timescale estimations and much less attention has been paid to real-time coordination between an aggregator and an operator. In this paper, we consider solving an online optimization in a closed-loop system and present a design of real-time aggregate flexibility feedback, termed the maximum entropy feedback (MEF). In addition to deriving analytic properties of the MEF, combining learning and control, we show that it can be approximated using reinforcement learning and used as a penalty term in a novel control algorithm -- the penalized predictive control (PPC), which modifies vanilla model predictive control (MPC). The benefits of our scheme are (1). Efficient Communication. An operator running PPC does not need to know the exact states and constraints of the loads, but only the MEF. (2). Fast Computation. The PPC often has much less number of variables than an MPC formulation. (3). Lower Costs. We show that under certain regularity assumptions, the PPC is optimal. We illustrate the efficacy of the PPC using a dataset from an adaptive electric vehicle charging network and show that PPC outperforms classical MPC.Comment: 13 pages, 5 figures, extension of arXiv:2006.1381

    Scalable Synthesis and Verification: Towards Reliable Autonomy

    Get PDF
    We have seen the growing deployment of autonomous systems in our daily life, ranging from safety-critical self-driving cars to dialogue agents. While impactful and impressive, these systems do not often come with guarantees and are not rigorously evaluated for failure cases. This is in part due to the limited scalability of tools available for designing correct-by-construction systems, or verifying them posthoc. Another key limitation is the lack of availability of models for the complex environments with which autonomous systems often have to interact with. In the direction of overcoming these above mentioned bottlenecks to designing reliable autonomous systems, this thesis makes contributions along three fronts. First, we develop an approach for parallelized synthesis from linear-time temporal logic Specifications corresponding to the generalized reactivity (1) fragment. We begin by identifying a special case corresponding to singleton liveness goals that allows for a decomposition of the synthesis problem, which facilitates parallelized synthesis. Based on the intuition from this special case, we propose a more generalized approach for parallelized synthesis that relies on identifying equicontrollable states. Second, we consider learning-based approaches to enable verification at scale for complex systems, and for autonomous systems that interact with black-box environments. For the former, we propose a new abstraction refinement procedure based on machine learning to improve the performance of nonlinear constraint solving algorithms on large-scale problems. For the latter, we present a data-driven approach based on chance-constrained optimization that allows for a system to be evaluated for specification conformance without an accurate model of the environment. We demonstrate this approach on several tasks, including a lane-change scenario with real-world driving data. Lastly, we consider the problem of interpreting and verifying learning-based components such as neural networks. We introduce a new method based on Craig's interpolants for computing compact symbolic abstractions of pre-images for neural networks. Our approach relies on iteratively computing approximations that provably overapproximate and underapproximate the pre-images at all layers. Further, building on existing work for training neural networks for verifiability in the classification setting, we propose extensions that allow us to generalize the approach to more general architectures and temporal specifications.</p

    Optimizing Urban Infrastructure Resilience Under Precipitation and Population Growth Uncertainties

    Get PDF
    Increased urbanization, infrastructure degradation, and climate change threaten to overwhelm stromwater systems across the nation, rendering them ineffective. Green Infrastructure (GI) practices are low cost, low regret strategies that can contribute to urban runoff management. However, questions remain as to how to best distribute GI practices through urban watersheds given the precipitation uncertainty and the hydrological responses to them.First, we develop a two-stage stochastic robust programming model to determine the optimal placement of GI practices across a set of candidate locations in a watershed to minimize the total expected runoff under medium-term precipitation uncertainties. We develop a systemic approach to downscale the existing daily precipitation projections into hourly units and efficiently estimate the corresponding hydrological responses. We conduct a case study for an urban watershed in a mid-sized city in the U.S., perform sensitivity analyses and provide insights.Second, we develop a mathematical model to optimally place GI practices when (re-)designing an urban area, subject to uncertainties in population growth and future precipitation. Specifically, we develop a finite-horizon Markov decision process model to determine the extent to which GI practices need to be incorporated in different parts of a given urban area to maximize their benefits, considering the dynamic changes in population density and precipitation. We conduct a case study, perform sensitivity analyses and provide insights.Finally, we consider a problem of scheduling maintenance crew following a storm event to efficiently maintain GI practices across a watershed to mitigate surface runoff due to future events. Specifically, we investigate a condition for which the polyhedron of the flow shop scheduling problem is integer-optimal. This condition is used to construct a column generation algorithm to solve the problem to optimality. The solution approach is boosted with a heuristic that sequentially solves a series of linear programming models to generate a quality initial solution. The solution approach is also integrated with a commercial solver, which results in significant computational savings. Computational experiments show that the developed algorithm can efficiently solve test problems to near-optimality

    Structure Exploitation in Mixed-Integer Optimization with Applications to Energy Systems

    Get PDF
    Das Ziel dieser Arbeit ist neue numerische Methoden für gemischt-ganzzahlige Optimierungsprobleme zu entwickeln um eine verbesserte Geschwindigkeit und Skalierbarkeit zu erreichen. Dies erfolgt durch Ausnutzung gängiger Problemstrukturen wie separierbarkeit oder Turnpike-eigenschaften. Methoden, die diese Strukturen ausnutzen können, wurden bereits im Bereich der verteilten Optimierung und optimalen Steuerung entwickelt, sie sind jedoch nicht direkt auf gemischt-ganztägige Probleme anwendbar. Um verteilte Rechenressourcen zur Lösung von gemischt-ganzzahligen Problemen nutzen zu können, sind neue Methoden erforderlich. Zu diesem Zweck werden verschiedene Erweiterungen bestehender Methoden sowie neuartige Techniken zur gemischt-ganzzahligen Optimierung vorgestellt. Benchmark-Probleme aus Strom- und Energiesystemen werden verwendet, um zu demonstrieren, dass die vorgestellten Methoden zu schnelleren Laufzeiten führen und die Lösung großer Probleme ermöglichen, die sonst nicht zentral gelöst werden können. Die vorliegende Arbeit enthält die folgenden Beiträge: - Eine Erweiterung des Augmented Lagrangian Alternating Direction Inexact Newton-Algorithmus zur verteilten Optimierung für gemischt-ganzzahlige Probleme. - Ein neuer, teilweise-verteilter Optimierungsalgorithmus für die gemischt-ganzzahlige Optimierung basierend auf äußeren Approximationsverfahren. - Ein neuer Optimierungsalgorithmus für die verteilte gemischt-ganzzahlige Optimierung, der auf branch-and-bound Verfahren basiert. - Eine erste Untersuchung von Turnpike-Eigenschaften bei Optimalsteuerungsproblemen mit gemischten-Ganzzahligen Entscheidungsgrößen und ein spezieller Algorithmus zur Lösung dieser Probleme. - Eine neue Branch-and-Bound Heuristik, die a priori Probleminformationen effizienter nutzt als aktuelle Warmstarttechniken. Schließlich wird gezeigt, dass die Ergebnisse der vorgestellten Optimierungsalgorithmen für verteilte gemischt-ganzzahlige Optimierung stark Partitionierungsabhängig sind. Zu diesem Zweck wird auch eine Untersuchung von Partitionierungsmethoden für die verteilte Optimierung vorgestellt

    Scheduling of multiple vehicle types: The allocation of locomotives to trains.

    Get PDF
    Abstract not available

    Correct-By-Construction Control Synthesis for Systems with Disturbance and Uncertainty

    Full text link
    This dissertation focuses on correct-by-construction control synthesis for Cyber-Physical Systems (CPS) under model uncertainty and disturbance. CPSs are systems that interact with the physical world and perform complicated dynamic tasks where safety is often the overriding factor. Correct-by-construction control synthesis is a concept that provides formal performance guarantees to closed-loop systems by rigorous mathematic reasoning. Since CPSs interact with the environment, disturbance and modeling uncertainty are critical to the success of the control synthesis. Disturbance and uncertainty may come from a variety of sources, such as exogenous disturbance, the disturbance caused by co-existing controllers and modeling uncertainty. To better accommodate the different types of disturbance and uncertainty, the verification and control synthesis methods must be chosen accordingly. Four approaches are included in this dissertation. First, to deal with exogenous disturbance, a polar algorithm is developed to compute an avoidable set for obstacle avoidance. Second, a supervised learning based method is proposed to design a good student controller that has safety built-in and rarely triggers the intervention of the supervisory controller, thus targeting the design of the student controller. Third, to deal with the disturbance caused by co-existing controllers, a Lyapunov verification method is proposed to formally verify the safety of coexisting controllers while respecting the confidentiality requirement. Finally, a data-driven approach is proposed to deal with model uncertainty. A minimal robust control invariant set is computed for an uncertain dynamic system without a given model by first identifying the set of admissible models and then simultaneously computing the invariant set while selecting the optimal model. The proposed methods are applicable to many real-world applications and reflect the notion of using the structure of the system to achieve performance guarantees without being overly conservative.PHDMechanical EngineeringUniversity of Michigan, Horace H. Rackham School of Graduate Studieshttps://deepblue.lib.umich.edu/bitstream/2027.42/145933/1/chenyx_1.pd
    corecore