34 research outputs found
No-Regret Constrained Bayesian Optimization of Noisy and Expensive Hybrid Models using Differentiable Quantile Function Approximations
This paper investigates the problem of efficient constrained global
optimization of hybrid models that are a composition of a known white-box
function and an expensive multi-output black-box function subject to noisy
observations, which often arises in real-world science and engineering
applications. We propose a novel method, Constrained Upper Quantile Bound
(CUQB), to solve such problems that directly exploits the composite structure
of the objective and constraint functions that we show leads substantially
improved sampling efficiency. CUQB is a conceptually simple, deterministic
approach that avoid constraint approximations used by previous methods.
Although the CUQB acquisition function is not available in closed form, we
propose a novel differentiable sample average approximation that enables it to
be efficiently maximized. We further derive bounds on the cumulative regret and
constraint violation under a non-parametric Bayesian representation of the
black-box function. Since these bounds depend sublinearly on the number of
iterations under some regularity assumptions, we establis bounds on the
convergence rate to the optimal solution of the original constrained problem.
In contrast to most existing methods, CUQB further incorporates a simple
infeasibility detection scheme, which we prove triggers in a finite number of
iterations when the original problem is infeasible (with high probability given
the Bayesian model). Numerical experiments on several test problems, including
environmental model calibration and real-time optimization of a reactor system,
show that CUQB significantly outperforms traditional Bayesian optimization in
both constrained and unconstrained cases. Furthermore, compared to other
state-of-the-art methods that exploit composite structure, CUQB achieves
competitive empirical performance while also providing substantially improved
theoretical guarantees
Uncertainty Quantifcation in Vision Based Classifcation
The past decade of artifcial intelligence and deep learning has made tremendous progress
in highly perceptive tasks such as image recognition. Deep learning algorithms map high
dimensional complex representations to low dimensional array mappings. However, these
mappings are generally blindly assumed to be correct, further justifed with high accuracies
on trending datasets. The challenge of creating a comprehensive, explainable and reasonable
deep learning system is yet to be solved. One way to deal with this is by using uncertainty
quantifcation, or uncertainty aware learning, with the help of Bayesian methods.
This thesis contributes to the feld of uncertainty aware learning by demonstrating how
uncertainty can be used to recover performance in case of a physical attack, how uncertainty
can be used to improve sensitivity to noise and how it can be used to improve performance
on dynamic datasets. The frst contribution involves learning from model uncertainty in
the application of deep learning-based semantic segmentation. The second contribution
deals with robustness and sensitivity analysis in image classifcation and fnally, the third
contribution in continual learning by using variance to update the learning rate. The frst
contribution proposes the architecture AdvSegNet which aims to improve the performance
of Bayesian SegNet. In the second contribution, a combined architecture of convolutional
network feature extractor and a Gaussian process (CNN-GP) is made to classify images
under uncertain conditions including noise, blurring and adversarial attacks. Finally, in the
continual learning subject area, the architecture CNN-GP is trained on datasets presented
sequentially. Results show an improvement in performance and sensitivity to adversarial
attack and noisy conditions as well as an improvement in dynamic datasets with a small
number of tasks
Automatic machine learning:methods, systems, challenges
This open access book presents the first comprehensive overview of general methods in Automatic Machine Learning (AutoML), collects descriptions of existing systems based on these methods, and discusses the first international challenge of AutoML systems. The book serves as a point of entry into this quickly-developing field for researchers and advanced students alike, as well as providing a reference for practitioners aiming to use AutoML in their work. The recent success of commercial ML applications and the rapid growth of the field has created a high demand for off-the-shelf ML methods that can be used easily and without expert knowledge. Many of the recent machine learning successes crucially rely on human experts, who select appropriate ML architectures (deep learning architectures or more traditional ML workflows) and their hyperparameters; however the field of AutoML targets a progressive automation of machine learning, based on principles from optimization and machine learning itself
Computer Aided Verification
This open access two-volume set LNCS 11561 and 11562 constitutes the refereed proceedings of the 31st International Conference on Computer Aided Verification, CAV 2019, held in New York City, USA, in July 2019. The 52 full papers presented together with 13 tool papers and 2 case studies, were carefully reviewed and selected from 258 submissions. The papers were organized in the following topical sections: Part I: automata and timed systems; security and hyperproperties; synthesis; model checking; cyber-physical systems and machine learning; probabilistic systems, runtime techniques; dynamical, hybrid, and reactive systems; Part II: logics, decision procedures; and solvers; numerical programs; verification; distributed systems and networks; verification and invariants; and concurrency
Constructive Approximation and Learning by Greedy Algorithms
This thesis develops several kernel-based greedy algorithms for different machine learning problems and analyzes their theoretical and empirical properties. Greedy approaches have been extensively used in the past for tackling problems in combinatorial optimization where finding even a feasible solution can be a computationally hard problem (i.e., not solvable in polynomial time). A key feature of greedy algorithms is that a solution is constructed recursively from the smallest constituent parts. In each step of the constructive process a component is added to the partial solution from the previous step and, thus, the size of the optimization problem is reduced. The selected components are given by optimization problems that are simpler and easier to solve than the original problem. As such schemes are typically fast at constructing a solution they can be very effective on complex optimization problems where finding an optimal/good solution has a high computational cost. Moreover, greedy solutions are rather intuitive and the schemes themselves are simple to design and easy to implement. There is a large class of problems for which greedy schemes generate an optimal solution or a good approximation of the optimum. In the first part of the thesis, we develop two deterministic greedy algorithms for optimization problems in which a solution is given by a set of functions mapping an instance space to the space of reals. The first of the two approaches facilitates data understanding through interactive visualization by providing means for experts to incorporate their domain knowledge into otherwise static kernel principal component analysis. This is achieved by greedily constructing embedding directions that maximize the variance at data points (unexplained by the previously constructed embedding directions) while adhering to specified domain knowledge constraints. The second deterministic greedy approach is a supervised feature construction method capable of addressing the problem of kernel choice. The goal of the approach is to construct a feature representation for which a set of linear hypotheses is of sufficient capacity — large enough to contain a satisfactory solution to the considered problem and small enough to allow good generalization from a small number of training examples. The approach mimics functional gradient descent and constructs features by fitting squared error residuals. We show that the constructive process is consistent and provide conditions under which it converges to the optimal solution. In the second part of the thesis, we investigate two problems for which deterministic greedy schemes can fail to find an optimal solution or a good approximation of the optimum. This happens as a result of making a sequence of choices which take into account only the immediate reward without considering the consequences onto future decisions. To address this shortcoming of deterministic greedy schemes, we propose two efficient randomized greedy algorithms which are guaranteed to find effective solutions to the corresponding problems. In the first of the two approaches, we provide a mean to scale kernel methods to problems with millions of instances. An approach, frequently used in practice, for this type of problems is the Nyström method for low-rank approximation of kernel matrices. A crucial step in this method is the choice of landmarks which determine the quality of the approximation. We tackle this problem with a randomized greedy algorithm based on the K-means++ cluster seeding scheme and provide a theoretical and empirical study of its effectiveness. In the second problem for which a deterministic strategy can fail to find a good solution, the goal is to find a set of objects from a structured space that are likely to exhibit an unknown target property. This discrete optimization problem is of significant interest to cyclic discovery processes such as de novo drug design. We propose to address it with an adaptive Metropolis–Hastings approach that samples candidates from the posterior distribution of structures conditioned on them having the target property. The proposed constructive scheme defines a consistent random process and our empirical evaluation demonstrates its effectiveness across several different application domains
Computer Aided Verification
This open access two-volume set LNCS 11561 and 11562 constitutes the refereed proceedings of the 31st International Conference on Computer Aided Verification, CAV 2019, held in New York City, USA, in July 2019. The 52 full papers presented together with 13 tool papers and 2 case studies, were carefully reviewed and selected from 258 submissions. The papers were organized in the following topical sections: Part I: automata and timed systems; security and hyperproperties; synthesis; model checking; cyber-physical systems and machine learning; probabilistic systems, runtime techniques; dynamical, hybrid, and reactive systems; Part II: logics, decision procedures; and solvers; numerical programs; verification; distributed systems and networks; verification and invariants; and concurrency
Considering stakeholders’ preferences for scheduling slots in capacity constrained airports
Airport slot scheduling has attracted the attention of researchers as a capacity management tool at congested airports. Recent research work has employed multi-objective approaches for scheduling slots at coordinated airports. However, the central question on how to select a commonly accepted airport schedule remains. The various participating stakeholders may have multiple and sometimes conflicting objectives stemming from their decision-making needs. This complex decision environment renders the identification of a commonly accepted solution rather difficult. In this presentation, we propose a multi-criteria decision-making technique that incorporates the priorities and preferences of the stakeholders in order to determine the best compromise solution