A Formalization of Robustness for Deep Neural Networks

Author: Dreossi Tommaso
Ghosh Shromona
Sangiovanni-Vincentelli Alberto
Seshia Sanjit A.
Publication date: 24/03/2019

Deep neural networks have been shown to lack robustness to small input perturbations. The process of generating the perturbations that expose the lack of robustness of neural networks is known as adversarial input generation. This process depends on the goals and capabilities of the adversary. In this paper, we propose a unifying formalization of the adversarial input generation process from a formal methods perspective. We provide a definition of robustness that is general enough to capture different formulations. The expressiveness of our formalization is shown by modeling and comparing a variety of adversarial attack techniques.

arXiv.org e-Print Archive

eScholarship - University of California

Robustness Verification for Classifier Ensembles

Author: Gross Dennis
Jansen Nils
Pérez Guillermo A.
Raaijmakers Stephan
Publication date: 01/01/2020

We give a formal verification procedure that decides whether a classifier ensemble is robust against arbitrary randomized attacks. Such attacks consist of a set of deterministic attacks and a distribution over this set. The robustness-checking problem consists of assessing, given a set of classifiers and a labelled data set, whether there exists a randomized attack that induces a certain expected loss against all classifiers. We show the NP-hardness of the problem and provide an upper bound on the number of attacks that is sufficient to form an optimal randomized attack. These results provide an effective way to reason about the robustness of a classifier ensemble. We provide SMT and MILP encodings to compute optimal randomized attacks or prove that there is no attack inducing a certain expected loss. In the latter case, the classifier ensemble is provably robust. Our prototype implementation verifies multiple neural-network ensembles trained for image-classification tasks. The experimental results using the MILP encoding are promising both in terms of scalability and the general applicability of our verification procedure.

arXiv.org e-Print Archive

Institutional Repository Universiteit Antwerpen

Robustness of 3D Deep Learning in an Adversarial Setting

Author: Kwiatkowska Marta
Wicker Matthew
Publication date: 01/04/2019

Understanding the spatial arrangement and nature of real-world objects is of paramount importance to many complex engineering tasks, including autonomous navigation. Deep learning has revolutionized state-of-the-art performance for tasks in 3D environments; however, relatively little is known about the robustness of these approaches in an adversarial setting. The lack of comprehensive analysis makes it difficult to justify deployment of 3D deep learning models in real-world, safety-critical applications. In this work, we develop an algorithm for analysis of pointwise robustness of neural networks that operate on 3D data. We show that current approaches presented for understanding the resilience of state-of-the-art models vastly overestimate their robustness. We then use our algorithm to evaluate an array of state-of-the-art models in order to demonstrate their vulnerability to occlusion attacks. We show that, in the worst case, these networks can be reduced to 0% classification accuracy after the occlusion of at most 6.5% of the occupied input space. Comment: 10 pages, 8 figures, 1 table

arXiv.org e-Print Archive

Crossref

Oxford University Research Archive

Towards Logical Specification of Statistical Machine Learning

Author: B Alpern
Cynthia Dwork
D Hughes
G Bana
GH Wright von
Guy Katz
JY Halpern
M Burrows
PF Syverson
R Fagin
Raúl Pardo
Rohit Chadha
SA Kripke
SA Seshia
T Williamson
Xiaowei Huang
Y Kawamoto
Publication venue: Springer Science and Business Media LLC
Publication date: 28/03/2020

We introduce a logical approach to formalizing statistical properties of machine learning. Specifically, we propose a formal model for statistical classification based on a Kripke model, and formalize various notions of classification performance, robustness, and fairness of classifiers by using epistemic logic. Then we show some relationships among properties of classifiers and those between classification performance and robustness, which suggests robustness-related properties that have not been formalized in the literature as far as we know. To formalize fairness properties, we define a notion of counterfactual knowledge and show techniques to formalize conditional indistinguishability by using counterfactual epistemic operators. As far as we know, this is the first work that uses logical formulas to express statistical properties of machine learning, and that provides epistemic (resp. counterfactually epistemic) views on robustness (resp. fairness) of classifiers. Comment: SEFM'19 conference paper (full version with errors corrected)

arXiv.org e-Print Archive

Crossref