Search CORE

577 research outputs found

A Game-Based Approximate Verification of Deep Neural Networks with Provable Guarantees

Author: Huang Xiaowei
Kwiatkowska Marta
Ruan Wenjie
Wicker Matthew
Wu Min
Publication venue: 'Elsevier BV'
Publication date: 01/01/2019
Field of study

Despite the improved accuracy of deep neural networks, the discovery of adversarial examples has raised serious safety concerns. In this paper, we study two variants of pointwise robustness, the maximum safe radius problem, which for a given input sample computes the minimum distance to an adversarial example, and the feature robustness problem, which aims to quantify the robustness of individual features to adversarial perturbations. We demonstrate that, under the assumption of Lipschitz continuity, both problems can be approximated using finite optimisation by discretising the input space, and the approximation has provable guarantees, i.e., the error is bounded. We then show that the resulting optimisation problems can be reduced to the solution of two-player turn-based games, where the first player selects features and the second perturbs the image within the feature. While the second player aims to minimise the distance to an adversarial example, depending on the optimisation objective the first player can be cooperative or competitive. We employ an anytime approach to solve the games, in the sense of approximating the value of a game by monotonically improving its upper and lower bounds. The Monte Carlo tree search algorithm is applied to compute upper bounds for both games, and the Admissible A* and the Alpha-Beta Pruning algorithms are, respectively, used to compute lower bounds for the maximum safety radius and feature robustness games. When working on the upper bound of the maximum safe radius problem, our tool demonstrates competitive performance against existing adversarial example crafting algorithms. Furthermore, we show how our framework can be deployed to evaluate pointwise robustness of neural networks in safety-critical applications such as traffic sign recognition in self-driving cars

arXiv.org e-Print Archive

University of Liverpool Repository

Oxford University Research Archive

Lancaster E-Prints

When to trust AI: advances and challenges for certification of neural networks

Author: Kwiatkowska Marta
Zhang Xiyue
Publication venue: Polish Information Processing Society
Publication date: 26/09/2023
Field of study

Artificial intelligence (AI) has been advancing at a fast pace and it is now poised for deployment in a wide range of applications, such as autonomous systems, medical diagnosis and natural language processing. Early adoption of AI technology for real-world applications has not been without problems, particularly for neural networks, which may be unstable and susceptible to adversarial examples. In the longer term, appropriate safety assurance techniques need to be developed to reduce potential harm due to avoidable system failures and ensure trustworthiness. Focusing on certification and explainability, this paper provides an overview of techniques that have been developed to ensure safety of AI decisions and discusses future challenges

Oxford University Research Archive

When to Trust AI: Advances and Challenges for Certification of Neural Networks

Author: Kwiatkowska Marta
Zhang Xiyue
Publication venue
Publication date: 20/09/2023
Field of study

arXiv.org e-Print Archive

What, Indeed, is an Achievable Provable Guarantee for Learning-Enabled Safety Critical Systems

Author: Bensalem Saddek
Cheng Chih-Hong
Huang Wei
Huang Xiaowei
Wu Changshun
Zhao Xingyu
Publication venue
Publication date: 20/07/2023
Field of study

Machine learning has made remarkable advancements, but confidently utilising learning-enabled components in safety-critical domains still poses challenges. Among the challenges, it is known that a rigorous, yet practical, way of achieving safety guarantees is one of the most prominent. In this paper, we first discuss the engineering and research challenges associated with the design and verification of such systems. Then, based on the observation that existing works cannot actually achieve provable guarantees, we promote a two-step verification method for the ultimate achievement of provable statistical guarantees

arXiv.org e-Print Archive