LANCE: Stress-testing Visual Models by Generating Language-guided
  Counterfactual Images

Chattopadhyay, Prithvijit; Hoffman, Judy; Prabhu, Viraj; Yenamandra, Sriram

Authors: Prithvijit Chattopadhyay, Judy Hoffman, Viraj Prabhu, Sriram Yenamandra
Publication date: 30 May 2023

Abstract

We propose an automated algorithm to stress-test a trained visual model by generating language-guided counterfactual test images (LANCE). Our method leverages recent progress in large language modeling and text-based image editing to augment an IID test set with a suite of diverse, realistic, and challenging test images without altering model weights. We benchmark the performance of a diverse set of pretrained models on our generated data and observe significant and consistent performance drops. We further analyze model sensitivity across different types of edits, and demonstrate its applicability at surfacing previously unknown class-level model biases in ImageNet.

Comment: Project webpage: https://virajprabhu.github.io/lance-web
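
The benchmarking idea in the abstract (compare a fixed model's accuracy on original test images with its accuracy on their counterfactual edits, broken down by edit type) can be illustrated with a minimal sketch. This is not the authors' implementation: the record fields, the function names, the edit-type labels, and the toy predictions below are all hypothetical and exist only to show the bookkeeping.

```python
from collections import defaultdict


def accuracy(preds, labels):
    """Fraction of predictions that match the ground-truth labels."""
    if not labels:
        raise ValueError("labels must be non-empty")
    return sum(p == y for p, y in zip(preds, labels)) / len(labels)


def accuracy_drop_by_edit_type(records):
    """Return {edit_type: original accuracy - counterfactual accuracy}.

    Each record is a hypothetical dict with the keys
    'edit_type', 'label', 'pred_original', 'pred_counterfactual'.
    The model weights are never touched; only its predictions are compared.
    """
    groups = defaultdict(list)
    for record in records:
        groups[record["edit_type"]].append(record)

    drops = {}
    for edit_type, group in groups.items():
        labels = [r["label"] for r in group]
        acc_original = accuracy([r["pred_original"] for r in group], labels)
        acc_counterfactual = accuracy(
            [r["pred_counterfactual"] for r in group], labels
        )
        drops[edit_type] = acc_original - acc_counterfactual
    return drops


# Toy, made-up predictions (illustrative only, not results from the paper).
records = [
    {"edit_type": "background", "label": "dog",
     "pred_original": "dog", "pred_counterfactual": "wolf"},
    {"edit_type": "background", "label": "cat",
     "pred_original": "cat", "pred_counterfactual": "cat"},
    {"edit_type": "adjective", "label": "car",
     "pred_original": "car", "pred_counterfactual": "car"},
    {"edit_type": "adjective", "label": "bus",
     "pred_original": "truck", "pred_counterfactual": "truck"},
]

drops = accuracy_drop_by_edit_type(records)
for edit_type, drop in sorted(drops.items()):
    print(f"{edit_type}: accuracy drop = {drop:.2f}")
```

A larger drop for one edit type than another would suggest the model is more sensitive to that kind of change; generating the counterfactual images themselves (the language-model and image-editing steps) is outside the scope of this sketch.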

Available Versions

arXiv.org e-Print Archive

oai:arXiv.org:2305.19164

Last time updated on 02/06/2023