Certified Robustness of Nearest Neighbors against Data Poisoning and Backdoor Attacks
Data poisoning attacks and backdoor attacks aim to corrupt a machine learning
classifier by modifying, adding, and/or removing some carefully selected
training examples, such that the corrupted classifier makes incorrect
predictions as the attacker desires. The key idea of state-of-the-art certified
defenses against data poisoning attacks and backdoor attacks is to create a
majority vote mechanism to predict the label of a test example. Specifically,
each voter is a base classifier trained on a subset of the training dataset.
Classical simple learning algorithms such as k nearest neighbors (kNN) and
radius nearest neighbors (rNN) have intrinsic majority vote mechanisms. In this
work, we show that the intrinsic majority vote mechanisms in kNN and rNN
already provide certified robustness guarantees against data poisoning attacks
and backdoor attacks. Moreover, our evaluation results on MNIST and CIFAR10
show that the intrinsic certified robustness guarantees of kNN and rNN
outperform those provided by state-of-the-art certified defenses. Our results
serve as standard baselines for future certified defenses against data
poisoning attacks and backdoor attacks.
Comment: To appear in AAAI Conference on Artificial Intelligence, 202
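
To make the vote-margin intuition concrete, here is a minimal Python sketch (illustrative only, not the paper's exact bound): it predicts with a k-NN majority vote and certifies robustness under the simplifying assumption that each modified, added, or removed training example flips at most one of the k votes. The function name and synthetic data are ours, not the authors'.

```python
import numpy as np

def knn_vote_certificate(X_train, y_train, x_test, k=5):
    """k-NN majority-vote prediction plus a conservative poisoning
    certificate based on the vote margin. Simplifying assumption:
    each poisoned training example flips at most one of the k votes."""
    # Distances from the test point to every training point.
    dists = np.linalg.norm(X_train - x_test, axis=1)
    neighbors = np.argsort(dists)[:k]
    labels, counts = np.unique(y_train[neighbors], return_counts=True)
    order = np.argsort(counts)[::-1]
    top = counts[order[0]]
    runner_up = counts[order[1]] if len(counts) > 1 else 0
    # The winner survives while fewer than (top - runner_up) / 2 votes
    # are flipped, i.e. up to floor((top - runner_up - 1) / 2) of them.
    certified_size = max((top - runner_up - 1) // 2, 0)
    return labels[order[0]], certified_size

# Toy usage on synthetic 2-D data.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 2))
y = (X[:, 0] > 0).astype(int)
pred, cert = knn_vote_certificate(X, y, np.array([1.0, 0.0]), k=9)
print(pred, cert)  # predicted label and certified poisoning size
```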
Revisiting Adversarially Learned Injection Attacks Against Recommender Systems
Recommender systems play an important role in modern information and
e-commerce applications. While a growing body of research is dedicated to
improving the relevance and diversity of recommendations, the potential risks
of state-of-the-art recommendation models remain under-explored: these models
can be attacked by malicious third parties who inject fake user interactions
to achieve their own ends. This paper revisits
the adversarially-learned injection attack problem, where the injected fake
user `behaviors' are learned locally by the attackers with their own model --
one that is potentially different from the model under attack, but shares
similar properties to allow attack transfer. We find that most existing works
in the literature suffer from two major limitations: (1) they do not solve the
optimization problem exactly, making the attack less harmful than it could be;
and (2) they assume the attacker has perfect knowledge, leaving realistic
attack capabilities poorly understood. We demonstrate that exactly solving the
optimization problem of generating fake users can lead to a much larger
impact. Our experiments on a real-world dataset reveal important
properties of the attack, including attack transferability and its limitations.
These findings can inspire useful defensive methods against this class of
attacks.
Comment: Accepted at Recsys 2
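
As a concrete illustration of treating fake-user generation as an optimization problem, the following hypothetical PyTorch sketch relaxes the injected profiles to continuous values and optimizes them by gradient descent against a simple item-based co-occurrence surrogate. The surrogate, the synthetic data, and the budget penalty are assumptions for illustration; the paper's exact formulation, models, and dataset are not reproduced here.

```python
import torch

torch.manual_seed(0)
n_real, n_items, n_fake, target = 200, 50, 10, 7

# Synthetic implicit-feedback matrix standing in for the real dataset.
real = (torch.rand(n_real, n_items) < 0.1).float()

# Continuous relaxation of the injected fake-user profiles.
fake_logits = torch.zeros(n_fake, n_items, requires_grad=True)
opt = torch.optim.Adam([fake_logits], lr=0.05)
mask = 1.0 - torch.eye(n_items)  # drop item self-similarity

for step in range(300):
    fake = torch.sigmoid(fake_logits)        # profiles relaxed to [0, 1]
    profiles = torch.cat([real, fake], dim=0)
    # Item-based CF surrogate: co-occurrence similarity is recomputed on
    # the poisoned data, so fake users influence real users' scores.
    sim = (profiles.T @ profiles) * mask
    target_scores = real @ sim[:, target]    # real users' scores for the target
    # Maximize the target item's average score; the sum term is a crude
    # stand-in for the attacker's interaction budget.
    loss = -target_scores.mean() + 0.05 * fake.sum()
    opt.zero_grad(); loss.backward(); opt.step()

# Discretize the relaxation into concrete interactions to inject.
injected = (torch.sigmoid(fake_logits) > 0.5).int()
print(injected.sum(dim=1))  # interactions per fake user
```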
Attacking Recommender Systems with Augmented User Profiles
Recommendation Systems (RS) have become an essential part of many online
services. Due to their pivotal role in guiding customers toward purchases,
there is a natural incentive for unscrupulous parties to spoof RS for profit.
In this paper, we study the shilling attack: a long-standing and profitable attack
where an adversarial party injects a number of user profiles to promote or
demote a target item. Conventional shilling attack models either rely on
simple heuristics that are easily detected, or directly adopt adversarial
attack methods without any design specific to RS. Moreover, the literature
lacks studies of the attack impact on deep learning based RS, leaving the
effectiveness of shilling attacks against real RS in doubt. We present a novel
Augmented Shilling Attack framework (AUSH) and implement it based on the idea
of Generative Adversarial Networks (GANs). AUSH is capable of tailoring
attacks against RS
according to budget and complex attack goals, such as targeting a specific user
group. We experimentally show that the attack impact of AUSH is noticeable on a
wide range of RS including both classic and modern deep learning based RS,
while remaining virtually undetectable by the state-of-the-art attack
detection model.
Comment: CIKM 2020. 10 pages, 2 figures
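
For illustration, here is a minimal, hypothetical PyTorch sketch of the GAN idea behind such an attack: a generator rewrites sampled real profiles into fake ones, a discriminator tries to tell real from generated, and an extra shilling term pushes the target item's rating toward its maximum. The architecture, sizes, and synthetic data are our assumptions, not the actual AUSH design.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)
n_items, target, batch = 100, 3, 64

# Synthetic real profiles standing in for the actual rating data.
real_profiles = (torch.rand(512, n_items) < 0.1).float()

# Generator rewrites a sampled real "template" profile into a fake one;
# the discriminator tries to tell real profiles from generated ones.
G = nn.Sequential(nn.Linear(n_items, 128), nn.ReLU(),
                  nn.Linear(128, n_items), nn.Sigmoid())
D = nn.Sequential(nn.Linear(n_items, 128), nn.ReLU(),
                  nn.Linear(128, 1))

opt_g = torch.optim.Adam(G.parameters(), lr=1e-3)
opt_d = torch.optim.Adam(D.parameters(), lr=1e-3)
bce = nn.BCEWithLogitsLoss()

for step in range(200):
    idx = torch.randint(0, real_profiles.size(0), (batch,))
    templates = real_profiles[idx]
    fake = G(templates)
    # Discriminator step: separate real profiles from generated ones.
    d_loss = (bce(D(templates), torch.ones(batch, 1)) +
              bce(D(fake.detach()), torch.zeros(batch, 1)))
    opt_d.zero_grad(); d_loss.backward(); opt_d.step()
    # Generator step: look realistic to D while pushing the target
    # item's rating toward its maximum (the shilling objective).
    fake = G(templates)
    g_loss = bce(D(fake), torch.ones(batch, 1)) + (1.0 - fake[:, target]).mean()
    opt_g.zero_grad(); g_loss.backward(); opt_g.step()

# The trained generator now emits injectable fake user profiles.
fake_users = G(real_profiles[:10]).detach()
```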