Search CORE

24,849 research outputs found

Ethical Challenges in Data-Driven Dialogue Systems

Author: Angelard-Gontier Nicolas
Fried Genevieve
Henderson Peter
Ke Nan Rosemary
Lowe Ryan
Pineau Joelle
Sinha Koustuv
Publication venue
Publication date: 24/11/2017
Field of study

The use of dialogue systems as a medium for human-machine interaction is an increasingly prevalent paradigm. A growing number of dialogue systems use conversation strategies that are learned from large datasets. There are well documented instances where interactions with these system have resulted in biased or even offensive conversations due to the data-driven training process. Here, we highlight potential ethical issues that arise in dialogue systems research, including: implicit biases in data-driven systems, the rise of adversarial examples, potential sources of privacy violations, safety concerns, special considerations for reinforcement learning systems, and reproducibility concerns. We also suggest areas stemming from these issues that deserve further investigation. Through this initial survey, we hope to spur research leading to robust, safe, and ethically sound dialogue systems.Comment: In Submission to the AAAI/ACM conference on Artificial Intelligence, Ethics, and Societ

arXiv.org e-Print Archive

Crossref

PolyPublie

An Automated Social Graph De-anonymization Technique

Author: Criminisi A.
Dwork C.
Ho T. K.
Narayanan A.
Publication venue
Publication date: 07/08/2014
Field of study

We present a generic and automated approach to re-identifying nodes in anonymized social networks which enables novel anonymization techniques to be quickly evaluated. It uses machine learning (decision forests) to matching pairs of nodes in disparate anonymized sub-graphs. The technique uncovers artefacts and invariants of any black-box anonymization scheme from a small set of examples. Despite a high degree of automation, classification succeeds with significant true positive rates even when small false positive rates are sought. Our evaluation uses publicly available real world datasets to study the performance of our approach against real-world anonymization strategies, namely the schemes used to protect datasets of The Data for Development (D4D) Challenge. We show that the technique is effective even when only small numbers of samples are used for training. Further, since it detects weaknesses in the black-box anonymization scheme it can re-identify nodes in one social network when trained on another.Comment: 12 page

arXiv.org e-Print Archive

CiteSeerX

Crossref