Search CORE

50 research outputs found

Classical Structured Prediction Losses for Sequence to Sequence Learning

Author: Auli Michael
Edunov Sergey
Grangier David
Ott Myle
Ranzato Marc'Aurelio
Publication venue
Publication date: 01/01/2018
Field of study

There has been much recent work on training neural attention models at the sequence-level using either reinforcement learning-style methods or by optimizing the beam. In this paper, we survey a range of classical objective functions that have been widely used to train linear models for structured prediction and apply them to neural sequence to sequence models. Our experiments show that these losses can perform surprisingly well by slightly outperforming beam search optimization in a like for like setup. We also report new state of the art results on both IWSLT'14 German-English translation as well as Gigaword abstractive summarization. On the larger WMT'14 English-French translation task, sequence-level training achieves 41.5 BLEU which is on par with the state of the art.Comment: 10 pages, NAACL 201

arXiv.org e-Print Archive

Crossref

On Human Predictions with Explanations and Predictions of Machine Learning Models: A Case Study on Deception Detection

Author: Akoglu Leman
Feng Song
Feng Vanessa Wei
Gyöngyi Zoltán
Hardt Moritz
Kim Been
Kim Been
Kleinberg Jon
Kleinberg Jon
Krauss Robert M
Lundberg Scott M
Ott Myle
Ott Myle
Ribeiro Marco Tulio
Singla Adish
Zhu Xiaojin
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 08/01/2019
Field of study

Humans are the final decision makers in critical tasks that involve ethical and legal concerns, ranging from recidivism prediction, to medical diagnosis, to fighting against fake news. Although machine learning models can sometimes achieve impressive performance in these tasks, these tasks are not amenable to full automation. To realize the potential of machine learning for improving human decisions, it is important to understand how assistance from machine learning models affects human performance and human agency. In this paper, we use deception detection as a testbed and investigate how we can harness explanations and predictions of machine learning models to improve human performance while retaining human agency. We propose a spectrum between full human agency and full automation, and develop varying levels of machine assistance along the spectrum that gradually increase the influence of machine predictions. We find that without showing predicted labels, explanations alone slightly improve human performance in the end task. In comparison, human performance is greatly improved by showing predicted labels (>20% relative improvement) and can be further improved by explicitly suggesting strong machine performance. Interestingly, when predicted labels are shown, explanations of machine predictions induce a similar level of accuracy as an explicit statement of strong machine performance. Our results demonstrate a tradeoff between human performance and human agency and show that explanations of machine predictions can moderate this tradeoff.Comment: 17 pages, 19 figures, in Proceedings of ACM FAT* 2019, dataset & demo available at https://deception.machineintheloop.co

arXiv.org e-Print Archive

Crossref

Search Rank Fraud De-Anonymization in Online Systems

Author: Akoglu Leman
Akoglu Leman
Fei Geli
Kaghazgaran Parisa
Karger David R
Mukherjee Arjun
Ott Myle
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 23/06/2018
Field of study

We introduce the fraud de-anonymization problem, that goes beyond fraud detection, to unmask the human masterminds responsible for posting search rank fraud in online systems. We collect and study search rank fraud data from Upwork, and survey the capabilities and behaviors of 58 search rank fraudsters recruited from 6 crowdsourcing sites. We propose Dolos, a fraud de-anonymization system that leverages traits and behaviors extracted from these studies, to attribute detected fraud to crowdsourcing site fraudsters, thus to real identities and bank accounts. We introduce MCDense, a min-cut dense component detection algorithm to uncover groups of user accounts controlled by different fraudsters, and leverage stylometry and deep learning to attribute them to crowdsourcing site profiles. Dolos correctly identified the owners of 95% of fraudster-controlled communities, and uncovered fraudsters who promoted as many as 97.5% of fraud apps we collected from Google Play. When evaluated on 13,087 apps (820,760 reviews), which we monitored over more than 6 months, Dolos identified 1,056 apps with suspicious reviewer groups. We report orthogonal evidence of their fraud, including fraud duplicates and fraud re-posts.Comment: The 29Th ACM Conference on Hypertext and Social Media, July 201

arXiv.org e-Print Archive

Crossref