Automated Crowdturfing Attacks and Defenses in Online Review Systems
Malicious crowdsourcing forums are gaining traction as a means of spreading
misinformation online, but are limited by the costs of hiring and managing
human workers. In this paper, we identify a new class of attacks that leverage
deep learning language models (Recurrent Neural Networks or RNNs) to automate
the generation of fake online reviews for products and services. Not only are
these attacks cheap and therefore more scalable, but they can control the rate of
content output to eliminate the signature burstiness that makes crowdsourced
campaigns easy to detect.
Using Yelp reviews as an example platform, we show how a two-phase review
generation and customization attack can produce reviews that are
indistinguishable from real reviews by state-of-the-art statistical detectors. We conduct a
survey-based user study to show these reviews not only evade human detection,
but also score high on "usefulness" metrics by users. Finally, we develop novel
automated defenses against these attacks, by leveraging the lossy
transformation introduced by the RNN training and generation cycle. We consider
countermeasures against our mechanisms, show that they produce unattractive
cost-benefit tradeoffs for attackers, and that they can be further curtailed by
simple constraints imposed by online service providers.
Resolving X-Ray Photoelectron Spectra of Ionic Liquids with Difference Spectroscopy
X-ray photoelectron spectroscopy (XPS) is a powerful element-specific technique for determining the composition and chemical state of all elements in an involatile sample. However, for elements such as carbon, the wide variety of chemical states produces complex spectra that are difficult to interpret, concealing important information behind uncertainty in signal identity. Here we report a process whereby chemical modification of carbon structures with electron-withdrawing groups can reveal this information, providing accurate, highly refined fitting models far more complex than previously possible. This method is demonstrated with functionalised ionic liquids bearing chlorine or trifluoromethyl groups that shift electron density away from targeted locations. By comparing the C 1s spectra of non-functionalised ionic liquids to their functionalised analogues, a series of difference spectra can be produced to identify the exact binding energies of carbon photoemissions, which can in turn improve the C 1s peak fitting of both samples. Importantly, ionic liquids possess chemical and physical properties that make them well suited to this methodology, enabling significant progress in XPS peak fitting and data interpretation.
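The difference-spectrum idea described above can be illustrated numerically: subtracting the parent spectrum from the functionalised one cancels the unchanged carbon environments and leaves a negative lobe at the removed binding energy and a positive lobe at the shifted one. This is only a toy sketch with made-up Gaussian peaks, binding energies, and widths, not the paper's data or fitting procedure.

```python
# Toy sketch of difference spectroscopy: two synthetic C 1s spectra built from
# Gaussian peaks (all centers, heights, and widths are illustrative assumptions).
import numpy as np

def gaussian(x, center, height, width=0.5):
    """A single synthetic photoemission peak."""
    return height * np.exp(-((x - center) ** 2) / (2 * width ** 2))

be = np.linspace(283, 295, 1200)  # binding-energy axis (eV)

# Parent (non-functionalised) analogue: two overlapping carbon environments.
parent = gaussian(be, 285.0, 1.0) + gaussian(be, 286.5, 0.8)
# Functionalised analogue: the electron-withdrawing group shifts one environment.
functional = gaussian(be, 285.0, 1.0) + gaussian(be, 287.6, 0.8)

# Unchanged peaks cancel in the difference; the lobes mark the moved peak.
difference = functional - parent
lost_be = be[np.argmin(difference)]    # binding energy removed by functionalisation
gained_be = be[np.argmax(difference)]  # new, shifted binding energy
```

Both lobe positions can then seed a refined peak-fitting model for each spectrum, which is the essence of the reported method.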
Evaluating Conversational Recommender Systems via User Simulation
Conversational information access is an emerging research area. Currently,
human evaluation is used for end-to-end system evaluation, which is both
time- and resource-intensive at scale, and thus becomes a bottleneck for
progress. As an alternative, we propose automated evaluation by means of
simulating users. Our user simulator aims to generate responses that a real
human would give by considering both individual preferences and the general
flow of interaction with the system. We evaluate our simulation approach on an
item recommendation task by comparing three existing conversational recommender
systems. We show that preference modeling and task-specific interaction models
both contribute to more realistic simulations, and can help achieve high
correlation between automatic evaluation measures and manual human assessments.
Comment: Proceedings of the 26th ACM SIGKDD Conference on Knowledge Discovery
and Data Mining (KDD '20), 2020
Estimating Error and Bias in Offline Evaluation Results
Offline evaluations of recommender systems attempt to estimate users' satisfaction with recommendations using static data from prior user interactions. These evaluations provide researchers and developers with first approximations of the likely performance of a new system and help weed out bad ideas before presenting them to users. However, offline evaluation cannot accurately assess novel, relevant recommendations, because the most novel items were previously unknown to the user, so they are missing from the historical data and cannot be judged as relevant.
We present a simulation study to estimate the error that such missing data causes in commonly used evaluation metrics, in order to assess its prevalence and impact. We find that missing data in the rating or observation process causes the evaluation protocol to systematically mis-estimate metric values, and in some cases to erroneously conclude that a popularity-based recommender outperforms even a perfect personalized recommender. Substantial breakthroughs in recommendation quality, therefore, will be difficult to assess with existing offline techniques.
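The observation bias described above can be illustrated with a toy simulation (a sketch in the spirit of the study, not its protocol: the user counts, the Zipf-like exposure model, and the recall metric are all illustrative assumptions). Relevant but unpopular items go missing from the observed log, so offline recall computed against that log over-credits a popularity recommender relative to its true performance.

```python
# Toy simulation of popularity-biased observation in offline evaluation.
# All parameters and distributions are illustrative, not the paper's setup.
import random

random.seed(0)
N_USERS, N_ITEMS, K = 200, 50, 10
popularity = [1.0 / (rank + 1) for rank in range(N_ITEMS)]  # Zipf-like exposure

# True relevance: each user likes 15 random items.
truth = [set(random.sample(range(N_ITEMS), 15)) for _ in range(N_USERS)]

# Observed relevance: a liked item enters the log with probability proportional
# to its popularity, so unpopular relevant items are missing from the data.
observed = [{i for i in liked if random.random() < popularity[i] * 3}
            for liked in truth]

def recall_at_k(recommend, ground_truth):
    """Mean recall@K of a recommender against a given relevance judgment."""
    total = sum(
        len(set(recommend(u)[:K]) & ground_truth[u]) / max(len(ground_truth[u]), 1)
        for u in range(N_USERS))
    return total / N_USERS

pop_rec = lambda u: list(range(N_ITEMS))  # most popular items first
perfect_rec = lambda u: sorted(truth[u]) + [i for i in range(N_ITEMS)
                                            if i not in truth[u]]

# Offline (observed-log) evaluation vs the unobservable true evaluation.
offline_pop, true_pop = recall_at_k(pop_rec, observed), recall_at_k(pop_rec, truth)
offline_perf, true_perf = recall_at_k(perfect_rec, observed), recall_at_k(perfect_rec, truth)
```

Comparing `offline_pop` with `true_pop` shows the popularity recommender's measured recall inflated well above its true value, which is the systematic mis-estimation the abstract warns about.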
Early findings from a large-scale user study of CHESTNUT: Validations and implications
Towards a serendipitous recommender system with user-centred understanding, we have built CHESTNUT, an information-theory-based movie recommender system that introduces a more comprehensive understanding of the concept. Although offline evaluations have already demonstrated that CHESTNUT greatly improves serendipity performance, feedback on CHESTNUT from real-world users through online services has remained unclear. To evaluate how serendipitous CHESTNUT's results are in practice, we designed, organized, and conducted a large-scale user study involving 104 participants from 10 campuses in 3 countries. Our preliminary feedback shows that, compared with mainstream collaborative filtering techniques, CHESTNUT limited users' feelings of unexpectedness to some extent, but significantly improved their sense that recommendations were both beneficial and interesting, which substantially increased their experience of serendipity. Based on these findings, we summarize three key takeaways that, from our perspective, will benefit the further design and engineering of serendipitous recommender systems. All details of our large-scale user study can be found at https://github.com/unnc-idl-ucc/Early-Lessons-From-CHESTNU
Nobody cares if you liked Star Wars: KNN graph construction on the cheap
K-Nearest-Neighbors (KNN) graphs play a key role in a large range of applications. A KNN graph typically connects entities characterized by a set of features so that each entity becomes linked to its k most similar counterparts according to some similarity function. As datasets grow, KNN graphs are unfortunately becoming increasingly costly to construct, and the general approach, which consists of reducing the number of comparisons between entities, seems to have reached its full potential. In this paper we propose to overcome this limit with a simple yet powerful strategy that samples the set of features of each entity and only keeps the least popular features. We show that this strategy outperforms other, more straightforward policies on four representative datasets: for instance, keeping the 25 least popular items reduces computational time by up to 63%, while producing a KNN graph close to the ideal one.
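The sampling strategy described above can be sketched in a few lines: rank every feature by its global popularity, truncate each entity's profile to its least popular features, and build the KNN graph on the truncated profiles. This is a minimal illustration assuming set-valued profiles and Jaccard similarity; the profile data, function names, and brute-force neighbor search are all illustrative, not the paper's implementation.

```python
# Minimal sketch of "keep the least popular features" KNN graph construction.
# Profiles, similarity choice, and brute-force search are illustrative only.
from collections import Counter

def sample_least_popular(profiles, keep):
    """Truncate each entity's feature set to its `keep` least popular features."""
    popularity = Counter(f for feats in profiles.values() for f in feats)
    return {e: set(sorted(feats, key=lambda f: popularity[f])[:keep])
            for e, feats in profiles.items()}

def jaccard(a, b):
    """Jaccard similarity between two feature sets."""
    return len(a & b) / len(a | b) if a | b else 0.0

def knn_graph(profiles, k):
    """Brute-force KNN graph: link each entity to its k most similar others."""
    graph = {}
    for e, feats in profiles.items():
        sims = [(jaccard(feats, other), o)
                for o, other in profiles.items() if o != e]
        graph[e] = [o for _, o in sorted(sims, reverse=True)[:k]]
    return graph

# Tiny illustrative dataset: users profiled by the movies they liked.
profiles = {
    "alice": {"star_wars", "dune", "blade_runner"},
    "bob":   {"star_wars", "dune", "alien"},
    "carol": {"star_wars", "amelie"},
    "dave":  {"amelie", "dune"},
}
sampled = sample_least_popular(profiles, keep=2)  # rare tastes survive sampling
graph = knn_graph(sampled, k=2)
```

The intuition matching the title: widely shared features ("you liked Star Wars") discriminate poorly between entities, so dropping them shrinks the profiles, and hence the cost of each comparison, while preserving the rare features that actually determine who the nearest neighbors are.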