5 research outputs found
Locality-Sensitive Hashing Does Not Guarantee Privacy! Attacks on Google's FLoC and the MinHash Hierarchy System
Recently proposed systems aim at achieving privacy using locality-sensitive
hashing. We show how these approaches fail by presenting attacks against two
such systems: Google's FLoC proposal for privacy-preserving targeted
advertising and the MinHash Hierarchy, a system for processing mobile users'
traffic behavior in a privacy-preserving way. Our attacks refute the pre-image
resistance, anonymity, and privacy guarantees claimed for these systems.
In the case of FLoC, we show how to deanonymize users using Sybil attacks and
to reconstruct 10% or more of the browsing history for 30% of its users using
Generative Adversarial Networks. We achieve this only analyzing the hashes used
by FLoC. For MinHash, we precisely identify the movement of a subset of
individuals and, on average, we can limit users' movement to just 10% of the
possible geographic area, again using just the hashes. In addition, we refute
their differential privacy claims.Comment: 14 pages, 9 figures submitted to PETS 202
Distributed Mining of Popular Paths in Road Networks
International audienceWe consider the problem of finding large scale mobility patterns. A common challenge in mobility tracking systems is that large quantity of data is spread out spatially and temporally across many tracking sensors. We thus devise a spatial sampling and information exchange protocol that provides probabilistic guarantees on detecting prominent patterns. For this purpose, we define a general notion of significant popular paths that can capture many different types of motion. We design a summary sketch for the data at each tracking node, which can be updated efficiently, and then aggregated across devices to reconstruct the prominent paths in the global data. The algorithm is scalable, even with large number of mobile targets. It uses a hierarchic query system that automatically prioritizes important trajectories – those that are long and popular. We show further that this scheme can in fact give good results by sampling relatively few sensors and targets, and works for streaming spatial data. We prove differential privacy guarantees for the randomized algorithm. Extensive experiments on real GPS data show that the method is efficient and accurate, and is useful in predicting motion of travelers even with small samples
Proceedings / 6th International Symposium of Industrial Engineering - SIE 2015, 24th-25th September, 2015, Belgrade
editors Vesna Spasojević-Brkić, Mirjana Misita, Dragan D. Milanovi
Proceedings / 6th International Symposium of Industrial Engineering - SIE 2015, 24th-25th September, 2015, Belgrade
editors Vesna Spasojević-Brkić, Mirjana Misita, Dragan D. Milanovi