15,803 research outputs found

A novel privacy preserving user identification approach for network traffic

Author: Ahmed, Al-Bataineh, Alexa, Androulidakis, Aupy, Bolzoni, Braga, Canini, Cho, Chun, Cisco, Cisco, Claffy, Claise, Clarke, Cohen, Conti, Crotti, Debar, Deutschmann, Dharmapurikar, Dharmapurikar, Duffield, Duffield, Estan, F. Li, FBI, FBI, Fridman, He, Hofstede, Hofstede, Internetlivestats, Iranmanesh, Jadidi, Jain, Juniper, Jurga, Kim, Li, Liu, Lu, Mahoney, MathWorks, MIT, Muraleedharan, N. Clarke, NIKSUN, Pao, RSA, S. Furnell, Saevanee, Sangster, Sflow, Sibai, SimpleWeb, Smallwood, Smith, Song, Song, Song, Sourdis, Sun, Svozil, Tegeler, UNB, Verizon, Vezzani, Wang, Wang, Wang, Winter, Wireshark, Xplico, Yu, Zanero
Publication venue: 'Elsevier BV'
Publication date: 01/01/2017

The prevalence of the Internet and cloud-based applications, alongside the technological evolution of smartphones, tablets and smartwatches, has resulted in users relying upon network connectivity more than ever before. The consequence is an increasingly voluminous network traffic footprint. For network forensic examiners, this traffic represents a vital source of independent evidence in an environment where anti-forensics is increasingly challenging the validity of computer-based forensics. Network forensics today largely relies on analysis of the Internet Protocol (IP) address, as this is the only characteristic available. More typically, however, investigators are not interested in the IP address itself but in the associated user (whose account might have been compromised). Given the range of devices (e.g., laptop, mobile, and tablet) that a user might be using and the widespread use of DHCP, the IP address is not a reliable and consistent means of attributing traffic to a user. This paper presents a novel approach to the identification of users from network traffic using only the meta-data of the traffic (i.e., rather than the payload) and the creation of application-level user interactions, which are proven to provide a far richer discriminatory feature set and so enable more reliable identity verification. A study involving data collected from 46 users over a two-month period, which generated over 112 GB of meta-data traffic, was undertaken to examine the novel user-interaction-based feature extraction algorithm. On an individual application basis, the approach can achieve recognition rates of 90%, with some users experiencing recognition performance of 100%. The consequence of this recognition is an enormous reduction in the volume of traffic an investigator has to analyse, allowing them to focus upon a particular suspect or enabling them to disregard that traffic and focus upon what is left.

Crossref

Repository@Nottingham

Plymouth Electronic Archive and Research Library

Portsmouth University Research Portal (Pure)

Research Online @ ECU

Profiling user activities with minimal traffic traces

Author: J Zhang, L Sweeney, T Fawcett, TTT Nguyen
Publication date: 07/04/2015

Understanding user behavior is essential to personalize and enrich a user's online experience. While there are significant benefits to be accrued from personalized services based on fine-grained behavioral analysis, care must be taken to address user privacy concerns. In this paper, we consider the use of web traces with truncated URLs (each URL is trimmed to contain only the web domain) for this purpose. While such truncation removes the fine-grained sensitive information, it also strips the data of many features that are crucial to the profiling of user activity. We show how to overcome this severe handicap and filter out, with high accuracy, the URLs representing a user activity from the noisy network traffic trace (including advertisements, spam, analytics, and web scripts). This activity profiling with truncated URLs enables network operators to provide personalized services while mitigating privacy concerns by storing and sharing only truncated traffic traces. To offset the accuracy loss due to truncation, our statistical methodology leverages specialized features extracted from a group of consecutive URLs that represent a micro user action, such as a web click or a chat reply, which we call bursts. These bursts, in turn, are detected by a novel algorithm based on our observed characteristics of the inter-arrival time of HTTP records. We present an extensive experimental evaluation on a real dataset of mobile web traces, consisting of more than 130 million records and representing the browsing activities of 10,000 users over a period of 30 days. Our results show that the proposed methodology achieves around 90% accuracy in segregating URLs representing user activities from non-representative URLs.
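The two ideas named in this abstract, URL truncation and bursts, can be illustrated with a minimal Python sketch. It is not the authors' algorithm: the abstract does not give the detection rule, so the fixed inter-arrival threshold `max_gap`, the function names `truncate_to_domain` and `split_into_bursts`, and the use of the full hostname as the "web domain" are all assumptions made here for illustration.

```python
from urllib.parse import urlsplit


def truncate_to_domain(url: str) -> str:
    """Keep only the host part of a URL, dropping path, query and fragment.

    Assumption: "web domain" is taken to mean the full hostname.
    """
    parts = urlsplit(url)
    if not parts.netloc:
        # urlsplit only recognises a host after '//', so retry with a prefix
        # for scheme-less inputs such as "cdn.example.org/app.js".
        parts = urlsplit("//" + url)
    return (parts.hostname or "").lower()


def split_into_bursts(records, max_gap=1.0):
    """Group (timestamp_seconds, url) records into bursts.

    A new burst starts whenever the gap to the previous record exceeds
    max_gap seconds. The fixed threshold is a hypothetical stand-in for the
    inter-arrival-time analysis mentioned in the abstract.
    """
    bursts = []
    previous_time = None
    for timestamp, url in sorted(records):
        if previous_time is None or timestamp - previous_time > max_gap:
            bursts.append([])  # gap too large: open a new burst
        bursts[-1].append((timestamp, truncate_to_domain(url)))
        previous_time = timestamp
    return bursts
```

Given records at 0.0 s, 0.2 s and 0.5 s followed by one at 5.0 s, this sketch would return two bursts with `max_gap=1.0`, each holding only timestamps and hostnames. Features for classifying a burst as user activity or noise would then be computed per burst; that step is not sketched because the abstract does not describe it.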

arXiv.org e-Print Archive

Crossref