Synthetic Data Generation and Defense in Depth Measurement of Web Applications

A. Shiravi; C. Dwork; D. Mutz; F. Valeur; G.F. Cretu-Ciocarlie; H. Cavusoglu; K.L. Ingham; L. Sweeney; M. Tavallaee; N. Boggs

Synthetic Data Generation and Defense in Depth Measurement of Web Applications

Authors: A. Shiravi
C. Dwork
D. Mutz
F. Valeur
G.F. Cretu-Ciocarlie
H. Cavusoglu
K.L. Ingham
L. Sweeney
M. Tavallaee
N. Boggs
Publication date: 1 January 2014
Publisher: 'Columbia University Libraries/Information Services'
Doi

Abstract

Measuring security controls across multiple layers of defense requires realistic data sets and repeatable experiments. However, data sets that are collected from real users often cannot be freely exchanged due to privacy and regulatory concerns. Synthetic datasets, which can be shared, have in the past had critical flaws or at best been one time collections of data focusing on a single layer or type of data. We present a framework for generating synthetic datasets with normal and attack data for web applications across multiple layers simultaneously. The framework is modular and designed for data to be easily recreated in order to vary parameters and allow for inline testing. We build a prototype data generator using the framework to generate nine datasets with data logged on four layers: network, file accesses, system calls, and database simultaneously. We then test nineteen security controls spanning all four layers to determine their sensitivity to dataset changes, compare performance even across layers, compare synthetic data to real production data, and calculate combined defense in depth performance of sets of controls

Similar works

Full text

Open in the Core reader

Download PDF

Available Versions

Sustaining member

Columbia University Academic Commons

oai:academiccommons.columbia.e...

Last time updated on 02/10/2018

Crossref

info:doi/10.1007%2F978-3-319-1...

Last time updated on 22/03/2019