Search CORE

5 research outputs found

Security Analysis of MVhash-B Similarity Hashing

Author: Chang Donghoon
Sanadhya Somitra
Singh Monika
Publication venue: (Print) 1558-7215
Publication date: 01/01/2016
Field of study

In the era of big data, the volume of digital data is increasing rapidly, causing new challenges for investigators to examine the same in a reasonable amount of time. A major requirement of modern forensic investigation is the ability to perform automatic filtering of correlated data, and thereby reducing and focusing the manual effort of the investigator. Approximate matching is a technique to find “closeness” between two digital artifacts. mvHash-B is a well-known approximate matching scheme used for finding similarity between two digital objects and produces a ‘score of similarity’ on a scale of 0 to 100. However, no security analysis of mvHash-B is available in the literature. In this work, we perform the first academic security analysis of this algorithm and show that it is possible for an attacker to “fool” it by causing the similarity score to be close to zero even when the objects are very similar. By similarity of the objects, we mean semantic similarity for text and visual match for images. The designers of mvHash-B had claimed that the scheme is secure against ‘active manipulation’. We contest this claim in this work. We propose an algorithm that starts with a given document and produces another one of the same size without influencing its semantic and visual meaning (for text and image files, respectively) but which has low similarity score as measured by mvHash-B. In our experiments, we show that the similarity score can be reduced from 100 to less than 6 for text and image documents. We performed experiments with 50 text files and 200 images and the average similarity score between the original file and the file produced by our algorithm was found to be 4 for text files and 6 for image files. In fact, if the original file size is small then the similarity score between the two files was close to 0, almost always. To improve the security of mvHash-B against active adversaries, we propose a modification in the scheme. We show that the modification prevents the attack we describe in this work

Crossref

Embry-Riddle Aeronautical University

Author
Publication venue: (Print) 1558-7215
Publication date: 01/01/2016
Field of study

Embry-Riddle Aeronautical University

Back Matter

Author
Publication venue: (Print) 1558-7215
Publication date: 30/06/2016
Field of study

Embry-Riddle Aeronautical University

Front Matter

Author
Publication venue: (Print) 1558-7215
Publication date: 30/06/2016
Field of study

Embry-Riddle Aeronautical University