A Fuzzy Hashing Approach Based on Random Sequences and Hamming Distance
Hash functions are well-known methods in computer science that map arbitrarily large input to bit strings of fixed length, which serve as unique input identifiers/fingerprints. A key property of cryptographic hash functions is that even if only one bit of the input is changed, the output behaves pseudo-randomly, and therefore similar files cannot be identified. However, in the area of computer forensics it is also necessary to find similar files (e.g., different versions of a file), which is why we need a similarity-preserving hash function, also called a fuzzy hash function. In this paper we present a new approach to fuzzy hashing called bbHash. It is based on the idea of 'rebuilding' an input as well as possible using a fixed set of randomly chosen byte sequences, called building blocks, of byte length l (e.g., l = 128). The procedure is as follows: slide through the input byte by byte, read out the current input byte sequence of length l, and compute the Hamming distances of all building blocks against the current input byte sequence. Each building block with a Hamming distance smaller than a certain threshold contributes to the file's bbHash. We discuss the advantages and disadvantages of bbHash compared to other fuzzy hashing approaches. A key property of bbHash is that it is the first fuzzy hashing approach based on a comparison against external data structures.
Keywords: Fuzzy hashing, similarity preserving hash function, similarity digests, Hamming distance, computer forensics
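The sliding-window procedure described in the abstract above can be sketched in a few lines. This is a minimal illustration of the idea, not the authors' bbHash implementation; the number of building blocks, the block length l, and the distance threshold are assumed values.

```python
# Minimal sketch of the bbHash idea: slide over the input byte by byte, compare
# each window of length l against a fixed set of random "building blocks" via
# Hamming distance, and record every block whose distance is below a threshold.
# NUM_BLOCKS, L, and THRESHOLD are illustrative assumptions, not the paper's values.
import random

L = 128                 # building-block length in bytes (example value from the abstract)
NUM_BLOCKS = 16         # number of random building blocks (assumed)
THRESHOLD = 350         # maximum Hamming distance in bits for a "hit" (assumed)

random.seed(42)         # fixed seed so every party derives the same building blocks
BUILDING_BLOCKS = [bytes(random.getrandbits(8) for _ in range(L))
                   for _ in range(NUM_BLOCKS)]

def hamming_bits(a: bytes, b: bytes) -> int:
    """Number of differing bits between two equal-length byte strings."""
    return sum(bin(x ^ y).count("1") for x, y in zip(a, b))

def bbhash(data: bytes) -> list[int]:
    """Return the sequence of building-block indices that matched some window."""
    digest = []
    for offset in range(len(data) - L + 1):          # slide byte by byte
        window = data[offset:offset + L]
        for idx, block in enumerate(BUILDING_BLOCKS):
            if hamming_bits(window, block) < THRESHOLD:
                digest.append(idx)                   # this block contributes to the bbHash
    return digest
```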
An Efficient Similarity Digests Database Lookup -- a Logarithmic Divide and Conquer Approach
Investigating seized devices within digital forensics represents a challenging task due to the increasing amount of data. Common procedures utilize automated file identification, which reduces the amount of data an investigator has to examine manually. In recent years the research field of approximate matching has arisen to detect similar data. However, if n denotes the number of similarity digests in a database, then the lookup for a single similarity digest has complexity O(n). This paper presents a concept to extend existing approximate matching algorithms that reduces the lookup complexity from O(n) to O(log(n)). Our proposed approach is based on the well-known divide-and-conquer paradigm and builds a Bloom filter-based tree data structure in order to enable an efficient lookup of similarity digests. Further, it is demonstrated that the presented technique is highly scalable, operating a trade-off between storage requirements and computational efficiency. We perform a theoretical assessment based on recently published results and reasonable magnitudes of input data, and show that the complexity reduction achieved by the proposed technique yields a 2^20-fold acceleration of lookup costs.
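The divide-and-conquer lookup can be illustrated with a simplified Bloom-filter tree: every node stores a Bloom filter over all digests in its subtree, so a query only descends into subtrees whose filter reports a possible match. The sketch below treats digests as exact items rather than fuzzy features, and the filter size, hash count, and use of SHA-256 are assumptions, not the paper's parameters.

```python
# Sketch of a Bloom-filter tree: a digest lookup prunes every subtree whose
# filter reports "definitely not present", giving O(log n) work instead of a
# linear scan over all n digests. Parameters are illustrative assumptions.
import hashlib

class BloomFilter:
    def __init__(self, size_bits: int = 1 << 16, num_hashes: int = 4):
        self.size = size_bits
        self.k = num_hashes
        self.bits = bytearray(size_bits // 8)

    def _positions(self, item: bytes):
        for i in range(self.k):
            h = hashlib.sha256(bytes([i]) + item).digest()
            yield int.from_bytes(h[:8], "big") % self.size

    def add(self, item: bytes):
        for p in self._positions(item):
            self.bits[p // 8] |= 1 << (p % 8)

    def __contains__(self, item: bytes) -> bool:
        return all(self.bits[p // 8] & (1 << (p % 8)) for p in self._positions(item))

class Node:
    def __init__(self, digests: list[bytes]):
        self.filter = BloomFilter()
        for d in digests:
            self.filter.add(d)
        self.leaf_digests = digests if len(digests) == 1 else None
        self.left = self.right = None
        if len(digests) > 1:                          # divide and conquer: split the set
            mid = len(digests) // 2
            self.left, self.right = Node(digests[:mid]), Node(digests[mid:])

def lookup(node: Node, query: bytes) -> list[bytes]:
    """Return stored digests that the tree reports as possible matches."""
    if query not in node.filter:
        return []                                     # prune this whole subtree
    if node.leaf_digests is not None:
        return node.leaf_digests                      # leaf: candidate match
    return lookup(node.left, query) + lookup(node.right, query)
```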
On the Database Lookup Problem of Approximate Matching
Investigating seized devices within digital forensics becomes increasingly difficult due to the growing amount of data. Hence, a common procedure uses automated file identification, which reduces the amount of data an investigator has to inspect by hand. Besides identifying exact duplicates, which is mostly solved using cryptographic hash functions, it is also helpful to detect similar data by applying approximate matching. Let x denote the number of digests in a database; then the lookup for a single similarity digest has complexity O(x). In other words, the digest has to be compared against all digests in the database. In contrast, cryptographic hash values are stored in binary trees or hash tables, and hence the lookup complexity of a single digest is O(log2(x)) or O(1), respectively. In this paper we present and evaluate a concept to extend existing approximate matching algorithms that reduces the lookup complexity from O(x) to O(1). Instead of using multiple small Bloom filters (which is the common procedure), we demonstrate that a single, huge Bloom filter has far better performance. Our evaluation demonstrates that current approximate matching algorithms are too slow (e.g., over 21 min to compare 4457 digests of a common file corpus against each other), while the improved version solves this challenge within seconds. Studying the precision and recall rates shows that our approach works as reliably as the original implementations. This benefit comes at the cost of accuracy: the comparison is now a file-against-set comparison, and thus it is not possible to see which file in the database is matched.
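A minimal sketch of the single-filter idea: all features of all database files are inserted into one large Bloom filter, so querying a file reduces to constant-time membership probes per feature. The fixed-size chunking used as a feature extractor and the filter parameters are illustrative assumptions, not the evaluated algorithms' exact scheme.

```python
# Sketch of a file-against-set comparison with one huge Bloom filter: every
# fixed-size chunk of every database file goes into the same filter, so a lookup
# is O(1) per chunk. One cannot tell WHICH database file produced a match.
import hashlib

FILTER_BITS = 1 << 28          # one large filter (~32 MiB of bits), assumed size
NUM_HASHES = 5                 # assumed number of hash functions
bits = bytearray(FILTER_BITS // 8)

def _positions(feature: bytes):
    for i in range(NUM_HASHES):
        h = hashlib.sha256(bytes([i]) + feature).digest()
        yield int.from_bytes(h[:8], "big") % FILTER_BITS

def add_file(data: bytes, chunk: int = 64):
    """Insert every fixed-size chunk of a database file into the shared filter."""
    for off in range(0, len(data) - chunk + 1, chunk):
        for p in _positions(data[off:off + chunk]):
            bits[p // 8] |= 1 << (p % 8)

def match_score(data: bytes, chunk: int = 64) -> float:
    """Fraction of the query file's chunks found in the filter (O(1) per chunk)."""
    chunks = [data[off:off + chunk] for off in range(0, len(data) - chunk + 1, chunk)]
    if not chunks:
        return 0.0
    hits = sum(all(bits[p // 8] & (1 << (p % 8)) for p in _positions(c)) for c in chunks)
    return hits / len(chunks)
```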
On Efficiency of Artifact Lookup Strategies in Digital Forensics
In recent years, different strategies have been proposed to handle the problem of ever-growing digital forensic databases. One concept to deal with this data overload is data reduction, which essentially means separating the wheat from the chaff, e.g., filtering in forensically relevant data. A prominent technique in the context of data reduction is hash-based solutions. Data reduction is achieved because hash values (of possibly large data input) are much smaller than the original input. Today's approaches to storing hash-based data fragments range from large-scale multithreaded databases to simple Bloom filter representations. One main focus has been the field of approximate matching, where sorting is a problem due to the fuzzy nature of the approximate hashes. A crucial step during digital forensic analysis is to achieve fast query times during lookup (e.g., against a blacklist), especially in settings with limited or ordinary resource availability. However, comparing different database and lookup approaches is considerably hard, as most techniques differ in the considered use case and the integrated features. In this work we discuss, reassess, and extend three widespread lookup strategies suitable for storing hash-based fragments: (1) the hash database for hash-based carving (hashdb), (2) hierarchical Bloom filter trees (hbft), and (3) flat hash maps (fhmap). We outline the capabilities of the different approaches, integrate new extensions, discuss possible features, and perform a detailed evaluation with a special focus on runtime efficiency. Our results reveal major advantages for fhmap in terms of runtime performance and applicability. Hbft showed comparable runtime efficiency for lookups, but it suffers from pitfalls with respect to extensibility and maintenance. Finally, hashdb performs worst in a single-core environment in all evaluation scenarios. However, hashdb is the only candidate that offers full parallelization capabilities, transactional features, and single-level storage.
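For illustration, a flat hash-set lookup of hash-based fragments can be sketched as follows. This is a simplified stand-in for the evaluated fhmap strategy; the block size and the use of SHA-256 block hashes are assumptions.

```python
# Sketch of the "flat hash map" lookup strategy: block-level hashes of blacklisted
# files are kept in a flat in-memory set, so each fragment lookup is an O(1) probe.
# BLOCK_SIZE and the use of Python's built-in set are illustrative assumptions.
import hashlib

BLOCK_SIZE = 4096                    # assumed hash-based-carving block size
blacklist: set[bytes] = set()

def index_file(data: bytes):
    """Add the hash of every block of a blacklisted file to the flat set."""
    for off in range(0, len(data), BLOCK_SIZE):
        blacklist.add(hashlib.sha256(data[off:off + BLOCK_SIZE]).digest())

def scan_image(image: bytes) -> list[int]:
    """Return offsets in a disk image whose blocks hit the blacklist."""
    return [off for off in range(0, len(image), BLOCK_SIZE)
            if hashlib.sha256(image[off:off + BLOCK_SIZE]).digest() in blacklist]
```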
Globalisierung in der Speisekammer - Band 1: Wege zu einer nachhaltigen Entwicklung im Bedürfnisfeld Ernährung [Globalisation in the Pantry - Volume 1: Ways of a Sustainable Development in the Food Sector]
In Volume 1 of the study "Globalisierung in der Speisekammer", the authors examine past and future developments in agriculture and food. With a view to the various actors, they point out risks and options for action. To evaluate future solutions, the authors take the concept of sustainable development as their basis. It uniquely combines the ecological, economic, and social aspects, all three of which play a significant role in the food sector. The authors shed light on the economic constellations that are decisive for possible solutions and highlight many positive examples.