90 research outputs found

Design and Evaluation of Packet Classification Systems, Doctoral Dissertation, December 2006

Author: Song Haoyu
Publication venue: Washington University Open Scholarship
Publication date: 01/01/2006

Although many algorithms and architectures have been proposed, the design of efficient packet classification systems remains a challenging problem. The diversity of filter specifications, the scale of filter sets, and the throughput requirements of high speed networks all contribute to the difficulty. We need to review the algorithms from a high-level point of view in order to advance the study. This level of understanding can lead to significant performance improvements. In this dissertation, we evaluate several existing algorithms and present several new algorithms as well. The previous evaluation results for existing algorithms are not convincing because they have not been done in a consistent way. To resolve this issue, an objective evaluation platform needs to be developed. We implement and evaluate several representative algorithms with uniform criteria. The source code and the evaluation results are both published on a website to provide the research community with a benchmark for impartial and thorough algorithm evaluations. We propose several new algorithms to deal with the different variations of the packet classification problem. They are: (1) the Shape Shifting Trie algorithm for longest prefix matching, used in IP lookups or as a building block for general packet classification algorithms; (2) the Fast Hash Table lookup algorithm used for exact flow match; (3) the longest prefix matching algorithm using hash tables and tries, used in IP lookups or packet classification algorithms; (4) the 2D coarse-grained tuple-space search algorithm with controlled filter expansion, used for two-dimensional packet classification or as a building block for general packet classification algorithms; (5) the Adaptive Binary Cutting algorithm used for general multi-dimensional packet classification. In addition to the algorithmic solutions, we also consider the TCAM hardware solution. In particular, we address the TCAM filter update problem for general packet classification and provide an efficient algorithm. Building upon the previous work, these algorithms significantly improve the performance of packet classification systems and set a solid foundation for further study.
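
For readers unfamiliar with longest prefix matching, the following is a minimal sketch of the general idea using one hash table per prefix length. It is not the dissertation's algorithm (the abstract does not describe its internals); the class name `HashLPM`, its methods, and the IPv4-only assumption are all illustrative.

```python
import ipaddress


class HashLPM:
    """Illustrative longest prefix match: one dict per prefix length (IPv4 only)."""

    def __init__(self) -> None:
        # Maps prefix length -> {network address as int: next hop}.
        self.tables: dict[int, dict[int, str]] = {}

    def insert(self, prefix: str, next_hop: str) -> None:
        net = ipaddress.ip_network(prefix)
        self.tables.setdefault(net.prefixlen, {})[int(net.network_address)] = next_hop

    def lookup(self, addr: str):
        a = int(ipaddress.ip_address(addr))
        # Probe the longest prefixes first; the first hit is the best match.
        for length in sorted(self.tables, reverse=True):
            mask = ((1 << length) - 1) << (32 - length)
            hit = self.tables[length].get(a & mask)
            if hit is not None:
                return hit
        return None


lpm = HashLPM()
lpm.insert("10.0.0.0/8", "A")
lpm.insert("10.1.0.0/16", "B")
print(lpm.lookup("10.1.2.3"))  # matches both prefixes; the /16 wins
```

A linear probe over prefix lengths is the simplest variant; practical schemes reduce the number of probes, which is the kind of problem the listed algorithms address.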

Washington University St. Louis: Open Scholarship

Fast Packet Classification Using Bloom Filters

Author: Dharmapurikar Sarang
Lockwood John
Song Haoyu
Turner Jonathan
Publication venue: Washington University Open Scholarship
Publication date: 01/01/2006

While the problem of general packet classification has received a great deal of attention from researchers over the last ten years, there is still no really satisfactory solution. Ternary Content Addressable Memory (TCAM), although widely used in practice, is both expensive and consumes a lot of power. Algorithmic solutions, which rely on commodity memory chips, are relatively inexpensive and power-efficient, but have not been able to match the generality and performance of TCAMs. In this paper we propose a new approach to packet classification, which combines architectural and algorithmic techniques. Our starting point is the well-known crossproducting algorithm, which is fast but has significant memory overhead due to the extra rules needed to represent the crossproducts. We show how to modify the crossproduct method in a way that drastically reduces the memory required, without compromising on performance. We avoid unnecessary accesses to off-chip memory by filtering off-chip accesses using on-chip Bloom filters. For packets that match p rules in a rule set, our algorithm requires just 4 + p + ε independent memory accesses on average, to return all matching rules, where ε ≪ 1 is a small constant that depends on the false positive rate of the Bloom filters. Each memory access is just 256 bits, making it practical to classify small packets at OC-192 link rates using two commodity SRAM chips. For rule set sizes ranging from a few hundred to several thousand filters, the average rule set expansion factor attributable to the algorithm is just 1.2. The memory consumption per rule is 36 bytes in the average case.
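
As background for the false positive rate mentioned above, the standard approximation for a Bloom filter with m bits, n stored keys, and k hash functions is given below. The symbols m, n, k, and f are introduced here for illustration and do not appear in the abstract, which also does not state how its small constant is derived from f.

```latex
f \approx \left(1 - e^{-kn/m}\right)^{k},
\qquad
k_{\mathrm{opt}} = \frac{m}{n}\ln 2
\;\Rightarrow\;
f_{\min} \approx \left(\tfrac{1}{2}\right)^{k_{\mathrm{opt}}} \approx 0.6185^{\,m/n}
```

Under this approximation, each additional on-chip bit per stored key lowers the rate of wasted off-chip accesses by a constant factor.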

Washington University St. Louis: Open Scholarship

Models, Algorithms, and Architectures for Scalable Packet Classification

Author: Taylor David Edward
Turner Jonathan S.
Publication venue: Washington University Open Scholarship
Publication date: 28/07/2004

The growth and diversification of the Internet imposes increasing demands on the performance and functionality of network infrastructure. Routers, the devices responsible for the switching and directing of traffic in the Internet, are being called upon to not only handle increased volumes of traffic at higher speeds, but also impose tighter security policies and provide support for a richer set of network services. This dissertation addresses the searching tasks performed by Internet routers in order to forward packets and apply network services to packets belonging to defined traffic flows. As these searching tasks must be performed for each packet traversing the router, the speed and scalability of the solutions to the route lookup and packet classification problems largely determine the realizable performance of the router, and hence the Internet as a whole. Despite the energetic attention of the academic and corporate research communities, there remains a need for search engines that scale to support faster communication links, larger route tables and filter sets and increasingly complex filters. The major contributions of this work include the design and analysis of a scalable hardware implementation of a Longest Prefix Matching (LPM) search engine for route lookup, a survey and taxonomy of packet classification techniques, a thorough analysis of packet classification filter sets, the design and analysis of a suite of performance evaluation tools for packet classification algorithms and devices, and a new packet classification algorithm that scales to support high-speed links and large filter sets classifying on additional packet fields.

Washington University St. Louis: Open Scholarship

Algorithms and Architectures for Network Search Processors

Author: Dharmapurikar Sarang
Publication venue: Washington University Open Scholarship
Publication date: 01/01/2006

The continuous growth in the Internet’s size, the amount of data traffic, and the complexity of processing this traffic gives rise to new challenges in building high-performance network devices. One of the most fundamental tasks performed by these devices is searching the network data for predefined keys. Address lookup, packet classification, and deep packet inspection are some of the operations which involve table lookups and searching. These operations are typically part of the packet forwarding mechanism, and can create a performance bottleneck. Therefore, fast and resource efficient algorithms are required. One of the most commonly used techniques for such searching operations is the Ternary Content Addressable Memory (TCAM). While TCAM can offer very fast search speeds, it is costly and consumes a large amount of power. Hence, designing cost-effective, power-efficient, and high-speed search techniques has received a great deal of attention in the research and industrial community. In this thesis, we propose a generic search technique based on Bloom filters. A Bloom filter is a randomized data structure used to represent a set of bit-strings compactly and support set membership queries. We demonstrate techniques to convert the search process into table lookups. The resulting table data structures are kept in the off-chip memory and their Bloom filter representations are kept in the on-chip memory. An item needs to be looked up in the off-chip table only when it is found in the on-chip Bloom filters. By filtering the off-chip memory accesses in this fashion, the search operations can be significantly accelerated. Our approach involves a unique combination of algorithmic and architectural techniques that outperform some of the current techniques in terms of cost-effectiveness, speed, and power-efficiency.
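
The filtering idea described in this abstract can be sketched in software as follows. This is a minimal illustration, not the thesis's hardware design: the class names `BloomFilter` and `GuardedTable`, the SHA-256 based double hashing, and the default sizes are all assumptions, and a Python dict stands in for the off-chip table.

```python
import hashlib


class BloomFilter:
    """Minimal Bloom filter: m bits, k bit positions derived from SHA-256."""

    def __init__(self, m_bits: int, k_hashes: int) -> None:
        self.m = m_bits
        self.k = k_hashes
        self.bits = bytearray((m_bits + 7) // 8)

    def _positions(self, key: bytes) -> list[int]:
        # Double hashing (an assumed scheme): position_i = (h1 + i * h2) mod m.
        digest = hashlib.sha256(key).digest()
        h1 = int.from_bytes(digest[:8], "big")
        h2 = int.from_bytes(digest[8:16], "big") | 1
        return [(h1 + i * h2) % self.m for i in range(self.k)]

    def add(self, key: bytes) -> None:
        for pos in self._positions(key):
            self.bits[pos // 8] |= 1 << (pos % 8)

    def __contains__(self, key: bytes) -> bool:
        # May report false positives, never false negatives.
        return all(self.bits[pos // 8] & (1 << (pos % 8)) for pos in self._positions(key))


class GuardedTable:
    """Hypothetical 'off-chip' table consulted only when the filter reports a hit."""

    def __init__(self, m_bits: int = 1024, k_hashes: int = 4) -> None:
        self.filter = BloomFilter(m_bits, k_hashes)
        self.table: dict[bytes, str] = {}
        self.off_chip_accesses = 0

    def insert(self, key: bytes, value: str) -> None:
        self.filter.add(key)
        self.table[key] = value

    def lookup(self, key: bytes):
        if key not in self.filter:
            return None  # definite miss: the slow table is never touched
        self.off_chip_accesses += 1  # true hit or Bloom false positive
        return self.table.get(key)


table = GuardedTable()
table.insert(b"10.0.0.1", "port-1")
print(table.lookup(b"10.0.0.1"), table.off_chip_accesses)
```

The counter makes the benefit visible: absent keys usually cost no table access, and a false positive costs one wasted access but never a wrong answer.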

CiteSeerX

Washington University St. Louis: Open Scholarship

Randomized Packet Filtering through Specialized Partitioning of Rulesets

Author: Bonelli Nicola
Luca Abeni
Procissi Gregorio
Publication venue: Institute of Electrical and Electronics Engineers (IEEE)
Publication date: 01/01/2013

A key issue in high speed traffic processing is to immediately detect potentially interesting packets. At very high speed, this operation is particularly crucial as filtering packets close to the wire relieves real applications from handling large volumes of (uninteresting) data. This paper proposes a fast and randomized approach to packet filtering based on partitioning rule databases for their storage in fast and compact Bloom filters that can be placed in fast cache memory. Database partitioning is obtained by a specially tailored clustering algorithm and the results show that even large rulesets can be divided into a limited number of partitions and accommodated in reasonably small Bloom filters

Archivio della Ricerca - Università di Pisa

Design of a multiple bloom filter for distributed navigation routing

Author: Cheng Yongqiang
Ji Yuanxiang
Jiang Ping
Wang Xiaonian
Zhu Jin
Publication venue: Institute of Electrical and Electronics Engineers (IEEE)
Publication date: 07/03/2013

Unmanned navigation of vehicles and mobile robots can be greatly simplified by providing environmental intelligence with dispersed wireless sensors. The wireless sensors can work as active landmarks for vehicle localization and routing. However, wireless sensors are often resource scarce and require a resource-saving design. In this paper, a multiple Bloom-filter scheme is proposed to compress a global routing table for a wireless sensor. It is used as a lookup table for routing a vehicle to any destination but requires significantly less memory space and search effort. An error-expectation-based design for a multiple Bloom filter is proposed as an improvement to the conventional false-positive-rate-based design. The new design is shown to provide an equal relative error expectation for all branched paths, which ensures a better network load balance and uses less memory space. The scheme is implemented in a project for wheelchair navigation using wireless camera motes.

Repository@Hull - Worktribe

Performance Analysis of Data Transmission Error Detection and Correction Based Golay Code with Bloom Filter

Author: RAHUMAN KALEEL
Publication venue: 'CIRWOLRD'
Publication date: 01/07/2016

Channel coding is consistently incorporated to bring in plenteous ovation quality in wireless floating communications transceiver to prevent channel reduction right to inter-symbol postponement, multipath refugee flow, and thermal dish induced by electronic travel devices. Tremendous facilitate and fancy throughput hardware for encoder and decoder could be snug as a bug in a rug in air mail field. Due to the channel acquiring plot, the GOLAY conduct has become a well known of the close but no cigar favourable error-correcting codes. In this freebie, a dressy algorithm has been coming for CRC based encoding step by step diagram, which minus of complete linear feedback incline registers (LFSR). In this super structure, our trade is to raw material a GOLAY conduct based encoder and decoder house per CRC processing technique. A Bloom filter is an efficient contend filter which is chiefly hand me down in distinctive areas. In circumstance that, fancy throughput and silent power cash on barrelhead have been ordained in the Bloom filter architecture. The proposed concern is to study a Bloom filter by the whole of hamming transcend based breakneck dumb thing to do detection and hell to pay methodology and this employment is to recuperate the beg borrow or steal data transmission. The live high on hog filter course of action is to catch a glimpse of the lapse small amount location per the repetitive bits acquire process and to authoritative the error for XOR based outstrip calculation process. The Bloom filter architecture is used to art an adjunct of the hash rate for allocated transmitted data merger and to surge the lapse identification methodology. This rule of thumb is to optimize the decoder process and effectively notice the error location, previously to approve to error by bit reverser interpretation process.

KHALSA PUBLICATIONS

GPU Accelerated protocol analysis for large and long-term traffic traces

Author: Nottingham Alastair Timothy
Publication venue: Faculty of Science, Computer Science
Publication date: 01/01/2016

This thesis describes the design and implementation of GPF+, a complete general packet classification system developed using Nvidia CUDA for Compute Capability 3.5+ GPUs. This system was developed with the aim of accelerating the analysis of arbitrary network protocols within network traffic traces using inexpensive, massively parallel commodity hardware. GPF+ and its supporting components are specifically intended to support the processing of large, long-term network packet traces such as those produced by network telescopes, which are currently difficult and time consuming to analyse. The GPF+ classifier is based on prior research in the field, which produced a prototype classifier called GPF, targeted at Compute Capability 1.3 GPUs. GPF+ greatly extends the GPF model, improving runtime flexibility and scalability, whilst maintaining high execution efficiency. GPF+ incorporates a compact, lightweight register-based state machine that supports massively-parallel, multi-match filter predicate evaluation, as well as efficient arbitrary field extraction. GPF+ tracks packet composition during execution, and adjusts processing at runtime to avoid redundant memory transactions and unnecessary computation through warp-voting. GPF+ additionally incorporates a 128-bit in-thread cache, accelerated through register shuffling, to accelerate access to packet data in slow GPU global memory. GPF+ uses a high-level DSL to simplify protocol and filter creation, whilst better facilitating protocol reuse. The system is supported by a pipeline of multi-threaded high-performance host components, which communicate asynchronously through 0MQ messaging middleware to buffer, index, and dispatch packet data on the host system. The system was evaluated using high-end Kepler (Nvidia GTX Titan) and entry level Maxwell (Nvidia GTX 750) GPUs. The results of this evaluation showed high system performance, limited only by device side IO (600 MBps) in all tests. GPF+ maintained high occupancy and device utilisation in all tests, without significant serialisation, and showed improved scaling to more complex filter sets. Results were used to visualise captures of up to 160 GB in seconds, and to extract and pre-filter captures small enough to be easily analysed in applications such as Wireshark.

South East Academic Libraries System (SEALS)

Rhodes Repository (SEALS)

Classifying DDoS packets in high-speed networks

Author: Xiang Yang
Zhou Wanlei
Publication venue: International Journal of Computer Science and Network Security
Publication date: 01/01/2006

Recently, high-speed networks have been utilized by attackers as Distributed Denial of Service (DDoS) attack infrastructure. Services on high-speed networks also have been attacked by successive waves of the DDoS attacks. How to sensitively and accurately detect the attack traffic, and quickly filter out the attack packets, are still the major challenges in DDoS defense. Unfortunately, most current defense approaches cannot efficiently fulfill these tasks. Our approach is to find the network anomalies by using a neural network and classify DDoS packets by a Bloom filter-based classifier (BFC). BFC is a set of space-efficient data structures and algorithms for packet classification. The evaluation results show that the simple complexity, high classification speed and accuracy and low storage requirements of this classifier make it not only suitable for DDoS filtering in high-speed networks, but also suitable for other applications such as string matching for intrusion detection systems and IP lookup for programmable routers.

Deakin Research Online

Hardware support for real-time network security and packet classification using field programmable gate arrays

Author: Guinde Nitesh Bhicu
Publication venue: Digital Commons @ NJIT
Publication date: 31/05/2010

Deep packet inspection and packet classification are the most computationally expensive operations in a Network Intrusion Detection (NID) system. Deep packet inspection involves content matching where the payload of the incoming packets is matched against a set of signatures in the database. Packet classification involves inspection of the packet header fields and is basically a multi-dimensional matching problem. Any matching in software is very slow in comparison to current network speeds. Also, both of these problems need a solution which is scalable and can work at high speeds. Due to the high complexity of these matching problems, only Field-Programmable Gate Array (FPGA) or Application-Specific Integrated Circuit (ASIC) platforms can facilitate efficient designs. Two novel FPGA-based NID solutions were developed and implemented that not only carry out pattern matching at high speed but also allow changes to the set of stored patterns without resource/hardware reconfiguration; to their advantage, the solutions can easily be adopted by software or ASIC approaches as well. In both solutions, the proposed NID system can run while pattern updates occur. The designs can operate at 2.4 Gbps line rates, and have a memory consumption of around 17 bits per character and a logic cell usage of around 0.05 logic cells per character, which are the smallest compared to any other existing FPGA-based solution. In addition to these solutions for pattern matching, a novel packet classification algorithm was developed and implemented on an FPGA. The method involves a two-field matching process at a time that then combines the constituent results to identify longer matches involving more header fields. The design can achieve a throughput larger than 9.72 Gbps and has an on-chip memory consumption of around 256 Kbytes when dealing with more than 10,000 rules (without using external RAM). This memory consumption is the lowest among all the previously proposed FPGA-based designs for packet classification.

Digital Commons @ New Jersey Institute of Technology (NJIT)