Search CORE

87 research outputs found

On the Exploitation of a High-throughput SHA-256 FPGA Design for HMAC

Author: Costas E. Goutis
Dobbertin H.
George S. Athanasiou
George Theodoridis
Harris E. Michail
Hodjat A.
Kim M.
Michail H.
Perset K.
Pouffary Y.
Rogawski M.
Vasilis Kelefouras
Zeghid M.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/03/2012
Field of study

High-throughput and area-efficient designs of hash functions and corresponding mechanisms for Message Authentication Codes (MACs) are in high demand due to new security protocols that have arisen and call for security services in every transmitted data packet. For instance, IPv6 incorporates the IPSec protocol for secure data transmission. However, the IPSec's performance bottleneck is the HMAC mechanism which is responsible for authenticating the transmitted data. HMAC's performance bottleneck in its turn is the underlying hash function. In this article a high-throughput and small-size SHA-256 hash function FPGA design and the corresponding HMAC FPGA design is presented. Advanced optimization techniques have been deployed leading to a SHA-256 hashing core which performs more than 30% better, compared to the next better design. This improvement is achieved both in terms of throughput as well as in terms of throughput/area cost factor. It is the first reported SHA-256 hashing core that exceeds 11Gbps (after place and route in Xilinx Virtex 6 board)

Crossref

Ktisis

Sheffield Hallam University Research Archive

A framework for automatically generating optimized digital designs from C-language loops

Author: Holland Wesley James
Publication venue: Scholars Junction
Publication date: 03/05/2008
Field of study

Reconfigurable computing has the potential for providing significant performance increases to a number of computing applications. However, realizing these benefits requires digital design experience and knowledge of hardware description languages (HDLs). While a number of tools have focused on translation of high-level languages (HLLs) to HDLs, the tools do not always create optimized digital designs that are competitive with hand-coded solutions. This work describes an automatic optimization in the C-to-HDL transformation that reorganizes operations between pipeline stages in order to reduce critical path lengths. The effects of this optimization are examined on the MD5, SHA-1, and Smith-Waterman algorithms. Results show that the optimization results in performance gains of 13%-37% and that the automatically-generated implementations perform comparably to hand-coded implementations

Scholars Junction - Mississippi State University Institutional Repository

A framework for automatically generating optimized digital designs from C-language loops

Author: Holland Wesley James
Publication venue: Scholars Junction
Publication date: 01/05/2008
Field of study

Mississippi State University Libraries ETD database

Scholars Junction - Mississippi State University Institutional Repository

High Throughput Hardware/Software Co-design Approach SHA-256 Hashing Cryptographic Module in IPSec/IPv6

Author: A. A. Gregoriades
C.E. Goutis
Dr.Harris Michai
Publication venue: Global Journals Inc. (US)
Publication date: 12/06/2010
Field of study

Global Journal of Computer Science and Technology (GJCST)

SHA-2 Acceleration Meeting the Needs of Emerging Applications: A Comparative Survey

Author: Cilardo Alessandro
Martino Raffaele
Publication venue
Publication date: 01/01/2020
Field of study

Archivio della ricerca - Università degli studi di Napoli Federico II

Open Access Repository

A Novel Single Pass Authenticated Encryption Stream Cipher for Software Defined Radios

Author: Khajuria Samant
Publication venue
Publication date: 15/11/2012
Field of study

VBN

High Level Synthesis and Evaluation of the Secure Hash Standard for FPGAs

Author: Janik Ian Spencer
Publication venue: 'University of Windsor Leddy Library'
Publication date: 19/10/2015
Field of study

Secure hash algorithms (SHAs) are important components of cryptographic applications. SHA performance on central processing units (CPUs) is slow, therefore, acceleration must be done using hardware such as Field Programmable Gate Arrays (FPGAs). Considerable work has been done in academia using FPGAs to accelerate SHAs. These designs were implemented using Hardware Description Language (HDL) based design methodologies, which are tedious and time consuming. High Level Synthesis (HLS) enables designers to synthesize optimized FPGA hardware from algorithm specifications in programming languages such as C/C++. This substantially reduces the design cost and time. In this thesis, the Altera SDK for OpenCL (AOCL) HLS tool was used to synthesize the SHAs on FPGAs and to explore the design space of the algorithms. The results were evaluated against the previous HDL based designs. Synthesized FPGA hardware performance was comparable to the HDL based designs despite the simpler and faster design process

Scholarship at UWindsor

Recommended from our members

Cryptoraptor : high throughput reconfigurable cryptographic processor for symmetric key encryption and cryptographic hash functions

Author: Sayilar Gokhan
Publication venue
Publication date: 03/02/2015
Field of study

textIn cryptographic processor design, the selection of functional primitives and connection structures between these primitives are extremely crucial to maximize throughput and flexibility. Hence, detailed analysis on the specifications and requirements of existing crypto-systems plays a crucial role in cryptographic processor design. This thesis provides the most comprehensive literature review that we are aware of on the widest range of existing cryptographic algorithms, their specifications, requirements, and hardware structures. In the light of this analysis, it also describes a high performance, low power, and highly flexible cryptographic processor, Cryptoraptor, that is designed to support both today's and tomorrow's encryption standards. To the best of our knowledge, the proposed cryptographic processor supports the widest range of cryptographic algorithms compared to other solutions in the literature and is the only crypto-specific processor targeting the future standards as well. Unlike previous work, we aim for maximum throughput for all known encryption standards, and to support future standards as well. Our 1GHz design achieves a peak throughput of 128Gbps for AES-128 which is competitive with ASIC designs and has 25X and 160X higher throughput per area than CPU and GPU solutions, respectively.Electrical and Computer Engineerin

Texas ScholarWorks

Autotuning the Intel HLS Compiler using the Opentuner Framework

Author: Janzen Chandler
Publication venue: 'University of Saskatchewan Library'
Publication date: 28/10/2019
Field of study

High level synthesis (HLS) tools can be used to improve design flow and decrease verification times for field programmable gate array (FPGA) and application specific integrated circuit (ASIC) design. The Intel HLS Compiler is a high level synthesis tool that takes in untimed C/C++ as input and generates production-quality register transfer level (RTL) code that is optimized for Intel FPGAs. The translation does, however, require multiple iterations and manual optimizations to get comparable synthesized results to that of a solution written in a hardware descriptive language. The synthesis results can vary greatly based upon coding style and optimization techniques, and typically require an in-depth knowledge of FPGAs to fully optimize the translation which limits the audience of the tool. The extra abstraction that the C/C++ source code presents can also make it difficult to meet more specific design requirements; this includes designs to meet specific resource usage or performance based metrics. To improve the quality of results generated by the Intel HLS Compiler without a manual iterative process that requires an in-depth knowledge of FPGAs, this research proposes a method of automating some of the optimization techniques that improve the synthesized design through an autotuning process. The proposed approach utilizes the PyCParser library to parse C source files and the OpenTuner Framework to autotune the synthesis to provide a method that generates results that better meet the needs of the designer's requirements through lower FPGA resource usage or increased design performance. Such functionality is not currently available in Intel's commercial tools. The proposed approach was tested with the CHStone Benchmarking Suite of C programs as well as a standard digital signal processing finite impulse response filter. The results show that the commercial HLS tool can be automatically autotuned through placeholder injection using a source parsing tool for C code and using the OpenTuner Framework to autotune the results. For designs that are small in nature and include conducive structures to be autotuned, the results indicate resource usage reductions and/or performance increases of up to 40% as compared to the default Intel HLS Compiler results. The method developed in this research also allows additional design targets to be specified through the autotuner for consideration in the synthesized design which can yield results that are better matched to a design's requirements

University of Saskatchewan Research Archive