Search CORE

27 research outputs found

Fast Embedded Software Hashing

Author: Dag Arne Osvik
Publication venue: International Association for Cryptologic Research (IACR)
Publication date: 23/03/2012
Field of study

We present new software speed records for several popular hash functions on low-end 8-bit AVR microcontrollers. Target algorithms include widely deployed hash functions like SHA-1 and SHA-256 as well as the SHA-3 (second round) candidates Blake-32 and Skein-256. A significant aspect of our implementations is that they reduce the overall resource requirements, improving not only execution time but also RAM footprint and sometimes ROM/Flash memory footprint at the same time, providing the best memory/performance trade-os reported so far. We believe that our results will shed new light on the ongoing SHA-3 competition, and be helpful for the next stage of the competition

Cryptology ePrint Archive

Efficient Cache Attacks on AES, and Countermeasures

Author: Osvik Dag Arne
Shamir Adi
Tromer Eran
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 16/12/2011
Field of study

We describe several software side-channel attacks based on inter-process leakage through the state of the CPU's memory cache. This leakage reveals memory access patterns, which can be used for cryptanalysis of cryptographic primitives that employ data-dependent table lookups. The attacks allow an unprivileged process to attack other processes running in parallel on the same processor, despite partitioning methods such as memory protection, sandboxing, and virtualization. Some of our methods require only the ability to trigger services that perform encryption or MAC using the unknown key, such as encrypted disk partitions or secure network links. Moreover, we demonstrate an extremely strong type of attack, which requires knowledge of neither the specific plaintexts nor ciphertexts and works by merely monitoring the effect of the cryptographic process on the cache. We discuss in detail several attacks on AES and experimentally demonstrate their applicability to real systems, such as OpenSSL and Linux's dm-crypt encrypted partitions (in the latter case, the full key was recovered after just 800 writes to the partition, taking 65 milliseconds). Finally, we discuss a variety of countermeasures which can be used to mitigate such attacks

Infoscience - École polytechnique fédérale de Lausanne

Multi-Stream Hashing on the PlayStation 3

Author: Bos Joppe Willem
Casati Nathalie
Osvik Dag Arne
Publication venue: 'Springer Fachmedien Wiesbaden GmbH'
Publication date: 20/12/2011
Field of study

With process technology providing more and more transistors per chip, still following Moore's \law", processor designers have used a number of techniques to make those transistors useful. Lately they have started placing multiple processor cores on each chip; an example is the Cell Broadband Engine, which serves as the heart of Sony's PlayStation 3 game console. We present high-performance multi-stream versions of cryptographic hash functions from the MD/SHA-family. Our implementations require 1.74, 3.51 and 8.18 cycles per byte per SPE when using the cryptographic hash functions MD5, SHA-1 and SHA-256 respectively. To the best of our knowledge these are the fastest implementations of these hash functions for the Cell processor. These implementations can be useful for cryptanalytic use as well as for utilizing the SPEs as cryptographic accelerators

Infoscience - École polytechnique fédérale de Lausanne

Fast Implementations of AES on Various Platforms

Author: Dag Arne Osvik
Deian Stefan
Joppe W. Bos
Publication venue: International Association for Cryptologic Research (IACR)
Publication date: 06/11/2009
Field of study

This paper presents new software speed records for encryption and decryption using the block cipher AES-128 for different architectures. Target platforms are 8-bit AVR microcontrollers, NVIDIA graphics processing units (GPUs) and the Cell broadband engine. The new AVR implementation requires 124.6 and 181.3 cycles per byte for encryption and decryption with a code size of less than two kilobyte. Compared to the previous AVR records for encryption our code is 38 percent smaller and 1.24 times faster. The byte-sliced implementation for the synergistic processing elements of the Cell architecture achieves speed of 11.7 and 14.4 cycles per byte for encryption and decryption. Similarly, our fastest GPU implementation, running on the GTX 295 and handling many input streams in parallel, delivers throughputs of 0.17 and 0.19 cycles per byte for encryption and decryption respectively. Furthermore, this is the first AES implementation for the GPU which implements both encryption and decryption

Cryptology ePrint Archive

A kilobit special number field sieve factorization

Author: Aoki Kazumaro
Franke Jens
Kleinjung Thorsten
Lenstra Arjen K.
Osvik Dag Arne
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 24/06/2010
Field of study

We describe how we reached a new factoring milestone by completing the first special number field sieve factorization of a number having more than 1024 bits, namely the Mersenne number 21039 -1. Although this factorization is orders of magnitude 'easier' than a factorization of a 1024-bit RSA modulus is believed to be, the methods we used to obtain our result shed new light on the feasibility of the latter computation. © International Association for Cryptology Research 2007

Infoscience - École polytechnique fédérale de Lausanne

A heterogeneous computing environment to solve the 768-bit RSA challenge

Author: A.K. Lenstra
A.K. Lenstra
Andrey Timofeev
Arjen K. Lenstra
D. Coppersmith
D. Coppersmith
D.M. Gordon
Dag Arne Osvik
Emmanuel Thomé
H. Stockinger
Heinz Stockinger
Jens Franke
Joppe W. Bos
K. Aoki
K. Aoki
Kazumaro Aoki
Michela Thiémard
N. Capit
P.W. Shor
Pascal Jermini
Paul Leyland
Peter L. Montgomery
Scott Contini
T. Kleinjung
Thorsten Kleinjung
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Speeding up Serpent

Author: Dag Arne Osvik
Publication venue
Publication date: 01/01/2000
Field of study

We present a method for finding efficient instruction sequences for the Serpent S-boxes. Current implementations need many registers to store temporary variables, yet the common x86 processors only have 8 registers, of which even fewer are available for computations. The instructions are also destructive, replacing one input with the output. Alternative versions of the S-box instructions are presented, requiring only 5 registers and also utilizing parallelism. Speedup of C language implementations of 24% is shown on the Pentium Pro Processor, and 42% on the Pentium, both compared to the previously fastest known implementation of Serpent

CiteSeerX

A More Compact AES

Author: Canright David
Osvik Dag Arne
Publication venue
Publication date: 01/01/2009
Field of study

We explore ways to reduce the number of bit operations required to implement AES. One way involves optimizing the composite field approach for entire rounds of AES. Another way is integrating the Galois multiplications of MixColumns with the linear transformations of the S-box. Combined with careful optimizations, these reduce the number of bit operations to encrypt one block by 9.0%, compared to earlier work that used the composite field only in the S-box. For decryption, the improvement is 13.5%. This work may be useful both as a starting point for a bit-sliced software implementation, where reducing operations increases speed, and also for hardware with limited resources

Calhoun, Institutional Archive of the Naval Postgraduate School