Paper 2009/501

Fast Implementations of AES on Various Platforms

Joppe W. Bos, Dag Arne Osvik, and Deian Stefan


This paper presents new software speed records for encryption and decryption using the block cipher AES-128 for different architectures. Target platforms are 8-bit AVR microcontrollers, NVIDIA graphics processing units (GPUs) and the Cell broadband engine. The new AVR implementation requires 124.6 and 181.3 cycles per byte for encryption and decryption with a code size of less than two kilobyte. Compared to the previous AVR records for encryption our code is 38 percent smaller and 1.24 times faster. The byte-sliced implementation for the synergistic processing elements of the Cell architecture achieves speed of 11.7 and 14.4 cycles per byte for encryption and decryption. Similarly, our fastest GPU implementation, running on the GTX 295 and handling many input streams in parallel, delivers throughputs of 0.17 and 0.19 cycles per byte for encryption and decryption respectively. Furthermore, this is the first AES implementation for the GPU which implements both encryption and decryption.

Note: Fixed AVR cycle numbers in the abstract.

Available format(s)
Publication info
Published elsewhere. Unknown where it was published
Contact author(s)
joppe bos @ epfl ch
2009-11-06: revised
2009-10-20: received
See all versions
Short URL
Creative Commons Attribution


      author = {Joppe W.  Bos and Dag Arne Osvik and Deian Stefan},
      title = {Fast Implementations of AES on Various Platforms},
      howpublished = {Cryptology ePrint Archive, Paper 2009/501},
      year = {2009},
      note = {\url{}},
      url = {}
Note: In order to protect the privacy of readers, does not use cookies or embedded third party content.