Paper 2016/471

NTRU Modular Lattice Signature Scheme on CUDA GPUs

Wei Dai, John Schanck, Berk Sunar, William Whyte, and Zhenfei Zhang


In this work we show how to use Graphics Processing Units (GPUs) with Compute Unified Device Architecture (CUDA) to accelerate a lattice based signature scheme, namely, the NTRU modular lattice signature (NTRU-MLS) scheme. Lattice based schemes require operations on large vectors that are perfect candidates for GPU implementations. In addition, similar to most lattice based signature schemes, NTRU-MLS provides transcript security with a rejection sampling technique. With a GPU implementation, we are able to generate many candidates simultaneously, and hence mitigate the performance slowdown from rejection sampling. Our implementation results show that for the original NTRU-MLS parameter sets, we obtain a 2x improvement in the signing speed; for the revised parameter sets, where acceptance rate of rejection sampling is down to around 1%, our implementation can be as much as 47x faster than a CPU implementation.

Available format(s)
Public-key cryptography
Publication info
Published elsewhere. Minor revision. 11th International Workshop on Security and High Performance Computing Systems (SHPCS 2016)
NTRUdigital signatureslattice techniquesCUDAGPU
Contact author(s)
wdai @ wpi edu
2016-07-14: revised
2016-05-17: received
See all versions
Short URL
Creative Commons Attribution


      author = {Wei Dai and John Schanck and Berk Sunar and William Whyte and Zhenfei Zhang},
      title = {{NTRU} Modular Lattice Signature Scheme on {CUDA} {GPUs}},
      howpublished = {Cryptology ePrint Archive, Paper 2016/471},
      year = {2016},
      note = {\url{}},
      url = {}
Note: In order to protect the privacy of readers, does not use cookies or embedded third party content.