Paper 2025/749

GOLF: Unleashing GPU-Driven Acceleration for FALCON Post-Quantum Cryptography

Ruihao Dai, Nanjing University of Posts and Telecommunications
Jiankuo Dong, Nanjing University of Posts and Telecommunications
Mingrui Qiu, Nanjing University of Posts and Telecommunications
Zhenjiang Dong, Nanjing University of Posts and Telecommunications
Fu Xiao, Nanjing University of Posts and Telecommunications
Jingqiang Lin, University of Science and Technology of China
Abstract

Quantum computers leverage qubits to solve certain computational problems significantly faster than classical computers. This capability poses a severe threat to traditional cryptographic algorithms, leading to the rise of post-quantum cryptography (PQC) designed to withstand quantum attacks. FALCON, a lattice-based signature algorithm, has been selected by the National Institute of Standards and Technology (NIST) as part of its post-quantum cryptography standardization process. However, due to the computational complexity of PQC, especially in cloud-based environments, throughput limitations during peak demand periods have become a bottleneck, particularly for FALCON. In this paper, we introduce GOLF (GPU-accelerated Optimization for Lattice-based FALCON), a novel GPU-based parallel acceleration framework for FALCON. GOLF includes algorithm porting to the GPU, compatibility modifications, multi-threaded parallelism with distinct data, single-thread optimization for single tasks, and specific enhancements to the Fast Fourier Transform (FFT) module within FALCON. Our approach achieves unprecedented performance in FALCON acceleration on GPUs. On the NVIDIA RTX 4090, GOLF reaches a signature generation throughput of 42.02 kops/s and a signature verification throughput of 10,311.04 kops/s. These results represent a 58.05 / 73.14 improvement over the reference FALCON implementation and a 7.17 / 3.79 improvement compared to the fastest known GPU implementation to date. GOLF demonstrates that GPU acceleration is not only feasible for post-quantum cryptography but also crucial for addressing throughput bottlenecks in real-world applications.

Metadata
Available format(s)
PDF
Category
Implementation
Publication info
Preprint.
Keywords
GPUFALCONPost-quantum cryptographyCUDA
Contact author(s)
russdrh @ gmail com
djiankuo @ gmail com
1024041125 @ njupt edu cn
dongzhenjiang @ njupt edu cn
xiaof @ njupt edu cn
linjq @ ustc edu cn
History
2025-04-28: approved
2025-04-27: received
See all versions
Short URL
https://ia.cr/2025/749
License
Creative Commons Attribution
CC BY

BibTeX

@misc{cryptoeprint:2025/749,
      author = {Ruihao Dai and Jiankuo Dong and Mingrui Qiu and Zhenjiang Dong and Fu Xiao and Jingqiang Lin},
      title = {{GOLF}: Unleashing {GPU}-Driven Acceleration for {FALCON} Post-Quantum Cryptography},
      howpublished = {Cryptology {ePrint} Archive, Paper 2025/749},
      year = {2025},
      url = {https://eprint.iacr.org/2025/749}
}
Note: In order to protect the privacy of readers, eprint.iacr.org does not use cookies or embedded third party content.