FalconSign: An Efficient and High-Throughput Hardware Architecture for Falcon Signature Generation

Authors

  • Yi Ouyang Beijing National Research Center for Information Science and Technology (BNRist), School of Integrated Circuits, Tsinghua University, Beijing, China
  • Yihong Zhu Beijing National Research Center for Information Science and Technology (BNRist), School of Integrated Circuits, Tsinghua University, Beijing, China
  • Wenping Zhu Beijing National Research Center for Information Science and Technology (BNRist), School of Integrated Circuits, Tsinghua University, Beijing, China
  • Bohan Yang Beijing National Research Center for Information Science and Technology (BNRist), School of Integrated Circuits, Tsinghua University, Beijing, China
  • Zirui Zhang Beijing National Research Center for Information Science and Technology (BNRist), School of Integrated Circuits, Tsinghua University, Beijing, China
  • Hanning Wang Beijing National Research Center for Information Science and Technology (BNRist), School of Integrated Circuits, Tsinghua University, Beijing, China
  • Qichao Tao Beijing National Research Center for Information Science and Technology (BNRist), School of Integrated Circuits, Tsinghua University, Beijing, China
  • Min Zhu Wuxi Micro Innovation Integrated Circuit Design Co., Ltd., Wuxi, China
  • Shaojun Wei Beijing National Research Center for Information Science and Technology (BNRist), School of Integrated Circuits, Tsinghua University, Beijing, China
  • Leibo Liu Beijing National Research Center for Information Science and Technology (BNRist), School of Integrated Circuits, Tsinghua University, Beijing, China

DOI:

https://doi.org/10.46586/tches.v2025.i1.203-226

Keywords:

Post-quantum cryptography, Falcon, Lattice, Fast-Fourier Sampling, Floating-point, High-performance, Configurable, FPGA

Abstract

Falcon is a lattice-based quantum-resistant digital signature scheme renowned for its high signature generation/verification speed and compact signature size. The scheme has been selected to be drafted in the third round of the post-quantum cryptography (PQC) standardization process due to its unique attributes and robust security features. Despite its strengths, there has been a lack of research on hardware acceleration, primarily due to its complex calculation flow and floating-point operations, which hinders its widespread adoption. To address this issue, we propose FalconSign, a high-performance, configurable crypto-processor designed to accelerate Falcon signature generation on FPGA/ASIC through algorithmhardware co-design. Our approach involves a new scheduling flow and architecture for Fast-Fourier Sampling to enhance computing unit reuse and reduce processing time. Additionally, we introduce several optimized modules, including configurable randomness generation units, parallel floating-point processing units, and an optimized SamplerZ module, to improve execution efficiency. Furthermore, this paper presents a finely optimized hardware accelerator for the Falcon scheme. Our FPGA implementation results demonstrate a throughput improvement of approximately 5.1 x compared to state-of-the-art designs, with 2.8x/4.5x/4.2x/3.2x fewer in the area (LUTs/FFs/DSPs/BRAMs)-time product, for NIST security level V. The crypto-processor occupies an area of 0.71 mm2 and achieves 5.2k OPS at throughput on the TSMC 28nm process for NIST security level I.

Downloads

Published

2024-12-09

Issue

Section

Articles

How to Cite

FalconSign: An Efficient and High-Throughput Hardware Architecture for Falcon Signature Generation. (2024). IACR Transactions on Cryptographic Hardware and Embedded Systems, 2025(1), 203-226. https://doi.org/10.46586/tches.v2025.i1.203-226