DSpace at EWHA: Accelerating Fully Homomorphic Encryption Through Architecture-Centric Analysis and Optimization

Title: Accelerating Fully Homomorphic Encryption Through Architecture-Centric Analysis and Optimization

Authors: Jung, Wonkyung; Lee, Eojin; Kim, Sangpyo; Kim, Jongmin; Kim, Namhoon; Lee, Keewoo; Min, Chohong; Cheon, Jung Hee; Ahn, Jung Ho

Ewha Authors: Min, Chohong

SCOPUS Author ID: Min, Chohong

Issue Date: 2021

Journal Title: IEEE ACCESS

ISSN: 2169-3536

Citation: IEEE ACCESS, vol. 9, pp. 98772-98789

Keywords: Computer applications; computer architecture; cryptography; multicore processing

Publisher: IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

Indexed: SCIE; SCOPUS

Document Type: Article

Abstract: Homomorphic Encryption (HE) has drawn significant attention as a privacy-preserving approach for cloud computing because it allows computation on encrypted messages called ciphertexts. Among the numerous HE schemes proposed thus far, HE for Arithmetic of Approximate Numbers (HEAAN) is rapidly gaining popularity across a wide range of applications, as it supports messages that can tolerate approximate computations with no limit on the number of arithmetic operations applicable to the ciphertexts. A critical shortcoming of HE is the high computational complexity of ciphertext arithmetic; specifically, HE multiplication (HE Mul) is more than 10,000 times slower than the corresponding multiplication between unencrypted messages. This has led to a large body of HE acceleration studies, including those that exploit FPGAs; however, a rigorous analysis of the computational complexity and data access patterns of HE Mul is lacking. Moreover, these proposals mostly focused on designs with small parameter sizes, making it difficult to accurately estimate the performance of the HE accelerators when conducting a series of complex arithmetic operations. In this paper, we first describe, in a manner accessible to non-crypto experts, how HE Mul of HEAAN is performed. Then, we conduct a disciplined analysis of its computational and memory-access characteristics, through which we (1) extract parallelism in the key functions composing HE Mul and (2) demonstrate how to map the parallelism effectively to popular parallel processing platforms, CPUs and GPUs, by applying a series of optimizations such as transposing matrices and pinning data to threads. This improves the performance of HE Mul on a CPU and a GPU by 2.06x and 4.05x, respectively, over the reference HEAAN running on a CPU with 24 threads.