Cited 0 time in
THOR: Secure Transformer Inference with Homomorphic Encryption
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Moon, Jungho | - |
| dc.contributor.author | Yoo, Dongwoo | - |
| dc.contributor.author | Jiang, Xiaoqian | - |
| dc.contributor.author | Kim, Miran | - |
| dc.date.accessioned | 2025-12-18T02:30:31Z | - |
| dc.date.available | 2025-12-18T02:30:31Z | - |
| dc.date.issued | 2025-11 | - |
| dc.identifier.uri | https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/209896 | - |
| dc.description.abstract | As large language models are increasingly deployed in cloud environments, privacy concerns have become a significant issue. To address this challenge, we present THOR, a non-interactive framework for secure transformer inference using homomorphic encryption. We first propose efficient matrix multiplication algorithms based on diagonal-major encoding and compact ciphertext packing. We extend these basic algorithms to support plaintext-ciphertext matrix multiplication (PC-MM) using parallel submatrix computation and ciphertext-ciphertext multiplication (CC-MM) with a baby-step giant-step strategy. We also design efficient evaluation strategies for non-linear functions such as softmax, LayerNorm, GELU, and Tanh, by integrating advanced approximation techniques with adaptive iterative methods. Our matrix multiplication algorithms outperform state-of-the-art methods, achieving up to 5.3X speedup in PC-MM for ℝ 768 X 768 X ℝ768X128 over BOLT (Pang et al., IEEE S&P 2024) and 9.7X in CC-MM for 12X (ℝ64X128 X ℝ128X128) over Powerformer (Park et al., Preprint). THOR enables secure inference on the BERT-base model with 128 tokens in 10 minutes on a single GPU, while maintaining comparable accuracy on GLUE tasks. | - |
| dc.format.extent | 15 | - |
| dc.language | 영어 | - |
| dc.language.iso | ENG | - |
| dc.publisher | Association for Computing Machinery, Inc | - |
| dc.title | THOR: Secure Transformer Inference with Homomorphic Encryption | - |
| dc.type | Article | - |
| dc.identifier.doi | 10.1145/3719027.3765150 | - |
| dc.identifier.scopusid | 2-s2.0-105023900652 | - |
| dc.identifier.wosid | 001657120200255 | - |
| dc.identifier.bibliographicCitation | CCS 2025 - Proceedings of the 2025 ACM SIGSAC Conference on Computer and Communications Security, pp 3765 - 3779 | - |
| dc.citation.title | CCS 2025 - Proceedings of the 2025 ACM SIGSAC Conference on Computer and Communications Security | - |
| dc.citation.startPage | 3765 | - |
| dc.citation.endPage | 3779 | - |
| dc.type.docType | Proceedings Paper | - |
| dc.description.isOpenAccess | Y | - |
| dc.description.journalRegisteredClass | scopus | - |
| dc.relation.journalResearchArea | Computer Science | - |
| dc.relation.journalResearchArea | Telecommunications | - |
| dc.relation.journalWebOfScienceCategory | Computer Science, Information Systems | - |
| dc.relation.journalWebOfScienceCategory | Computer Science, Theory & Methods | - |
| dc.relation.journalWebOfScienceCategory | Telecommunications | - |
| dc.subject.keywordPlus | Ciphertext | - |
| dc.subject.keywordPlus | Cryptography | - |
| dc.subject.keywordPlus | Encryption algorithms | - |
| dc.subject.keywordPlus | Inference engines | - |
| dc.subject.keywordPlus | Matrix algebra | - |
| dc.subject.keywordPlus | Program processors | - |
| dc.subject.keywordPlus | Security of data | - |
| dc.subject.keywordAuthor | Homomorphic encryption | - |
| dc.subject.keywordAuthor | Matrix computation | - |
| dc.subject.keywordAuthor | Transformer | - |
| dc.identifier.url | https://dl.acm.org/doi/10.1145/3719027.3765150 | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
222, Wangsimni-ro, Seongdong-gu, Seoul, 04763, Korea+82-2-2220-1366
COPYRIGHT © 2024 HANYANG UNIVERSITY.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.
