Doubly Efficient Fuzzy Private Set Intersection for High-dimensional Data with Cosine Similarity
- Authors
- Son, Hyunjung; Paik, Seunghun; Kim, Yunki; Kim, Sunpill; Chung, Heewon; Seo, Jaehong
- Issue Date
- Dec-2025
- Publisher
- IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
- Keywords
- Protocols; Vectors; High dimensional data; Noise; Approximation error; Recommender systems; Privacy; Measurement; Homomorphic encryption; Hands; Fuzzy matchingfuzzy private set intersection; homomorphic encryption
- Citation
- IEEE ACCESS, v.13, pp 217108 - 217125
- Pages
- 18
- Indexed
- SCIE
SCOPUS
- Journal Title
- IEEE ACCESS
- Volume
- 13
- Start Page
- 217108
- End Page
- 217125
- URI
- https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/210313
- DOI
- 10.1109/ACCESS.2025.3648455
- ISSN
- 2169-3536
- Abstract
- Fuzzy private set intersection (fuzzy PSI) is a cryptographic protocol that enables privacy-preserving similarity matching, where a client securely learns which of its items are sufficiently close to some item in the service provider’s dataset. This functionality is essential for various real-world applications including biometric authentication, information retrieval, or recommendation systems. However, existing fuzzy PSI protocols suffer from two major barriers to deployment. First, many cannot handle high-dimensional feature vectors, e.g., from 128 to 512, in practical applications because of their exponential computation/communication overheads in dimension. Furthermore, existing protocols supporting L<inf>2</inf> distance cannot accommodate cosine similarity.We found that their distributional assumptions on the data enforce overly strict similarity matching thresholds for the application, leading to prohibitively large false negatives. In this paper, we present FPHC, the first fuzzy PSI protocol that efficiently handles high-dimensional data with cosine similarity. FPHC features linear complexity on both computation and communication with respect to the dimension of set elements. We leverage CKKS, a homomorphic encryption scheme supporting approximate real-valued arithmetic, to compute similarity scores and threshold comparison, along with a clever packing method for efficiency. Moreover, we introduce a novel proof technique to harmonize the approximation error from the sign function with the noise flooding, proving the security of FPHC under the semi-honest model. Finally, we extend our FPHC to functionalities such as labeled or circuit fuzzy PSI. Through experiments, we demonstrate that FPHC can perform fuzzy PSI over 512-dimensional data in a few minutes, which was computationally infeasible in other previous fuzzy PSI proposals.
- Files in This Item
-
Go to Link
- Appears in
Collections - 서울 자연과학대학 > 서울 수학과 > 1. Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.