Enhancing Post-Training Quantization Robustness in RRAM-Based CIM via σ-Clipping of Weights
- Authors
- Oh, Byeongchan; Youn, Sangwook; Kim, Hyungjin; Kim, Tae-Hyeon
- Issue Date
- May-2026
- Publisher
- AMER CHEMICAL SOC
- Keywords
- resistive random-access memory (RRAM); compute-in-memory(CIM); hardware neural network; clipping; post-training quantization (PTQ)
- Citation
- ACS APPLIED ELECTRONIC MATERIALS, v.8, no.9, pp 4041 - 4051
- Pages
- 11
- Indexed
- SCIE
SCOPUS
- Journal Title
- ACS APPLIED ELECTRONIC MATERIALS
- Volume
- 8
- Number
- 9
- Start Page
- 4041
- End Page
- 4051
- URI
- https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/213354
- DOI
- 10.1021/acsaelm.6c00222
- ISSN
- 2637-6113
2637-6113
- Abstract
- Resistive random-access memory (RRAM)-based computing-in-memory (CIM) has emerged as a promising solution for energy-efficient deep learning inference by eliminating the memory bottleneck of conventional von Neumann architectures. However, the limited bit precision and inherent variability of memristive devices pose significant challenges to inference accuracy. In this study, we present a post-training quantization (PTQ) framework that incorporates sigma-based clipping to suppress weight outliers before quantization. This technique narrows the dynamic range of weights, improving quantization resolution and robustness. Using a three-layer fully connected neural network trained on binarized MNIST data, we analyzed the impact of sigma-clipping on PTQ performance across z-score thresholds. Clipping improved inference accuracy and reduced accuracy variance across retrained models. Furthermore, we experimentally validated our approach using a 48 & times; 48 Al2O3/TiO x -based RRAM crossbar array. Despite conductance deviations during weight programming, clipping improved inference accuracy from 96.27% to 97.76%, confirming its effectiveness under hardware variation. In simulation, when variation corresponding to experimentally observed fluctuation was introduced, PTQ accuracy dropped from 96.42% to 83.67%, while the clipped model maintained 96.96%, demonstrating a 12.75%p improvement. These results establish sigma-clipping as a practical and lightweight strategy to enhance the quantization robustness of CIM-based neural networks.
- Files in This Item
-
Go to Link
- Appears in
Collections - 서울 공과대학 > 서울 신소재공학부 > 1. Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.