Gaussian-Sigmoid Reinforcement Transistors: Resolving Exploration-Exploitation Trade-Off Through Gate Voltage-Controlled Activation Functions
- Authors
- Park, Jisoo; Seo, Juhyung; Koo, Ryun-Han; Jayasuriya, Dinithi; Jayasinghe, Nethmi; Shin, Wonjun; Trivedi, Amit R.; Yoo, Hocheon
- Issue Date
- Dec-2025
- Publisher
- John Wiley & Sons Ltd.
- Keywords
- gaussian-sigmoid mixed function; heterojunction; neuromorphic; reinforcement learning; thin film transistor
- Citation
- Advanced Functional Materials, v.35, no.49, pp 1 - 11
- Pages
- 11
- Indexed
- SCIE
SCOPUS
- Journal Title
- Advanced Functional Materials
- Volume
- 35
- Number
- 49
- Start Page
- 1
- End Page
- 11
- URI
- https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/210110
- DOI
- 10.1002/adfm.202512407
- ISSN
- 1616-301X
1616-3028
- Abstract
- Reinforcement learning (RL) relies on Gaussian and sigmoid functions to balance exploration and exploitation, but implementing these functions in hardware typically requires iterative computations, increasing power and circuit complexity. Here, Gaussian-sigmoid reinforcement transistors (GS-RTs) are reported that integrate both activation functions into a single device. The transistors feature a vertical n-p-i-p heterojunction stack composed of a-IGZO and DNTT, with asymmetric source-drain contacts and a parylene interlayer that enables voltage-tunable transitions between sigmoid, Gaussian, and mixed responses. This architecture emulates the behavior of three transistors in one, reducing the required circuit complexity from dozens of transistors to fewer than a few. The GS-RT exhibits a peak current of 5.95 mu A at VG = -17 V and supports nonlinear transfer characteristics suited for neuromorphic computing. In a multi-armed bandit task, GS-RT-based RL policies demonstrate 20% faster convergence and 30% higher final reward compared to conventional sigmoid- or Gaussian-based approaches. Extending this advantage further, GS-RT-based activation function in deep RL for cartpole balancing significantly outperforms the traditional ReLU-based activation function in terms of faster learning and tolerance to input perturbations.
- Files in This Item
-
Go to Link
- Appears in
Collections - 서울 공과대학 > 서울 융합전자공학부 > 1. Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.