Gaussian-Sigmoid Reinforcement Transistors: Resolving Exploration-Exploitation Trade-Off Through Gate Voltage-Controlled Activation Functions

Park, Jisoo; Seo, Juhyung; Koo, Ryun-Han; Jayasuriya, Dinithi; Jayasinghe, Nethmi; Shin, Wonjun; Trivedi, Amit R.; Yoo, Hocheon

doi:10.1002/adfm.202512407

Detailed Information

Cited 0 time in webofscience

Cited 0 time in scopus

Metadata Downloads

Gaussian-Sigmoid Reinforcement Transistors: Resolving Exploration-Exploitation Trade-Off Through Gate Voltage-Controlled Activation Functions

Authors: Park, Jisoo; Seo, Juhyung; Koo, Ryun-Han; Jayasuriya, Dinithi; Jayasinghe, Nethmi; Shin, Wonjun; Trivedi, Amit R.; Yoo, Hocheon

Issue Date: Dec-2025

Publisher: John Wiley & Sons Ltd.

Keywords: gaussian-sigmoid mixed function; heterojunction; neuromorphic; reinforcement learning; thin film transistor

Citation: Advanced Functional Materials, v.35, no.49, pp 1 - 11

Pages: 11

Indexed: SCIE
SCOPUS

Journal Title: Advanced Functional Materials

Volume: 35

Number: 49

Start Page: 1

End Page: 11

URI: https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/210110

DOI: 10.1002/adfm.202512407

ISSN: 1616-301X
1616-3028

Abstract: Reinforcement learning (RL) relies on Gaussian and sigmoid functions to balance exploration and exploitation, but implementing these functions in hardware typically requires iterative computations, increasing power and circuit complexity. Here, Gaussian-sigmoid reinforcement transistors (GS-RTs) are reported that integrate both activation functions into a single device. The transistors feature a vertical n-p-i-p heterojunction stack composed of a-IGZO and DNTT, with asymmetric source-drain contacts and a parylene interlayer that enables voltage-tunable transitions between sigmoid, Gaussian, and mixed responses. This architecture emulates the behavior of three transistors in one, reducing the required circuit complexity from dozens of transistors to fewer than a few. The GS-RT exhibits a peak current of 5.95 mu A at VG = -17 V and supports nonlinear transfer characteristics suited for neuromorphic computing. In a multi-armed bandit task, GS-RT-based RL policies demonstrate 20% faster convergence and 30% higher final reward compared to conventional sigmoid- or Gaussian-based approaches. Extending this advantage further, GS-RT-based activation function in deep RL for cartpole balancing significantly outperforms the traditional ReLU-based activation function in terms of faster learning and tolerance to input perturbations.

Files in This Item: Go to Link

Appears in Collections: 서울 공과대학 > 서울 융합전자공학부 > 1. Journal Articles

Show full item record

qrcode

Related Researcher

Researcher Yoo, Hocheon photo

Yoo, Hocheon: COLLEGE OF ENGINEERING (SCHOOL OF ELECTRONIC ENGINEERING)

Read more

Altmetrics

Total Views & Downloads

RSS_1.0 RSS_2.0 ATOM_1.0

222, Wangsimni-ro, Seongdong-gu, Seoul, 04763, Korea+82-2-2220-1366

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Related Researcher

Altmetrics

Total Views & Downloads

BROWSE