Adaptive reinforcement learning for energy-efficient high-recovery closed-circuit reverse osmosis

Moon, Jeongwoo; Yun, Byeongchan; Park, Kiho; Kim, Seong-Su; Lee, Youngjoo; Jeong, Kwanho; Cho, Kyung Hwa

doi:10.1016/j.watres.2026.125855

Detailed Information

Cited 0 time in webofscience

Cited 0 time in scopus

Metadata Downloads

Adaptive reinforcement learning for energy-efficient high-recovery closed-circuit reverse osmosis

Authors: Moon, Jeongwoo; Yun, Byeongchan; Park, Kiho; Kim, Seong-Su; Lee, Youngjoo; Jeong, Kwanho; Cho, Kyung Hwa

Issue Date: Jul-2026

Publisher: Elsevier Ltd

Keywords: Closed circuit reverse osmosis; Dynamic modeling; Process control; Proximal policy optimization; Reinforcement learning

Citation: Water Research, v.299, pp 1 - 18

Pages: 18

Indexed: SCIE
SCOPUS

Journal Title: Water Research

Volume: 299

Start Page: 1

End Page: 18

URI: https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/213087

DOI: 10.1016/j.watres.2026.125855

ISSN: 0043-1354
1879-2448

Abstract: Closed-circuit reverse osmosis (CCRO) achieves high recovery; however, its semi-batch purge-and-refill cycles complicate control and optimization. Unlike static rule-based operation, reinforcement learning is increasingly used for operational optimization to adaptively select real-time control setpoints under changing conditions. In this study, a data-calibrated dynamic CCRO simulator was integrated with a reinforcement learning control framework and evaluated under realistic plant-variable conditions. Plant behavior was stably reproduced by the simulator, achieving low RMSE across permeate flow, circulated concentrate flow, membrane inlet pressure, and permeate concentration. A proximal policy optimization agent was trained across 24 environmental settings with over 10 million steps, and the best-performing policy was identified through evaluation of 375 predefined scenarios and nonparametric statistical analyses. Across the evaluation scenarios, the resulting agent achieved a mean specific energy consumption (SEC) of 0.489 kWh/m³ and a mean recovery rate of 95.5%, outperforming a static rule-based controller by 13.14% in SEC and 3.92% in water recovery through adaptive control. Interpretability and feasibility were further supported by explainable AI and edge execution-time analyses on representative hardware. Overall, the proposed framework provides a promising alternative to conventional rule-based CCRO operation in small-scale decentralized plants where continuous expert supervision is impractical.

Files in This Item: Go to Link

Appears in Collections: 서울 공과대학 > 서울 화학공학과 > 1. Journal Articles

Show full item record

qrcode

Related Researcher

Researcher PARK, KIHO photo

PARK, KIHO: COLLEGE OF ENGINEERING (DEPARTMENT OF CHEMICAL ENGINEERING)

Read more

Altmetrics

Total Views & Downloads

RSS_1.0 RSS_2.0 ATOM_1.0

222, Wangsimni-ro, Seongdong-gu, Seoul, 04763, Korea+82-2-2220-1366

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Related Researcher

Altmetrics

Total Views & Downloads

BROWSE