Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

Comparative Analysis of Energy Management Strategies for HEV: Dynamic Programming and Reinforcement Learningopen access

Authors
Lee, HeeyunSong, ChangheeKim, NamwookCha, Suk Won
Issue Date
Apr-2020
Publisher
IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
Keywords
Hybrid electric vehicles; Energy management; Optimal control; Engines; Dynamic programming; Fuel economy; Learning (artificial intelligence); Dynamic programming; hybrid electric vehicle; optimal control; reinforcement learning; power management
Citation
IEEE ACCESS, v.8, pp 67112 - 67123
Pages
12
Indexed
SCIE
SCOPUS
Journal Title
IEEE ACCESS
Volume
8
Start Page
67112
End Page
67123
URI
https://scholarworks.bwise.kr/erica/handle/2021.sw.erica/1871
DOI
10.1109/ACCESS.2020.2986373
ISSN
2169-3536
2169-3536
Abstract
Energy management strategy is an important factor in determining the fuel economy of hybrid electric vehicles; thus, much research on how to distribute the required power to engines and motors of hybrid vehicles is required. Recently, various studies have been conducted based on reinforcement learning to optimally control the hybrid electric vehicle. In fact, the fundamental control approach of reinforcement learning shares many control frameworks with the control approach by using deterministic dynamic programming or stochastic dynamic programming. In this study, we compare the reinforcement learning based strategy by using these dynamic programming-based control approaches. For optimal control of hybrid electric vehicle, each control method was compared in terms of fuel efficiency by performing simulation by using various driving cycles. Based on our simulations, we showed the reinforcement learning-based strategy can obtain global optimality in the optimal control problem with an infinite horizon, which can also be obtained by stochastic dynamic programming. We also showed that the reinforcement learning-based strategy can present a solution close to the optimal one using deterministic dynamic programming, while a reinforcement learning-based strategy is more appropriate for a time variant controller with boundary value constraints. In addition, we verified the convergence characteristics of the control strategy based on reinforcement learning, when transfer learning was performed through value initialization using stochastic dynamic programming.
Files in This Item
Appears in
Collections
COLLEGE OF ENGINEERING SCIENCES > DEPARTMENT OF MECHANICAL ENGINEERING > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Kim, Nam wook photo

Kim, Nam wook
ERICA 공학대학 (DEPARTMENT OF MECHANICAL ENGINEERING)
Read more

Altmetrics

Total Views & Downloads

BROWSE