Contrastive-Learning-Based Decision Making for Dynamic Time-Linkage Optimization
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Liu, Xiao-Fang | - |
dc.contributor.author | Gao, Meng | - |
dc.contributor.author | Fang, Yongchun | - |
dc.contributor.author | Zhan, Zhi-Hui | - |
dc.contributor.author | Zhang, Jun | - |
dc.date.accessioned | 2025-10-13T04:30:31Z | - |
dc.date.available | 2025-10-13T04:30:31Z | - |
dc.date.issued | 2025-09 | - |
dc.identifier.issn | 2168-2216 | - |
dc.identifier.issn | 2168-2232 | - |
dc.identifier.uri | https://scholarworks.bwise.kr/erica/handle/2021.sw.erica/126661 | - |
dc.description.abstract | In dynamic time-linkage optimization, current decisions influence the future state of environments. To make good decisions that have a positive impact on future states, existing methods usually build a model to predict the future rewards of solutions for decision making. However, these prediction models present low accuracy since decision data are not enough to train such a complex model. To address this issue, this article proposes a contrastive-learning-based decision making (CLDM) method, which builds a contrastive model to learn the relationship between solutions but not absolute rewards and adopts a quick decision strategy to select solutions. In CLDM, a clustering-based time-linkage detection (CD) strategy is developed to measure the intensity of the time linkage, which determines whether to make decisions based on future rewards. To represent the relative relationship between solutions, a large number of contrastive samples are constructed using the limited historical decisions. A contrastive model is trained for solution comparison in terms of the combination of current fitness and future rewards. Candidate solutions are clustered into multiple groups to filter poor ones, and a few solutions are preserved to rank using the contrastive model. The winner is taken as the decision solution. Integrating CLDM into particle swarm optimization (PSO), a new algorithm named contrastive-learning-based PSO (CL-PSO) is put forward. Experimental results on multiple dynamic time-linkage optimization instances demonstrate that CL-PSO outperforms state-of-the-art algorithms in terms of solution quality. CL-PSO can also well solve the mobile robot path planning problem. | - |
dc.language | 영어 | - |
dc.language.iso | ENG | - |
dc.publisher | Institute of Electrical and Electronics Engineers Inc. | - |
dc.title | Contrastive-Learning-Based Decision Making for Dynamic Time-Linkage Optimization | - |
dc.type | Article | - |
dc.publisher.location | 미국 | - |
dc.identifier.doi | 10.1109/TSMC.2025.3611797 | - |
dc.identifier.scopusid | 2-s2.0-105017163021 | - |
dc.identifier.bibliographicCitation | IEEE Transactions on Systems, Man, and Cybernetics: Systems | - |
dc.citation.title | IEEE Transactions on Systems, Man, and Cybernetics: Systems | - |
dc.type.docType | Article | - |
dc.description.isOpenAccess | N | - |
dc.description.journalRegisteredClass | scie | - |
dc.description.journalRegisteredClass | scopus | - |
dc.subject.keywordAuthor | Contrastive learning | - |
dc.subject.keywordAuthor | dynamic time-linkage optimization | - |
dc.subject.keywordAuthor | evolutionary computation | - |
dc.subject.keywordAuthor | particle swarm optimization (PSO) | - |
dc.subject.keywordAuthor | prediction | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
55 Hanyangdeahak-ro, Sangnok-gu, Ansan, Gyeonggi-do, 15588, Korea+82-31-400-4269 sweetbrain@hanyang.ac.kr
COPYRIGHT © 2021 HANYANG UNIVERSITY. ALL RIGHTS RESERVED.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.