A Change Detection Based Framework for Piecewise Stationary Multi Armed Bandit Problem
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Liu, Fang | - |
dc.contributor.author | Lee, Joohyun Send mail to Lee J. | - |
dc.contributor.author | Shroff, Ness | - |
dc.date.accessioned | 2023-08-16T08:32:27Z | - |
dc.date.available | 2023-08-16T08:32:27Z | - |
dc.date.issued | 2018-02 | - |
dc.identifier.issn | 2159-5399 | - |
dc.identifier.issn | 2374-3468 | - |
dc.identifier.uri | https://scholarworks.bwise.kr/erica/handle/2021.sw.erica/114375 | - |
dc.description.abstract | The multi-armed bandit problem has been extensively studied under the stationary assumption. However in reality, this assumption often does not hold because the distributions of rewards themselves may change over time. In this paper, we propose a change-detection (CD) based framework for multiarmed bandit problems under the piecewise-stationary setting, and study a class of change-detection based UCB (Upper Confidence Bound) policies, CD-UCB, that actively detects change points and restarts the UCB indices. We then develop CUSUM-UCB and PHT-UCB, that belong to the CD-UCB class and use cumulative sum (CUSUM) and Page-Hinkley Test (PHT) to detect changes. We show that CUSUM-UCB obtains the best known regret upper bound under mild assumptions. We also demonstrate the regret reduction of the CD-UCB policies over arbitrary Bernoulli rewards and Yahoo! datasets of webpage click-through rates. Copyright © 2018, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved. | - |
dc.format.extent | 8 | - |
dc.language | 영어 | - |
dc.language.iso | ENG | - |
dc.publisher | AAAI press | - |
dc.title | A Change Detection Based Framework for Piecewise Stationary Multi Armed Bandit Problem | - |
dc.type | Article | - |
dc.publisher.location | 영국 | - |
dc.identifier.scopusid | 2-s2.0-85060431212 | - |
dc.identifier.bibliographicCitation | Proceedings of the AAAI Conference on Artificial Intelligence, pp 3651 - 3658 | - |
dc.citation.title | Proceedings of the AAAI Conference on Artificial Intelligence | - |
dc.citation.startPage | 3651 | - |
dc.citation.endPage | 3658 | - |
dc.type.docType | Proceeding | - |
dc.description.isOpenAccess | N | - |
dc.description.journalRegisteredClass | scopus | - |
dc.identifier.url | https://arxiv.org/abs/1711.03539 | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
55 Hanyangdeahak-ro, Sangnok-gu, Ansan, Gyeonggi-do, 15588, Korea+82-31-400-4269 sweetbrain@hanyang.ac.kr
COPYRIGHT © 2021 HANYANG UNIVERSITY. ALL RIGHTS RESERVED.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.