Bandit Learning based Stable Matching for Decentralized Task Offloading in Dynamic Fog Computing NetworksBandit Learning based Stable Matching for Decentralized Task Offloading in Dynamic Fog Computing Networks
- Other Titles
- Bandit Learning based Stable Matching for Decentralized Task Offloading in Dynamic Fog Computing Networks
- Authors
- Hoa Tran-Dang; 김동성
- Issue Date
- Jun-2024
- Publisher
- 한국통신학회
- Keywords
- Distributed Task Offloading; Dynamic Fog Computing Networks; Thompson Sampling; Multi-armed bandit; Bandit Learnin
- Citation
- Journal of Communications and Networks, v.26, no.3, pp 356 - 365
- Pages
- 10
- Journal Title
- Journal of Communications and Networks
- Volume
- 26
- Number
- 3
- Start Page
- 356
- End Page
- 365
- URI
- https://scholarworks.bwise.kr/kumoh/handle/2020.sw.kumoh/28777
- DOI
- 10.23919/JCN.2024.000017
- ISSN
- 1229-2370
1976-5541
- Abstract
- This paper deals with the task offloading problem in the dynamic fog computing networks (FCNs) that involves the task and resource allocations between a set of task nodes (TNs) having task computation needs and a set of helper nodes (HNs) having available computing resources. The problem is associated with the presence of selfishness and rational nodes of these nodes, in which the objective of TNs is to minimize the task completion time by offloading the tasks to the HNs while the HNs tend to maximize their monetization of task offloading resources. To tackle this problem, we use the fairness and stability principle of matching theory to assign the tasks of TNs to the resources of HNs based on their mutual preferences in a decentralized manner. However, the uncertainty of computing resource availability of HNs as well as dynamics of QoS requirements of tasks result in the lack of preferences of TN side that mainly poses a critical challenge to obtain a stable and reliable matching outcome. To address this challenge, we develop the first, to our knowledge, Thompson sampling based multi-armed bandit (MAB) learning to acquire better exploitation and exploration trade-off, therefore allowing TNs to achieve the informed preference relations of HNs quickly. Motivated by the above considerations, this paper aims at design a bandit learning based matching model (BLM) to realize the efficient decentralized task offloading algorithms in the dynamic FCNs. Extensive simulation results demonstrate the potential advantages of the TS based learning over the ε-greedy and UCB based baselines
- Files in This Item
-
- Appears in
Collections - School of Electronic Engineering > 1. Journal Articles
![qrcode](https://api.qrserver.com/v1/create-qr-code/?size=55x55&data=https://scholarworks.bwise.kr/kumoh/handle/2020.sw.kumoh/28777)
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.