Multi-Agent Deep Q-Networks for Efficient Edge Federated Learning Communications in Software-Defined IoT

Tam, Prohim; Math, Sa; Lee, Ahyoung; Kim, Seokhoon

Authors: Tam, Prohim; Math, Sa; Lee, Ahyoung; Kim, Seokhoon

Issue Date: 1-Jan-2022

Publisher: Tech Science Press

Keywords: Deep Q-networks; federated learning; network functions virtualization; quality of service; software-defined networking

Citation: Computers, Materials and Continua, v.71, no.2, pp. 3319-3335

Pages: 17

Journal Title: Computers, Materials and Continua

Volume: 71

Number: 2

Start Page: 3319

End Page: 3335

URI: https://scholarworks.bwise.kr/sch/handle/2021.sw.sch/20150

DOI: 10.32604/cmc.2022.023215

ISSN: 1546-2218; 1546-2226

Abstract: Federated learning (FL) uses distributed on-device computation to improve model performance through the interaction of local model updates and global model distributions in aggregation averaging processes. However, in large-scale heterogeneous Internet of Things (IoT) cellular networks, massive multi-dimensional model update iterations and resource-constrained computation remain significant challenges. This paper introduces a system model that converges software-defined networking (SDN) and network functions virtualization (NFV) to enable device/resource abstractions and provide NFV-enabled edge FL (eFL) aggregation servers for advancing automation and controllability. Multi-agent deep Q-networks (MADQNs) aim to enforce self-learning softwarization, optimize resource allocation policies, and recommend computation offloading decisions. Using gathered network conditions and resource states, the proposed agent explores various actions to estimate the expected long-term reward in a particular state observation. In the exploration phase, optimal actions for joint resource allocation and offloading decisions in different possible states are obtained by maximum Q-value selection. An action-based virtual network function (VNF) forwarding graph (VNFFG) is orchestrated to map VNFs towards an eFL aggregation server with sufficient communication and computation resources in the NFV infrastructure (NFVI). The proposed scheme identifies deficient allocation actions, modifies the VNF backup instances, and reallocates virtual resources for the exploitation phase. A deep neural network (DNN) is used as a value function approximator, and an epsilon-greedy algorithm balances exploration and exploitation. The scheme primarily considers the criticality of FL model services and congestion states to optimize the long-term policy. Simulation results show that the proposed scheme outperforms reference schemes on Quality of Service (QoS) performance metrics, including packet drop ratio, packet drop count, packet delivery ratio, delay, and throughput.
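
Illustrative note: the abstract mentions maximum Q-value selection and an epsilon-greedy balance between exploration and exploitation. The following is a minimal Python sketch of those two generic ideas, not the authors' implementation. The function names (`epsilon_greedy`, `td_target`), the discount factor `gamma`, and the example Q-values are assumptions introduced here for illustration; the one-step target shown is the standard Q-learning form and is not taken from the paper.

```python
import random
from typing import Sequence


def epsilon_greedy(q_values: Sequence[float], epsilon: float, rng: random.Random) -> int:
    """Return an action index: random with probability epsilon, else argmax of Q."""
    if not q_values:
        raise ValueError("q_values must not be empty")
    if rng.random() < epsilon:
        # Explore: pick any action uniformly at random.
        return rng.randrange(len(q_values))
    # Exploit: pick the action with the maximum estimated Q-value.
    return max(range(len(q_values)), key=lambda a: q_values[a])


def td_target(reward: float, next_q_values: Sequence[float], gamma: float, done: bool) -> float:
    """Standard one-step Q-learning target: r + gamma * max over next-state Q-values."""
    if done:
        return reward
    return reward + gamma * max(next_q_values)


# Hypothetical Q-values for three candidate allocation/offloading actions.
q = [0.2, 0.9, 0.1]
rng = random.Random(0)
greedy_action = epsilon_greedy(q, epsilon=0.0, rng=rng)  # epsilon 0 always exploits
target = td_target(reward=1.0, next_q_values=q, gamma=0.5, done=False)
```

In a deep Q-network the list `q` would come from a neural network evaluated on the current state, and `td_target` would serve as the regression target for the chosen action's Q-value; how the paper defines states, actions, and rewards is not specified in this record.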

Appears in Collections: ETC > 1. Journal Articles

Related Researcher

Kim, Seok hoon: College of Software Convergence (Department of Computer Software Engineering)
