Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

Cutting-Edge Inference: Dynamic DNN Model Partitioning and Resource Scaling for Mobile AI

Full metadata record
DC Field Value Language
dc.contributor.authorLim, Jeong-A-
dc.contributor.authorLee, Joohyun-
dc.contributor.authorKwak, Jeongho-
dc.contributor.authorKim, Yeongjin-
dc.date.accessioned2024-10-10T00:30:20Z-
dc.date.available2024-10-10T00:30:20Z-
dc.date.issued2024-11-
dc.identifier.issn1939-1374-
dc.identifier.urihttps://scholarworks.bwise.kr/erica/handle/2021.sw.erica/120652-
dc.description.abstractRecently, applications using artificial intelligence (AI) technique in mobile devices such as augmented reality have been extensively pervasive. The hardware specifications of mobile devices, dynamic service demands, stochastic network states, and characteristics of DNN (Deep Neural Network) models affect the quality of experience (QoE) of such applications. In this paper, we propose CutEdge, that leverages a virtual queue-based Lyapunov optimization framework to jointly optimize DNN model partitioning between a mobile device and a mobile edge computing (MEC) server and processing/networking resources in a mobile device with respect to internal/external system dynamics. Specifically, CutEdge makes decisions of (i) the partition point of DNN model between the mobile device and MEC server, (ii) GPU clock frequency, and (iii) transmission rates in a mobile device, simultaneously. Then, we theoretically show the optimal trade-off curves among energy consumption, throughput, and end-to-end latency yielded by CutEdge where such QoE metrics have not been jointly addressed in the previous studies. Moreover, we show the impact of joint optimization of three control parameters on the performances via real trace-driven simulations. Finally, we show the superiority of CutEdge over the existing algorithms by experiment on top of implemented testbed using an embedded AI device and an MEC server. © 2008-2012 IEEE.-
dc.format.extent16-
dc.language영어-
dc.language.isoENG-
dc.publisherInstitute of Electrical and Electronics Engineers Inc.-
dc.titleCutting-Edge Inference: Dynamic DNN Model Partitioning and Resource Scaling for Mobile AI-
dc.typeArticle-
dc.publisher.location미국-
dc.identifier.doi10.1109/TSC.2024.3466848-
dc.identifier.scopusid2-s2.0-85205146701-
dc.identifier.wosid001386516500010-
dc.identifier.bibliographicCitationIEEE Transactions on Services Computing, v.17, no.6, pp 1 - 16-
dc.citation.titleIEEE Transactions on Services Computing-
dc.citation.volume17-
dc.citation.number6-
dc.citation.startPage1-
dc.citation.endPage16-
dc.type.docTypeArticle-
dc.description.isOpenAccessN-
dc.description.journalRegisteredClassscie-
dc.description.journalRegisteredClassscopus-
dc.relation.journalResearchAreaComputer Science-
dc.relation.journalWebOfScienceCategoryComputer Science, Information Systems-
dc.relation.journalWebOfScienceCategoryComputer Science, Software Engineering-
dc.subject.keywordAuthordeep learning-
dc.subject.keywordAuthorDNN model partitioning-
dc.subject.keywordAuthormobile edge computing-
dc.subject.keywordAuthormobile vision application-
dc.subject.keywordAuthorquality of experience-
dc.identifier.urlhttps://ieeexplore.ieee.org/document/10693347-
Files in This Item
Go to Link
Appears in
Collections
COLLEGE OF ENGINEERING SCIENCES > SCHOOL OF ELECTRICAL ENGINEERING > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Lee, Joo hyun photo

Lee, Joo hyun
ERICA 공학대학 (SCHOOL OF ELECTRICAL ENGINEERING)
Read more

Altmetrics

Total Views & Downloads

BROWSE