Detailed Information


Overlapped Data Processing Scheme for Accelerating Training and Validation in Machine Learning

Full metadata record
DC Field | Value | Language
dc.contributor.author | Choi, Jinseo | -
dc.contributor.author | Kang, Donghyun | -
dc.date.accessioned | 2023-03-13T04:40:04Z | -
dc.date.available | 2023-03-13T04:40:04Z | -
dc.date.created | 2023-03-13 | -
dc.date.issued | 2022-07 | -
dc.identifier.issn | 2169-3536 | -
dc.identifier.uri | https://scholarworks.bwise.kr/gachon/handle/2020.sw.gachon/87050 | -
dc.description.abstract | For several years, machine learning (ML) technologies have opened up new opportunities to solve traditional problems using a rich set of hardware resources. Unfortunately, ML technologies sometimes waste available hardware resources (e.g., CPU and GPU) because they spend a lot of time waiting for a previous step in the ML procedure to finish. In this paper, we first study the data flows of the ML procedure in detail to find avoidable performance bottlenecks. Then, we propose ol.data, the first software-based data processing scheme that aims to (1) overlap training and validation steps in one epoch or two adjacent epochs, and (2) perform validation steps in parallel, which significantly improves both computation time and resource utilization. To confirm the effectiveness of ol.data, we implemented a convolutional neural network (CNN) model with ol.data and compared it with the traditional approaches, NumPy (i.e., baseline) and tf.data, on three different datasets. As a result, we confirmed that ol.data reduces the inference time by up to 41.8% and increases the utilization of CPU and GPU resources by up to 75.7% and 38.7%, respectively. | -
dc.language | English | -
dc.language.iso | en | -
dc.publisher | IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC | -
dc.relation.isPartOf | IEEE ACCESS | -
dc.title | Overlapped Data Processing Scheme for Accelerating Training and Validation in Machine Learning | -
dc.type | Article | -
dc.type.rims | ART | -
dc.description.journalClass | 1 | -
dc.identifier.wosid | 000838688800001 | -
dc.identifier.doi | 10.1109/ACCESS.2022.3189373 | -
dc.identifier.bibliographicCitation | IEEE ACCESS, v.10, pp.72015 - 72023 | -
dc.description.isOpenAccess | Y | -
dc.identifier.scopusid | 2-s2.0-85134259226 | -
dc.citation.endPage | 72023 | -
dc.citation.startPage | 72015 | -
dc.citation.title | IEEE ACCESS | -
dc.citation.volume | 10 | -
dc.contributor.affiliatedAuthor | Kang, Donghyun | -
dc.type.docType | Article | -
dc.subject.keywordAuthor | Training | -
dc.subject.keywordAuthor | Graphics processing units | -
dc.subject.keywordAuthor | Task analysis | -
dc.subject.keywordAuthor | Data processing | -
dc.subject.keywordAuthor | Data models | -
dc.subject.keywordAuthor | Tensors | -
dc.subject.keywordAuthor | Hardware | -
dc.subject.keywordAuthor | Machine learning | -
dc.subject.keywordAuthor | TensorFlow | -
dc.subject.keywordAuthor | CPU | -
dc.subject.keywordAuthor | GPU utilization | -
dc.subject.keywordAuthor | overlapping | -
dc.subject.keywordAuthor | multiple threads | -
dc.relation.journalResearchArea | Computer Science | -
dc.relation.journalResearchArea | Engineering | -
dc.relation.journalResearchArea | Telecommunications | -
dc.relation.journalWebOfScienceCategory | Computer Science, Information Systems | -
dc.relation.journalWebOfScienceCategory | Engineering, Electrical & Electronic | -
dc.relation.journalWebOfScienceCategory | Telecommunications | -
dc.description.journalRegisteredClass | scie | -
dc.description.journalRegisteredClass | scopus | -
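
The overlap described in the abstract (validating the model from one epoch while the next epoch trains) can be illustrated with a minimal Python sketch. This is not the ol.data implementation, which this record does not include. The functions `train_one_epoch`, `validate`, and `run_overlapped` are hypothetical names, and the "model" is a toy list of floats chosen only so that the control flow is easy to follow.

```python
from concurrent.futures import ThreadPoolExecutor


def train_one_epoch(weights, train_data):
    # Toy "training" step (assumption): move each weight halfway
    # toward the mean of the training data.
    target = sum(train_data) / len(train_data)
    return [w + 0.5 * (target - w) for w in weights]


def validate(weights, val_data):
    # Toy "validation" step (assumption): mean absolute error of the
    # weights against the mean of the validation data.
    target = sum(val_data) / len(val_data)
    return sum(abs(w - target) for w in weights) / len(weights)


def run_overlapped(weights, train_data, val_data, epochs):
    """Train for `epochs` epochs; validation of epoch k runs in a
    background thread while epoch k+1 is being trained."""
    losses = []
    pending = None  # future for the previous epoch's validation
    with ThreadPoolExecutor(max_workers=1) as pool:
        for _ in range(epochs):
            weights = train_one_epoch(weights, train_data)
            if pending is not None:
                # The previous validation overlapped with the training
                # call above; collect its result now.
                losses.append(pending.result())
            # Validate a snapshot so later training cannot modify it.
            pending = pool.submit(validate, list(weights), val_data)
        if pending is not None:
            losses.append(pending.result())
    return weights, losses


if __name__ == "__main__":
    final_weights, val_losses = run_overlapped([0.0, 0.0], [2.0, 2.0], [2.0], 3)
    print(final_weights, val_losses)
```

In CPython, threads only yield a real speedup when the overlapped work releases the global interpreter lock, for example during GPU kernel execution or I/O, so the sketch shows the scheduling pattern rather than a performance result.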
Files in This Item
There are no files associated with this item.
Appears in Collections
ETC > 1. Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Kang, Donghyun
College of IT Convergence (School of Computer Engineering, Computer Engineering Major)