Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

Improving Inference Time of Deep Learning Model with Partial Skip of ReLU-fused Matrix Multiplication Operations

Full metadata record
DC Field Value Language
dc.contributor.authorKim, Sungkyun-
dc.contributor.authorKim, Jaemin-
dc.contributor.authorKim, Nahun-
dc.contributor.authorKang, Mincheal-
dc.contributor.authorSeo, Jiwon-
dc.date.accessioned2022-07-06T04:10:07Z-
dc.date.available2022-07-06T04:10:07Z-
dc.date.created2022-06-03-
dc.date.issued2022-04-
dc.identifier.issn0000-0000-
dc.identifier.urihttps://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/138782-
dc.description.abstractDeep learning has been expanding its application, while large-scale models tend to perform well. However, as such a model inevitably requires a vast amount of resources and computations, lengthy inference time is a crucial, but essential, consequence that needs to be optimized for the efficient utilization of deep learning. To achieve the goal, we aim at fusing the Rectified Linear Unit and matrix multiplication in the inference process, which we may reduce the total amount of computation by predicting the sign bit of output value. We propose four methods of prediction and statistically choose an optimal method for reducing inference time with low accuracy loss. © 2022 IEEE.-
dc.language영어-
dc.language.isoen-
dc.publisherInstitute of Electrical and Electronics Engineers Inc.-
dc.titleImproving Inference Time of Deep Learning Model with Partial Skip of ReLU-fused Matrix Multiplication Operations-
dc.typeArticle-
dc.contributor.affiliatedAuthorSeo, Jiwon-
dc.identifier.doi10.1109/ICEIC54506.2022.9748210-
dc.identifier.scopusid2-s2.0-85128867551-
dc.identifier.wosid000942023400007-
dc.identifier.bibliographicCitation2022 International Conference on Electronics, Information, and Communication, ICEIC 2022, pp.1 - 4-
dc.relation.isPartOf2022 International Conference on Electronics, Information, and Communication, ICEIC 2022-
dc.citation.title2022 International Conference on Electronics, Information, and Communication, ICEIC 2022-
dc.citation.startPage1-
dc.citation.endPage4-
dc.type.rimsART-
dc.type.docTypeProceedings Paper-
dc.description.journalClass1-
dc.description.isOpenAccessN-
dc.description.journalRegisteredClassscopus-
dc.relation.journalResearchAreaEngineering-
dc.relation.journalResearchAreaTelecommunications-
dc.relation.journalWebOfScienceCategoryEngineering, Electrical & Electronic-
dc.relation.journalWebOfScienceCategoryTelecommunications-
dc.subject.keywordPlusDeep learning-
dc.subject.keywordPlusDeep learning optimization-
dc.subject.keywordPlusFully-connected layer-
dc.subject.keywordPlusInference optimization-
dc.subject.keywordPlusITS applications-
dc.subject.keywordPlusLarge-scale modeling-
dc.subject.keywordPlusLearning models-
dc.subject.keywordPlusLearning optimizations-
dc.subject.keywordPlusMatrix multiplication operation-
dc.subject.keywordPlusOmitted computation-
dc.subject.keywordPlusOptimisations-
dc.subject.keywordPlusMatrix algebra-
dc.subject.keywordAuthordeep learning optimization-
dc.subject.keywordAuthorfully-connected layer-
dc.subject.keywordAuthorinference optimization-
dc.subject.keywordAuthoromitted computation-
dc.identifier.urlhttps://ieeexplore.ieee.org/document/9748210-
Files in This Item
Go to Link
Appears in
Collections
서울 공과대학 > 서울 컴퓨터소프트웨어학부 > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Seo, Ji won photo

Seo, Ji won
COLLEGE OF ENGINEERING (SCHOOL OF COMPUTER SCIENCE)
Read more

Altmetrics

Total Views & Downloads

BROWSE