ReLU가 합성된 행렬 곱 연산의 부분 생략을 통한 딥 러닝 모델 추론 시간 개선

김성균; 안건주; 김나훈; 서지원

doi:10.5626/KTCP.2022.28.3.139

Detailed Information

Cited 0 time in webofscience

Cited 0 time in scopus

Metadata Downloads

ReLU가 합성된 행렬 곱 연산의 부분 생략을 통한 딥 러닝 모델 추론 시간 개선

Full metadata record

DC Field	Value	Language
dc.contributor.author	김성균	-
dc.contributor.author	안건주	-
dc.contributor.author	김나훈	-
dc.contributor.author	서지원	-
dc.date.accessioned	2022-07-06T08:00:12Z	-
dc.date.available	2022-07-06T08:00:12Z	-
dc.date.issued	2022-03	-
dc.identifier.issn	2383-6318	-
dc.identifier.issn	2383-6326	-
dc.identifier.uri	https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/139226	-
dc.description.abstract	최근 딥 러닝의 활용 분야가 넓어지는 추세이며, 많은 파라미터들을 가지고 있는 Large-Scale 딥 러닝 모델들이 좋은 성능을 보이는 경향이 있다. 그리고 크기가 큰 모델을 이용한 딥 러닝 추론은 필연적으로 많은 자원과 긴 시간을 요구하므로 딥 러닝 모델의 효율적인 활용을 위해서는 추론 시간의 단축이 필수적으로 요구된다. 본 논문에서는 딥 러닝 추론 과정에서 활성화 함수인 Rectified Linear Unit 과 행렬 곱을 융합하고, 두 연산과정에서 계산할 출력 값의 부호를 미리 예측하여 계산의 양을 줄이는 네 가지 방법을 제안하며, 네 가지 계산 생략 방법의 비교를 통해 정확도를 거의 해치지 않는 선에서 계산의 양을 줄여 추론 시간을 절약하는 최적의 방안을 도출한다.	-
dc.description.abstract	Deep learning has expanded its utilization, and large-scale deep learning models containing many parameters tend to perform well. As large-scale models inevitably require many resources and long inference time, reducing the inference time is essential for efficient utilization of deep learning models. We fuse the activation function Rectified Linear Unit and matrix multiplication in the inference process, and reduce the amount of computation by predicting the sign of the output values to be computed in the computational processes. We propose four methods for reducing the computation and derive an optimal method that saves inference time with low accuracy loss by reducing the amount of computation by comparing these four methods.	-
dc.format.extent	7	-
dc.language	한국어	-
dc.language.iso	KOR	-
dc.publisher	한국정보과학회	-
dc.title	ReLU가 합성된 행렬 곱 연산의 부분 생략을 통한 딥 러닝 모델 추론 시간 개선	-
dc.title.alternative	Improving the Inference Time of the Deep Learning Model with Partial Skip of ReLU-fused Matrix Multiplication Operations	-
dc.type	Article	-
dc.publisher.location	대한민국	-
dc.identifier.doi	10.5626/KTCP.2022.28.3.139	-
dc.identifier.bibliographicCitation	정보과학회 컴퓨팅의 실제 논문지, v.28, no.3, pp 139 - 145	-
dc.citation.title	정보과학회 컴퓨팅의 실제 논문지	-
dc.citation.volume	28	-
dc.citation.number	3	-
dc.citation.startPage	139	-
dc.citation.endPage	145	-
dc.identifier.kciid	ART002820324	-
dc.description.isOpenAccess	N	-
dc.description.journalRegisteredClass	kci	-
dc.subject.keywordAuthor	딥 러닝 최적화	-
dc.subject.keywordAuthor	계산 생략	-
dc.subject.keywordAuthor	완전 연결 레이어	-
dc.subject.keywordAuthor	추론 최적화	-
dc.subject.keywordAuthor	deep learning optimization	-
dc.subject.keywordAuthor	omitted computation	-
dc.subject.keywordAuthor	fully-connected layer	-
dc.subject.keywordAuthor	inference optimization	-
dc.identifier.url	https://www.dbpia.co.kr/journal/articleDetail?nodeId=NODE11044919&language=ko_KR&hasTopBanner=true	-

Files in This Item: Go to Link

Appears in Collections: 서울 공과대학 > 서울 컴퓨터소프트웨어학부 > 1. Journal Articles

Show simple item record

qrcode

Altmetrics

Total Views & Downloads

RSS_1.0 RSS_2.0 ATOM_1.0

222, Wangsimni-ro, Seongdong-gu, Seoul, 04763, Korea+82-2-2220-1366

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Altmetrics

Total Views & Downloads

BROWSE