임베디드 CPU에서 Winograd 컨볼루션 알고리즘에 기반한 Depthwise 컨볼루션의 구현
- Authors
- 노수민; 박상수; 이민영; 정기석
- Issue Date
- Nov-2022
- Publisher
- 대한임베디드공학회
- Keywords
- Depthwise convolution; Winograd convolution algorithm; SIMD; Tensor ordering
- Citation
- 2022 대한임베디드공학회 추계학술대회, v.0, no.0, pp.270 - 273
- Indexed
- OTHER
- Journal Title
- 2022 대한임베디드공학회 추계학술대회
- Volume
- 0
- Number
- 0
- Start Page
- 270
- End Page
- 273
- URI
- https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/188559
- Abstract
- The depthwise convolution layer is often used in lightweight convolutional neural networks (CNNs). In this study, the Winograd convolution algorithm is used to carry out depthwise convolution, and the proposed method is implemented using NEON instructions on an ARM Cortex-A53 CPU. The performance of the proposed method is evaluated with respect to two tensor storing orders (NCHW and NHWC) and various chunk sizes. In addition, we evaluate end-to-end inference latency of MobileNet-V2 using our method. The results show our method achieves about 2x speedup against the conventional convolution lowering method.
- Files in This Item
-
Go to Link
- Appears in
Collections - 서울 공과대학 > 서울 융합전자공학부 > 1. Journal Articles
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.