Cryptensor: A Resource-Shared Co-Processor to Accelerate Convolutional Neural Network and Polynomial Convolution
DC Field | Value | Language |
---|---|---|
dc.contributor.author | See, Jin-Chuan | - |
dc.contributor.author | Ng, Hui-Fuang | - |
dc.contributor.author | Tan, Hung-Khoon | - |
dc.contributor.author | Chang, Jing-Jing | - |
dc.contributor.author | Mok, Kai-Ming | - |
dc.contributor.author | Lee, Wai-Kong | - |
dc.contributor.author | Lin, Chih-Yang | - |
dc.date.accessioned | 2024-03-20T13:00:14Z | - |
dc.date.available | 2024-03-20T13:00:14Z | - |
dc.date.issued | 2023-12 | - |
dc.identifier.issn | 0278-0070 | - |
dc.identifier.issn | 1937-4151 | - |
dc.identifier.uri | https://scholarworks.bwise.kr/gachon/handle/2020.sw.gachon/90756 | - |
dc.description.abstract | Practical deployment of convolutional neural network (CNN) and cryptography algorithms on constrained devices is challenging due to their large computation and memory requirements. Developing separate hardware accelerators for AI and cryptography incurs large area consumption, which is undesirable in many applications. This article proposes a viable solution by expressing CNN and cryptography workloads as generic-matrix-multiplication (GEMM) operations and mapping them to the same accelerator to reduce hardware consumption. A novel systolic tensor array (STA) design is proposed to reduce data movement, effectively reducing the operand registers by 2x. Two novel techniques, input layer extension and polynomial factorization, are proposed to mitigate the under-utilization issue found in existing STA architectures. Additionally, the tensor processing element (TPE) is fused using a DSP unit to reduce the look-up table (LUT) and flip-flop (FF) consumption for implementing multipliers. On top of that, a novel memory-efficient factorization technique is proposed to allow computation of polynomial convolution on the same STA. Experimental results show that Cryptensor achieves 21.6% better throughput for a VGG-16 implementation on the XC7Z020 FPGA, and up to 8.40x better energy efficiency compared to an existing ResNet-18 implementation on the XC7Z045 FPGA. Cryptensor can also flexibly support multiple security levels in the NTRU scheme with no additional hardware. The proposed hardware unifies the computation of two different domains that are critical for IoT applications, which greatly reduces the hardware consumption on edge nodes. | - |
dc.format.extent | 14 | - |
dc.language | English | - |
dc.language.iso | ENG | - |
dc.publisher | IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC | - |
dc.title | Cryptensor: A Resource-Shared Co-Processor to Accelerate Convolutional Neural Network and Polynomial Convolution | - |
dc.type | Article | - |
dc.identifier.wosid | 001123254100032 | - |
dc.identifier.doi | 10.1109/TCAD.2023.3296375 | - |
dc.identifier.bibliographicCitation | IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, v.42, no.12, pp 4735 - 4748 | - |
dc.description.isOpenAccess | N | - |
dc.identifier.scopusid | 2-s2.0-85165251883 | - |
dc.citation.endPage | 4748 | - |
dc.citation.startPage | 4735 | - |
dc.citation.title | IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS | - |
dc.citation.volume | 42 | - |
dc.citation.number | 12 | - |
dc.type.docType | Article | - |
dc.publisher.location | United States | - |
dc.subject.keywordAuthor | Convolutional neural network (CNN) | - |
dc.subject.keywordAuthor | cryptography | - |
dc.subject.keywordAuthor | field programmable gate array (FPGA) | - |
dc.subject.keywordAuthor | generic-matrix-multiplication (GEMM) | - |
dc.subject.keywordAuthor | polynomial convolution | - |
dc.subject.keywordAuthor | ResNet-18 | - |
dc.subject.keywordAuthor | systolic tensor array (STA) | - |
dc.subject.keywordAuthor | VGG-16 | - |
dc.subject.keywordPlus | ARRAY | - |
dc.relation.journalResearchArea | Computer Science | - |
dc.relation.journalResearchArea | Engineering | - |
dc.relation.journalWebOfScienceCategory | Computer Science, Hardware & Architecture | - |
dc.relation.journalWebOfScienceCategory | Computer Science, Interdisciplinary Applications | - |
dc.relation.journalWebOfScienceCategory | Engineering, Electrical & Electronic | - |
dc.description.journalRegisteredClass | scie | - |
dc.description.journalRegisteredClass | scopus | - |
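
The abstract describes expressing both CNN layers and polynomial convolution as GEMM operations so that one accelerator can serve both. The Python sketch below illustrates that general idea only and is not the paper's STA design or its factorization technique. It assumes a 1-D convolution lowered with an im2col-style patch matrix and an NTRU-style cyclic convolution (multiplication modulo x^n - 1 with coefficients modulo q) lowered with a circulant matrix. All function names, the tiny inputs, and the modulus are hypothetical and chosen for readability.

```python
def matmul(A, B):
    """Plain GEMM on lists of lists: (m x k) times (k x p) gives (m x p)."""
    inner, cols = len(B), len(B[0])
    return [[sum(row[t] * B[t][j] for t in range(inner)) for j in range(cols)]
            for row in A]


def im2col_1d(x, k):
    """Stack every length-k sliding window of x as one matrix row."""
    return [x[i:i + k] for i in range(len(x) - k + 1)]


def conv1d_gemm(x, w):
    """1-D convolution in the CNN sense (no kernel flip), computed as a GEMM."""
    patches = im2col_1d(x, len(w))
    product = matmul(patches, [[v] for v in w])
    return [row[0] for row in product]


def circulant(a):
    """n x n matrix M with M[k][j] = a[(k - j) mod n]."""
    n = len(a)
    return [[a[(k - j) % n] for j in range(n)] for k in range(n)]


def cyclic_convolution_gemm(a, b, q):
    """Coefficients of a(x) * b(x) mod (x^n - 1, q), computed as a GEMM."""
    product = matmul(circulant(a), [[v] for v in b])
    return [row[0] % q for row in product]


def cyclic_convolution_direct(a, b, q):
    """Schoolbook reference used to cross-check the GEMM formulation."""
    n = len(a)
    c = [0] * n
    for i in range(n):
        for j in range(n):
            c[(i + j) % n] = (c[(i + j) % n] + a[i] * b[j]) % q
    return c
```

Both `conv1d_gemm` and `cyclic_convolution_gemm` reduce to the same `matmul` call, which is the property that would let a single matrix-multiplication engine be shared between the two workloads. A real accelerator would tile these matrices rather than materialize them in full.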