Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

Layer-wise Pruning of Transformer Attention Heads for Efficient Language Modeling

Full metadata record
DC Field Value Language
dc.contributor.authorShim, Kyuhong-
dc.contributor.authorChoi, Iksoo-
dc.contributor.authorSung, Wonyong-
dc.contributor.authorChoi, Jung wook-
dc.date.accessioned2022-07-06T11:33:21Z-
dc.date.available2022-07-06T11:33:21Z-
dc.date.created2022-03-07-
dc.date.issued2021-11-
dc.identifier.issn2163-9612-
dc.identifier.urihttps://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/140374-
dc.description.abstractRecently, the necessity of multiple attention heads in transformer architecture has been questioned [1]. Removing less important heads from a large network is a promising strategy to reduce computation cost and parameters. However, pruning out attention heads in multihead attention does not evenly reduce the overall load, because feedforward modules are not affected. In this study, we apply attention head pruning on All-Attention [2] transformer, where savings in the computation are proportional to the number of pruned heads. This improved computing efficiency comes at the cost of pruning sensitivity, which we stabilize with three training techniques. Our attention head pruning enables a considerably fewer number of parameters with a comparable perplexity for transformer-based language modeling.-
dc.language영어-
dc.language.isoen-
dc.publisherIEEE-
dc.titleLayer-wise Pruning of Transformer Attention Heads for Efficient Language Modeling-
dc.typeArticle-
dc.contributor.affiliatedAuthorChoi, Jung wook-
dc.identifier.doi10.1109/ISOCC53507.2021.9613933-
dc.identifier.scopusid2-s2.0-85123372999-
dc.identifier.wosid000861550500152-
dc.identifier.bibliographicCitationProceedings - International SoC Design Conference 2021, ISOCC 2021, pp.357 - 358-
dc.relation.isPartOfProceedings - International SoC Design Conference 2021, ISOCC 2021-
dc.citation.titleProceedings - International SoC Design Conference 2021, ISOCC 2021-
dc.citation.startPage357-
dc.citation.endPage358-
dc.type.rimsART-
dc.type.docTypeProceedings Paper-
dc.description.journalClass1-
dc.description.isOpenAccessN-
dc.description.journalRegisteredClassscopus-
dc.relation.journalResearchAreaComputer Science-
dc.relation.journalResearchAreaEngineering-
dc.relation.journalWebOfScienceCategoryComputer Science, Hardware & Architecture-
dc.relation.journalWebOfScienceCategoryEngineering, Electrical & Electronic-
dc.subject.keywordPlusComputational linguistics-
dc.subject.keywordPlusComputation costs-
dc.subject.keywordPlusComputing efficiency-
dc.subject.keywordPlusFeed forward-
dc.subject.keywordPlusLanguage model-
dc.subject.keywordPlusLarger networks-
dc.subject.keywordPlusLayer-wise-
dc.subject.keywordPlusMultihead-
dc.subject.keywordPlusMultihead attention-
dc.subject.keywordPlusPruning-
dc.subject.keywordPlusTransformer-
dc.subject.keywordPlusModeling languages-
dc.subject.keywordAuthormultihead attention-
dc.subject.keywordAuthorpruning-
dc.subject.keywordAuthortransformer-
dc.identifier.urlhttps://ieeexplore.ieee.org/document/9613933-
Files in This Item
Go to Link
Appears in
Collections
서울 공과대학 > 서울 융합전자공학부 > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Choi, Jung wook photo

Choi, Jung wook
COLLEGE OF ENGINEERING (SCHOOL OF ELECTRONIC ENGINEERING)
Read more

Altmetrics

Total Views & Downloads

BROWSE