Detailed Information


Stacked encoder–decoder transformer with boundary smoothing for action segmentation (open access)

Authors
Kim, G.-H.; Kim, Eunwoo
Issue Date
Dec-2022
Publisher
John Wiley and Sons Inc
Keywords
artificial intelligence; image and vision processing and display technology
Citation
Electronics Letters, v.58, no.25, pp. 972–974
Pages
3
Journal Title
Electronics Letters
Volume
58
Number
25
Start Page
972
End Page
974
URI
https://scholarworks.bwise.kr/cau/handle/2019.sw.cau/61136
DOI
10.1049/ell2.12678
ISSN
0013-5194
1350-911X
Abstract
In this work, a new stacked encoder–decoder transformer (SEDT) model is proposed for action segmentation. SEDT is composed of a series of encoder–decoder modules, each consisting of an encoder with self-attention layers and a decoder with cross-attention layers. By placing an encoder with self-attention before every decoder, the model preserves local information alongside global information. The proposed encoder–decoder pairing also prevents the accumulation of errors that occurs when features are propagated through successive decoders. Moreover, the approach performs boundary smoothing to handle ambiguous action boundaries. Experimental results on two popular benchmark datasets, “GTEA” and “50 Salads”, show that the proposed model outperforms existing temporal convolutional network-based models and the attention-based model ASFormer. © 2022 The Authors. Electronics Letters published by John Wiley & Sons Ltd on behalf of The Institution of Engineering and Technology.
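The boundary smoothing mentioned in the abstract can be illustrated with a short sketch: hard frame-wise action labels are converted into soft targets, with probability mass blended between adjacent classes near each action boundary. This is a minimal illustration only; the function name `smooth_boundaries`, the `width` parameter, and the linear blend are assumptions and may differ from the paper's exact formulation.

```python
import numpy as np

def smooth_boundaries(labels, num_classes, width=2):
    """Turn hard per-frame labels into soft targets by linearly blending
    the probabilities of the two classes within `width` frames of each
    action boundary. A hypothetical sketch, not the paper's exact method."""
    T = len(labels)
    soft = np.zeros((T, num_classes))
    soft[np.arange(T), labels] = 1.0  # one-hot targets away from boundaries
    for t in range(1, T):
        if labels[t] != labels[t - 1]:  # an action boundary between t-1 and t
            prev_cls, next_cls = labels[t - 1], labels[t]
            for k in range(-width, width):
                i = t + k
                if 0 <= i < T:
                    # weight toward the upcoming class grows across the boundary
                    w = (k + width + 0.5) / (2 * width)
                    soft[i] = 0.0
                    soft[i, prev_cls] = 1.0 - w
                    soft[i, next_cls] = w
    return soft

# Example: one boundary between frames 2 and 3 is softened symmetrically.
soft = smooth_boundaries(np.array([0, 0, 0, 1, 1, 1]), num_classes=2, width=1)
```

Soft targets of this kind can then replace one-hot labels in a frame-wise cross-entropy loss, penalizing the model less for predictions that are off by a few frames around an ambiguous boundary.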
Appears in
Collections
College of Software > School of Computer Science and Engineering > 1. Journal Articles


Related Researcher

Kim, Eun Woo
College of Software (School of Software)
