Convolutional Method for Modeling Video Temporal Context Effectively in Transformer

Authors
Park, Hae Sung; Choi, Yong Suk
Issue Date
Mar-2023
Publisher
ASSOC COMPUTING MACHINERY
Keywords
Video classification; Transformer; 3D convolution; Self-attention; Temporal feature; Computer Vision
Citation
38TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, SAC 2023, pp 1205 - 1208
Pages
4
Indexed
SCIE
SCOPUS
Journal Title
38TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, SAC 2023
Start Page
1205
End Page
1208
URI
https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/196713
DOI
10.1145/3555776.3578481
Abstract
Video understanding remains a challenging task because video understanding models have many parameters to train and must capture detailed spatiotemporal context in video effectively. Recent methods have typically employed either 3D convolution modules or self-attention modules. However, we find that when the self-attention mechanism captures temporal semantics, it often struggles to identify the proper temporal context for video understanding. In this paper, we propose a new method that enhances temporal modeling by incorporating 3D convolution modules into an attention-based model, the Transformer. In particular, we replace the temporal attention of the TimeSformer with a 3D convolution module to improve temporal context learning. In contrast to the TimeSformer, our proposed method focuses on modeling temporal details at the low-level encoders and gradually shifts to more global temporal context at the high-level encoders. Our method surpasses the TimeSformer by a 2.2% margin on Something-Something v2, a dataset that requires complex temporal modeling to achieve high performance.
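The local-to-global behaviour described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: it simplifies the 3D convolution module to a convolution along the frame axis only, uses plain Python lists instead of a tensor library, and the names `temporal_conv`, `receptive_field`, and `active_frames`, the kernel size of 3, and the 8-frame clip are all assumptions chosen for illustration. It shows one plausible reason stacked convolutions see short temporal ranges in early encoders and wider ranges in later ones.

```python
from typing import List

# Hypothetical layout for illustration: tokens[frame][patch][channel]
Tokens = List[List[List[float]]]


def temporal_conv(tokens: Tokens, kernel: List[float]) -> Tokens:
    """Mix each patch token with the same patch in neighbouring frames.

    Zero padding keeps the number of frames unchanged. The kernel is shared
    across patches and channels for simplicity (an assumption, not the
    paper's design), and is applied as a cross-correlation, as is usual in
    deep learning libraries.
    """
    if len(kernel) % 2 == 0:
        raise ValueError("kernel length must be odd")
    num_frames = len(tokens)
    half = len(kernel) // 2
    out: Tokens = []
    for t in range(num_frames):
        frame = []
        for n in range(len(tokens[t])):
            token = [0.0] * len(tokens[t][n])
            for offset, weight in enumerate(kernel):
                src = t + offset - half
                if 0 <= src < num_frames:  # frames outside the clip act as zeros
                    for c, value in enumerate(tokens[src][n]):
                        token[c] += weight * value
            frame.append(token)
        out.append(frame)
    return out


def receptive_field(num_layers: int, kernel_size: int) -> int:
    """Frames visible to one output token after stacking stride-1 layers."""
    return 1 + num_layers * (kernel_size - 1)


def active_frames(tokens: Tokens) -> List[int]:
    """Indices of frames that contain any non-zero value."""
    return [t for t, frame in enumerate(tokens) if any(any(tok) for tok in frame)]


if __name__ == "__main__":
    # 8 frames, 1 patch, 1 channel; only frame 3 carries a signal.
    x = [[[1.0 if t == 3 else 0.0]] for t in range(8)]
    for depth in range(1, 4):
        x = temporal_conv(x, [1.0, 1.0, 1.0])
        print(depth, active_frames(x), receptive_field(depth, 3))
```

With these assumed settings, a signal placed in a single frame reaches only its immediate neighbours after one layer and spreads by one more frame in each direction per additional layer, whereas temporal self-attention can relate any pair of frames from the first encoder onward.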
Appears in Collections
Seoul College of Engineering > Seoul School of Computer Science > 1. Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Choi, Yong Suk
COLLEGE OF ENGINEERING (SCHOOL OF COMPUTER SCIENCE)