Semantic-Aware Dynamic Parameter for Video Inpainting Transformer
- Authors
- Lee, Eunhye; Yoo, Jinsu; Yang, Yunjeong; Baik, Sungyong; Kim, Tae Hyun
- Issue Date
- Oct-2023
- Publisher
- Institute of Electrical and Electronics Engineers Inc.
- Citation
- Proceedings of the IEEE International Conference on Computer Vision, pp. 12903-12912
- Pages
- 10
- Indexed
- SCIE
- SCOPUS
- Journal Title
- Proceedings of the IEEE International Conference on Computer Vision
- Start Page
- 12903
- End Page
- 12912
- URI
- https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/196990
- DOI
- 10.1109/ICCV51070.2023.01190
- ISSN
- 1550-5499
- 2380-7504
- Abstract
- Recent learning-based video inpainting approaches have achieved considerable progress. However, they still cannot fully utilize the semantic information within video frames, and they predict improper scene layouts, failing to restore clear object boundaries in mixed scenes. To mitigate this problem, we introduce a new transformer-based video inpainting technique that exploits the semantic information within the input and considerably improves reconstruction quality. In this study, we use the mixture-of-experts scheme and train multiple experts to handle mixed scenes that contain various semantics. We leverage these experts to produce locally (token-wise) different network parameters and thereby achieve semantic-aware inpainting results. Extensive experiments on the YouTube-VOS and DAVIS benchmark datasets demonstrate that, compared with existing video inpainting approaches, the proposed method is superior at synthesizing visually pleasing videos with much clearer semantic structures and textures.
- Appears in Collections
- Seoul College of Engineering > Seoul Division of Computer Software > 1. Journal Articles
- Seoul College of Engineering > ETC > 1. Journal Articles
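
The abstract describes producing locally (token-wise) different network parameters from multiple experts. The following is a minimal sketch of that general idea, not the authors' architecture: it assumes dense softmax gating over a small set of expert weight matrices, and every name here (`softmax`, `matvec`, `mix_experts`, `tokenwise_dynamic_layer`, the `router` matrix) is hypothetical and chosen only for illustration.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def matvec(matrix, vector):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


def mix_experts(experts, gates):
    """Blend expert weight matrices into one matrix using gate weights."""
    rows, cols = len(experts[0]), len(experts[0][0])
    return [
        [sum(g * e[r][c] for g, e in zip(gates, experts)) for c in range(cols)]
        for r in range(rows)
    ]


def tokenwise_dynamic_layer(tokens, experts, router):
    """Apply a linear layer whose weights are re-mixed for every token.

    Assumption: gates come from a softmax over router scores, so each
    token receives its own convex combination of the expert parameters.
    """
    outputs, all_gates = [], []
    for token in tokens:
        gates = softmax(matvec(router, token))   # one gate per expert
        weights = mix_experts(experts, gates)    # token-specific parameters
        outputs.append(matvec(weights, token))
        all_gates.append(gates)
    return outputs, all_gates


if __name__ == "__main__":
    # Two toy experts: identity and 2x scaling (illustrative values only).
    experts = [
        [[1.0, 0.0], [0.0, 1.0]],
        [[2.0, 0.0], [0.0, 2.0]],
    ]
    # Toy router: the first feature favours expert 0, the second expert 1.
    router = [[4.0, 0.0], [0.0, 4.0]]
    tokens = [[1.0, 0.0], [0.0, 1.0]]
    outputs, gates = tokenwise_dynamic_layer(tokens, experts, router)
    for token, gate, out in zip(tokens, gates, outputs):
        print(token, [round(g, 3) for g in gate], [round(o, 3) for o in out])
```

In this toy setup the two tokens are routed toward different experts, so the same layer applies different effective weights at different token positions. A real model would learn the router and experts jointly, and the gating would typically depend on features that carry semantic information.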
