Semantic-Aware Dynamic Parameter for Video Inpainting Transformer

Lee, Eunhye; Yoo, Jinsu; Yang, Yunjeong; Baik, Sungyong; Kim, Tae Hyun

doi:10.1109/ICCV51070.2023.01190

Detailed Information

Cited 0 time in webofscience

Cited 0 time in scopus

Metadata Downloads

Semantic-Aware Dynamic Parameter for Video Inpainting Transformer

Full metadata record

DC Field	Value	Language
dc.contributor.author	Lee, Eunhye	-
dc.contributor.author	Yoo, Jinsu	-
dc.contributor.author	Yang, Yunjeong	-
dc.contributor.author	Baik, Sungyong	-
dc.contributor.author	Kim, Tae Hyun	-
dc.date.accessioned	2024-11-28T14:31:37Z	-
dc.date.available	2024-11-28T14:31:37Z	-
dc.date.issued	2023-10	-
dc.identifier.issn	1550-5499	-
dc.identifier.issn	2380-7504	-
dc.identifier.uri	https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/196990	-
dc.description.abstract	Recent learning-based video inpainting approaches have achieved considerable progress. However, they still cannot fully utilize semantic information within the video frames and predict improper scene layout, failing to restore clear object boundaries for mixed scenes. To mitigate this problem, we introduce a new transformer-based video inpainting technique that can exploit semantic information within the input and considerably improve reconstruction quality. In this study, we use the mixture-of-experts scheme and train multiple experts to handle mixed scenes, including various semantics. We leverage these multiple experts and produce locally (token-wise) different network parameters to achieve semantic-aware inpainting results. Extensive experiments on YouTube-VOS and DAVIS benchmark datasets demonstrate that, compared with existing conventional video inpainting approaches, the proposed method has superior performance in synthesizing visually pleasing videos with much clearer semantic structures and textures.	-
dc.format.extent	10	-
dc.language	영어	-
dc.language.iso	ENG	-
dc.publisher	Institute of Electrical and Electronics Engineers Inc.	-
dc.title	Semantic-Aware Dynamic Parameter for Video Inpainting Transformer	-
dc.type	Article	-
dc.identifier.doi	10.1109/ICCV51070.2023.01190	-
dc.identifier.scopusid	2-s2.0-85185870970	-
dc.identifier.wosid	001169499005036	-
dc.identifier.bibliographicCitation	Proceedings of the IEEE International Conference on Computer Vision, pp 12903 - 12912	-
dc.citation.title	Proceedings of the IEEE International Conference on Computer Vision	-
dc.citation.startPage	12903	-
dc.citation.endPage	12912	-
dc.type.docType	Proceedings Paper	-
dc.description.isOpenAccess	N	-
dc.description.journalRegisteredClass	scie	-
dc.description.journalRegisteredClass	scopus	-
dc.relation.journalResearchArea	Computer Science	-
dc.relation.journalResearchArea	Imaging Science & Photographic Technology	-
dc.relation.journalWebOfScienceCategory	Computer Science, Artificial Intelligence	-
dc.relation.journalWebOfScienceCategory	Computer Science, Theory & Methods	-
dc.relation.journalWebOfScienceCategory	Imaging Science & Photographic Technology	-
dc.identifier.url	https://ieeexplore.ieee.org/document/10378200	-

Files in This Item: Go to Link

Appears in Collections: 서울 공과대학 > 서울 컴퓨터소프트웨어학부 > 1. Journal Articles; 서울 공과대학 > ETC > 1. Journal Articles

Show simple item record

qrcode

Related Researcher

Researcher Baik, Sungyong photo

Baik, Sungyong: COLLEGE OF ENGINEERING (DEPARTMENT OF INTELLIGENCE COMPUTING)

Read more

Altmetrics

Total Views & Downloads

RSS_1.0 RSS_2.0 ATOM_1.0

222, Wangsimni-ro, Seongdong-gu, Seoul, 04763, Korea+82-2-2220-1366

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Related Researcher

Altmetrics

Total Views & Downloads

BROWSE