Semantic-Aware Dynamic Parameter for Video Inpainting Transformer
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Lee, Eunhye | - |
| dc.contributor.author | Yoo, Jinsu | - |
| dc.contributor.author | Yang, Yunjeong | - |
| dc.contributor.author | Baik, Sungyong | - |
| dc.contributor.author | Kim, Tae Hyun | - |
| dc.date.accessioned | 2024-11-28T14:31:37Z | - |
| dc.date.available | 2024-11-28T14:31:37Z | - |
| dc.date.issued | 2023-10 | - |
| dc.identifier.issn | 1550-5499 | - |
| dc.identifier.issn | 2380-7504 | - |
| dc.identifier.uri | https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/196990 | - |
| dc.description.abstract | Recent learning-based video inpainting approaches have achieved considerable progress. However, they still cannot fully utilize the semantic information within video frames, and they predict improper scene layouts, failing to restore clear object boundaries in mixed scenes. To mitigate this problem, we introduce a new transformer-based video inpainting technique that exploits semantic information within the input and considerably improves reconstruction quality. In this study, we use the mixture-of-experts scheme and train multiple experts to handle mixed scenes that include various semantics. We leverage these multiple experts to produce locally (token-wise) different network parameters and thereby achieve semantic-aware inpainting results. Extensive experiments on the YouTube-VOS and DAVIS benchmark datasets demonstrate that, compared with existing video inpainting approaches, the proposed method performs better at synthesizing visually pleasing videos with much clearer semantic structures and textures. | - |
| dc.format.extent | 10 | - |
| dc.language | English | - |
| dc.language.iso | ENG | - |
| dc.publisher | Institute of Electrical and Electronics Engineers Inc. | - |
| dc.title | Semantic-Aware Dynamic Parameter for Video Inpainting Transformer | - |
| dc.type | Article | - |
| dc.identifier.doi | 10.1109/ICCV51070.2023.01190 | - |
| dc.identifier.scopusid | 2-s2.0-85185870970 | - |
| dc.identifier.wosid | 001169499005036 | - |
| dc.identifier.bibliographicCitation | Proceedings of the IEEE International Conference on Computer Vision, pp 12903 - 12912 | - |
| dc.citation.title | Proceedings of the IEEE International Conference on Computer Vision | - |
| dc.citation.startPage | 12903 | - |
| dc.citation.endPage | 12912 | - |
| dc.type.docType | Proceedings Paper | - |
| dc.description.isOpenAccess | N | - |
| dc.description.journalRegisteredClass | scie | - |
| dc.description.journalRegisteredClass | scopus | - |
| dc.relation.journalResearchArea | Computer Science | - |
| dc.relation.journalResearchArea | Imaging Science & Photographic Technology | - |
| dc.relation.journalWebOfScienceCategory | Computer Science, Artificial Intelligence | - |
| dc.relation.journalWebOfScienceCategory | Computer Science, Theory & Methods | - |
| dc.relation.journalWebOfScienceCategory | Imaging Science & Photographic Technology | - |
| dc.identifier.url | https://ieeexplore.ieee.org/document/10378200 | - |
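
The abstract describes producing token-wise different network parameters from multiple experts. The following is a minimal sketch of that general idea only, not the authors' architecture: it assumes a softmax router and a single linear layer whose weight matrix is blended per token from the expert weights. The names `token_wise_dynamic_linear`, `router`, and `expert_weights` are hypothetical and do not come from the paper.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def matvec(matrix, vector):
    """Multiply a (rows x cols) matrix by a vector of length cols."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


def token_wise_dynamic_linear(tokens, expert_weights, router):
    """Apply a linear layer whose weights differ for every token.

    tokens:         list of input vectors, each of length d_in
    expert_weights: list of E matrices, each d_out x d_in (one per expert)
    router:         E x d_in matrix mapping a token to E gating logits
                    (an assumed, illustrative gating scheme)
    Returns (outputs, gates): per-token outputs and per-token gate weights.
    """
    d_out = len(expert_weights[0])
    outputs, gates = [], []
    for token in tokens:
        # Gate weights for this token: one non-negative weight per expert.
        gate = softmax(matvec(router, token))
        # Blend expert matrices into a single token-specific matrix.
        blended = [
            [sum(g * w[i][j] for g, w in zip(gate, expert_weights))
             for j in range(len(token))]
            for i in range(d_out)
        ]
        outputs.append(matvec(blended, token))
        gates.append(gate)
    return outputs, gates
```

Calling `token_wise_dynamic_linear(tokens, expert_weights, router)` returns one output vector and one gate vector per token; tokens that the router scores differently are processed with different effective weights, which is the token-wise behavior the abstract refers to.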
