Detailed Information

BaSSL: Boundary-aware Self-Supervised Learning for Video Scene Segmentation

Full metadata record
DC Field | Value | Language
dc.contributor.author | Mun, Jonghwan | -
dc.contributor.author | Shin, Minchul | -
dc.contributor.author | Han, Gunsoo | -
dc.contributor.author | Lee, Sangho | -
dc.contributor.author | Ha, Seongsu | -
dc.contributor.author | Lee, Joonseok | -
dc.contributor.author | Kim, Eun-Sol | -
dc.date.accessioned | 2023-06-01T07:18:21Z | -
dc.date.available | 2023-06-01T07:18:21Z | -
dc.date.created | 2023-05-03 | -
dc.date.issued | 2023-03 | -
dc.identifier.issn | 0302-9743 | -
dc.identifier.uri | https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/186008 | -
dc.description.abstract | Self-supervised learning has drawn attention through its effectiveness in learning in-domain representations with no ground-truth annotations; in particular, it is shown that properly designed pretext tasks bring significant performance gains for downstream tasks. Inspired by this, we tackle video scene segmentation, which is the task of temporally localizing scene boundaries in a long video, with a self-supervised learning framework where we mainly focus on designing effective pretext tasks. In our framework, given a long video, we adopt a sliding window scheme; from a sequence of shots in each window, we discover a moment with a maximum semantic transition and leverage it as a pseudo-boundary to facilitate the pre-training. Specifically, we introduce three novel boundary-aware pretext tasks: 1) Shot-Scene Matching (SSM), 2) Contextual Group Matching (CGM) and 3) Pseudo-boundary Prediction (PP); SSM and CGM guide the model to maximize intra-scene similarity and inter-scene discrimination by capturing contextual relations between shots, while PP encourages the model to identify transitional moments. We perform an extensive analysis to validate the effectiveness of our method and achieve the new state-of-the-art on the MovieNet-SSeg benchmark. The code is available at https://github.com/kakaobrain/bassl. | -
dc.language | English | -
dc.language.iso | en | -
dc.publisher | Springer Science and Business Media Deutschland GmbH | -
dc.title | BaSSL: Boundary-aware Self-Supervised Learning for Video Scene Segmentation | -
dc.type | Article | -
dc.contributor.affiliatedAuthor | Kim, Eun-Sol | -
dc.identifier.doi | 10.1007/978-3-031-26316-3_29 | -
dc.identifier.scopusid | 2-s2.0-85151062504 | -
dc.identifier.wosid | 001000822000029 | -
dc.identifier.bibliographicCitation | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), v.13844 LNCS, pp.485 - 501 | -
dc.relation.isPartOf | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) | -
dc.citation.title | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) | -
dc.citation.volume | 13844 LNCS | -
dc.citation.startPage | 485 | -
dc.citation.endPage | 501 | -
dc.type.rims | ART | -
dc.type.docType | Proceedings Paper | -
dc.description.journalClass | 1 | -
dc.description.isOpenAccess | N | -
dc.description.journalRegisteredClass | scopus | -
dc.relation.journalResearchArea | Computer Science | -
dc.relation.journalWebOfScienceCategory | Computer Science, Artificial Intelligence | -
dc.relation.journalWebOfScienceCategory | Computer Science, Theory & Methods | -
dc.subject.keywordPlus | Computer vision | -
dc.subject.keywordPlus | Semantic Segmentation | -
dc.subject.keywordPlus | Semantics | -
dc.subject.keywordPlus | Domain representations | -
dc.subject.keywordPlus | Down-stream | -
dc.subject.keywordPlus | Ground truth | -
dc.subject.keywordPlus | Group matching | -
dc.subject.keywordPlus | Learning frameworks | -
dc.subject.keywordPlus | Performance Gain | -
dc.subject.keywordPlus | Scene matching | -
dc.subject.keywordPlus | Self-supervised learning | -
dc.subject.keywordPlus | Sliding window schemes | -
dc.subject.keywordPlus | Video scene segmentation | -
dc.subject.keywordPlus | Supervised learning | -
dc.subject.keywordAuthor | Self-supervised learning | -
dc.subject.keywordAuthor | Video scene segmentation | -
dc.identifier.url | https://link.springer.com/chapter/10.1007/978-3-031-26316-3_29 | -
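
The abstract describes a sliding window over shots and, within each window, the discovery of the moment with maximum semantic transition, which is then used as a pseudo-boundary. The record does not say how that moment is computed, so the following is only a minimal sketch under stated assumptions: shots are given as plain embedding vectors, similarity is cosine similarity, and the split is chosen so that shots on the left resemble the first shot and shots on the right resemble the last shot. The function names `cosine`, `sliding_windows`, and `find_pseudo_boundary` are hypothetical and are not taken from the released code at https://github.com/kakaobrain/bassl, which remains the authoritative implementation.

```python
import math
from typing import List, Sequence, Tuple


def cosine(u: Sequence[float], v: Sequence[float]) -> float:
    """Cosine similarity between two equal-length vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm > 0 else 0.0


def sliding_windows(n_shots: int, window: int, stride: int = 1) -> List[Tuple[int, int]]:
    """Half-open (start, end) index ranges of fixed-size windows over a shot sequence."""
    return [(s, s + window) for s in range(0, n_shots - window + 1, stride)]


def find_pseudo_boundary(shots: List[Sequence[float]]) -> int:
    """Return b so that shots[:b + 1] and shots[b + 1:] are the two groups.

    Assumed heuristic (not confirmed by the abstract): score each split by the
    total similarity of left shots to the first shot plus the total similarity
    of right shots to the last shot, and keep the highest-scoring split.
    """
    if len(shots) < 2:
        raise ValueError("a window needs at least two shots")
    to_first = [cosine(s, shots[0]) for s in shots]
    to_last = [cosine(s, shots[-1]) for s in shots]
    best_b, best_score = 0, -math.inf
    for b in range(len(shots) - 1):  # the last shot always stays on the right
        score = sum(to_first[: b + 1]) + sum(to_last[b + 1:])
        if score > best_score:
            best_b, best_score = b, score
    return best_b
```

For a toy window such as `[[1, 0], [0.9, 0.1], [1, 0.05], [0, 1], [0.1, 0.9]]`, `find_pseudo_boundary` returns `2`, placing the pseudo-boundary after the third shot. In a pre-training setup of the kind the abstract outlines, such an index could serve as the target for a pseudo-boundary prediction task, although the exact usage is an assumption here.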
Appears in Collections
Seoul College of Engineering > Seoul School of Computer Science > 1. Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
