BaSSL: Boundary-aware Self-Supervised Learning for Video Scene Segmentation

Mun, Jonghwan; Shin, Minchul; Han, Gunsoo; Lee, Sangho; Ha, Seongsu; Lee, Joonseok; Kim, Eun-Sol

doi:10.1007/978-3-031-26316-3_29

Full metadata record

DC Field	Value	Language
dc.contributor.author	Mun, Jonghwan	-
dc.contributor.author	Shin, Minchul	-
dc.contributor.author	Han, Gunsoo	-
dc.contributor.author	Lee, Sangho	-
dc.contributor.author	Ha, Seongsu	-
dc.contributor.author	Lee, Joonseok	-
dc.contributor.author	Kim, Eun-Sol	-
dc.date.accessioned	2023-06-01T07:18:21Z	-
dc.date.available	2023-06-01T07:18:21Z	-
dc.date.created	2023-05-03	-
dc.date.issued	2023-03	-
dc.identifier.issn	0302-9743	-
dc.identifier.uri	https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/186008	-
dc.description.abstract	Self-supervised learning has drawn attention through its effectiveness in learning in-domain representations with no ground-truth annotations; in particular, it has been shown that properly designed pretext tasks bring significant performance gains for downstream tasks. Inspired by this, we tackle video scene segmentation, the task of temporally localizing scene boundaries in a long video, with a self-supervised learning framework in which we mainly focus on designing effective pretext tasks. In our framework, given a long video, we adopt a sliding window scheme; from a sequence of shots in each window, we discover the moment with the maximum semantic transition and leverage it as a pseudo-boundary to facilitate pre-training. Specifically, we introduce three novel boundary-aware pretext tasks: 1) Shot-Scene Matching (SSM), 2) Contextual Group Matching (CGM) and 3) Pseudo-boundary Prediction (PP); SSM and CGM guide the model to maximize intra-scene similarity and inter-scene discrimination by capturing contextual relations between shots, while PP encourages the model to identify transitional moments. We perform an extensive analysis to validate the effectiveness of our method and achieve a new state of the art on the MovieNet-SSeg benchmark. The code is available at https://github.com/kakaobrain/bassl.	-
dc.language	English	-
dc.language.iso	en	-
dc.publisher	Springer Science and Business Media Deutschland GmbH	-
dc.title	BaSSL: Boundary-aware Self-Supervised Learning for Video Scene Segmentation	-
dc.type	Article	-
dc.contributor.affiliatedAuthor	Kim, Eun-Sol	-
dc.identifier.doi	10.1007/978-3-031-26316-3_29	-
dc.identifier.scopusid	2-s2.0-85151062504	-
dc.identifier.wosid	001000822000029	-
dc.identifier.bibliographicCitation	Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), v.13844 LNCS, pp.485 - 501	-
dc.relation.isPartOf	Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)	-
dc.citation.title	Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)	-
dc.citation.volume	13844 LNCS	-
dc.citation.startPage	485	-
dc.citation.endPage	501	-
dc.type.rims	ART	-
dc.type.docType	Proceedings Paper	-
dc.description.journalClass	1	-
dc.description.isOpenAccess	N	-
dc.description.journalRegisteredClass	scopus	-
dc.relation.journalResearchArea	Computer Science	-
dc.relation.journalWebOfScienceCategory	Computer Science, Artificial Intelligence	-
dc.relation.journalWebOfScienceCategory	Computer Science, Theory & Methods	-
dc.subject.keywordPlus	Computer vision	-
dc.subject.keywordPlus	Semantic Segmentation	-
dc.subject.keywordPlus	Semantics	-
dc.subject.keywordPlus	Domain representations	-
dc.subject.keywordPlus	Down-stream	-
dc.subject.keywordPlus	Ground truth	-
dc.subject.keywordPlus	Group matching	-
dc.subject.keywordPlus	Learning frameworks	-
dc.subject.keywordPlus	Performance Gain	-
dc.subject.keywordPlus	Scene matching	-
dc.subject.keywordPlus	Self-supervised learning	-
dc.subject.keywordPlus	Sliding window schemes	-
dc.subject.keywordPlus	Video scene segmentation	-
dc.subject.keywordPlus	Supervised learning	-
dc.subject.keywordAuthor	Self-supervised learning	-
dc.subject.keywordAuthor	Video scene segmentation	-
dc.identifier.url	https://link.springer.com/chapter/10.1007/978-3-031-26316-3_29	-
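
The abstract describes sliding a window over a video's shots and, within each window, taking the moment of maximum semantic transition as a pseudo-boundary. The record does not say how that transition is measured, so the following is only a minimal sketch under stated assumptions: each shot is already represented by an embedding vector, and the transition is scored as the cosine similarity between the mean embedding of the shots before a candidate split and the mean embedding of the shots after it. The function names (`find_pseudo_boundary`, `sliding_windows`) and the scoring rule are hypothetical illustrations, not taken from the authors' code at https://github.com/kakaobrain/bassl.

```python
import math


def mean_vector(vectors):
    """Element-wise mean of a non-empty list of equal-length vectors."""
    n = len(vectors)
    return [sum(column) / n for column in zip(*vectors)]


def cosine_similarity(a, b):
    """Cosine similarity of two vectors; 0.0 if either has zero norm."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def find_pseudo_boundary(shot_embeddings):
    """Return the split index b (1 <= b < n) at which shots[:b] and shots[b:]
    are least similar. ASSUMPTION: mean-embedding cosine similarity stands in
    for the "semantic transition" mentioned in the abstract."""
    if len(shot_embeddings) < 2:
        raise ValueError("need at least two shots in a window")
    best_split, best_similarity = 1, float("inf")
    for split in range(1, len(shot_embeddings)):
        left = mean_vector(shot_embeddings[:split])
        right = mean_vector(shot_embeddings[split:])
        similarity = cosine_similarity(left, right)
        if similarity < best_similarity:
            best_split, best_similarity = split, similarity
    return best_split


def sliding_windows(shots, window_size, stride=1):
    """Yield (start_index, window) pairs covering the shot sequence."""
    for start in range(0, len(shots) - window_size + 1, stride):
        yield start, shots[start:start + window_size]


# Toy window: two shots pointing one way, then two pointing another.
window = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9]]
boundary = find_pseudo_boundary(window)
```

In this toy window the least similar split falls between the second and third shots, so `boundary` is 2. A pre-training loop along these lines would call `find_pseudo_boundary` on each window produced by `sliding_windows` and use the returned index as the pseudo-label.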

Appears in Collections: Seoul College of Engineering > Seoul School of Computer Science > 1. Journal Articles
