Detailed Information

Cited 0 time in webofscience Cited 1 time in scopus
Metadata Downloads

Weakly supervised multi-class semantic video segmentation for road scenes

Full metadata record
DC Field Value Language
dc.contributor.authorAwan, Mehwish-
dc.contributor.authorShin, Jitae-
dc.date.accessioned2023-05-17T00:41:48Z-
dc.date.available2023-05-17T00:41:48Z-
dc.date.created2023-05-15-
dc.date.issued2023-04-
dc.identifier.issn1077-3142-
dc.identifier.urihttps://scholarworks.bwise.kr/gachon/handle/2020.sw.gachon/87792-
dc.description.abstractWeakly supervised multi-class video segmentation is one of the most challenging yet least studied research problems in computer vision. This study aims to investigate two main items: (1) effective feature update for temporal changes combined with feature reuse between temporal frames; and (2) learn object patterns in complex scenes specifically for videos under weak supervision. Associating image tags to visual appearance is not a straightforward learning task, especially for complex scenes. Therefore, in this paper, we present manifold augmentations to obtain reliable pixel labels from image tags. We propose a framework comprised of two key modules: a temporal split module for efficient video processing and a pseudo per-pixel seed generation module for precise pixel-level supervision. Particularly, in our model, we utilize and explore temporal correlations via temporal split module and temporal attention. To reuse the extracted features and incorporate temporal updates for precise and fast computation, a channel-wise temporal split mechanism between successive video frames is presented. Furthermore, we evaluated proposed modules in two additional settings: (1) fully or sparsely supervised road scene video segmentation; and (2) weakly supervised segmentation for complex road scene images. Experiments are conducted on the Cityscapes and CamVid datasets, using DeepLabv3 as segmentation network and LiteFlowNet to compute motion vectors.-
dc.language영어-
dc.language.isoen-
dc.publisherACADEMIC PRESS INC ELSEVIER SCIENCE-
dc.relation.isPartOfCOMPUTER VISION AND IMAGE UNDERSTANDING-
dc.titleWeakly supervised multi-class semantic video segmentation for road scenes-
dc.typeArticle-
dc.type.rimsART-
dc.description.journalClass1-
dc.identifier.wosid000972603700001-
dc.identifier.doi10.1016/j.cviu.2023.103664-
dc.identifier.bibliographicCitationCOMPUTER VISION AND IMAGE UNDERSTANDING, v.230-
dc.description.isOpenAccessN-
dc.identifier.scopusid2-s2.0-85149734166-
dc.citation.titleCOMPUTER VISION AND IMAGE UNDERSTANDING-
dc.citation.volume230-
dc.contributor.affiliatedAuthorAwan, Mehwish-
dc.type.docTypeArticle-
dc.subject.keywordAuthorSemantic video segmentation-
dc.subject.keywordAuthorWeakly supervised learning-
dc.subject.keywordAuthorWeakly supervised semantic segmentation-
dc.subject.keywordAuthorJoint classification and segmentation learning-
dc.relation.journalResearchAreaComputer Science-
dc.relation.journalResearchAreaEngineering-
dc.relation.journalWebOfScienceCategoryComputer Science, Artificial Intelligence-
dc.relation.journalWebOfScienceCategoryEngineering, Electrical & Electronic-
dc.description.journalRegisteredClassscie-
dc.description.journalRegisteredClassscopus-
Files in This Item
There are no files associated with this item.
Appears in
Collections
ETC > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher ,  photo

,
College of IT Convergence (컴퓨터공학부(컴퓨터공학전공))
Read more

Altmetrics

Total Views & Downloads

BROWSE