Semantic video segmentation with dynamic keyframe selection and distortion-aware feature rectification

Awan, Mehwish; Shin, Jitae

Detailed Information

Cited 4 times in Web of Science

Cited 4 times in Scopus



Authors: Awan, Mehwish; Shin, Jitae

Issue Date: Jun-2021

Publisher: ELSEVIER

Keywords: Semantic video segmentation; Feature warping; Distortion-aware feature correction; Policy network; Dynamic keyframe selection scheme; Reinforcement learning; Deep learning

Citation: IMAGE AND VISION COMPUTING, v.110

Journal Title: IMAGE AND VISION COMPUTING

Volume: 110

URI: https://scholarworks.bwise.kr/gachon/handle/2020.sw.gachon/84785

DOI: 10.1016/j.imavis.2021.104184

ISSN: 0262-8856

Abstract: Per-frame segmentation methods have a high computational cost and therefore cannot meet the fast-inference requirements of semantic video segmentation. To reuse extracted features effectively through feature propagation, we present distortion-aware feature rectification and online keyframe selection for fast and accurate video segmentation. The proposed dynamic keyframe scheduling scheme uses reinforcement learning to select keyframes according to the extent of temporal variation. We employ a policy gradient strategy to learn a policy function that maximizes the expected reward. The policy network's action space has two actions (key and non-key). The state is derived from the element-wise difference between the current frame and a warped estimate of the current frame generated by propagating the previous frame. For non-key frames, an adaptive partial feature rectification with distortion-aware corrections is then applied to the warped features. Precise feature propagation is critical for tracking temporal changes in the video sequence, since it strongly affects both the accuracy and the throughput of the whole video analysis framework. Guided by a distortion map, a lightweight feature extractor revises the distorted regions of the feature maps while leaving correctly propagated features unchanged. Feature propagation follows the deep feature flow approach. We evaluate our scheme on the Cityscapes and CamVid datasets with DeepLabv3 as the segmentation network and LiteFlowNet for computing flow fields. Experimental results show that the proposed method significantly outperforms previous state-of-the-art methods in both accuracy and throughput. © 2021 Elsevier B.V. All rights reserved.
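
Illustrative sketch (not from the paper): the NumPy code below is a minimal toy rendering of three ideas named in the abstract, namely the difference-frame state, a two-action key/non-key policy trained with a policy gradient step, and distortion-guided partial rectification. Every function name, the nearest-neighbour warp, the single-logistic "policy", and the linear blend are assumptions made for illustration; the paper's policy network, distortion map, and lightweight feature extractor are learned models that this sketch does not reproduce.

```python
import numpy as np

def warp_nearest(prev, flow):
    """Backward-warp prev (H, W, C) with flow (H, W, 2) holding (dx, dy) offsets.
    Nearest-neighbour sampling with border clamping (an assumption)."""
    h, w = prev.shape[:2]
    ys, xs = np.mgrid[0:h, 0:w]
    src_x = np.clip(np.rint(xs + flow[..., 0]).astype(int), 0, w - 1)
    src_y = np.clip(np.rint(ys + flow[..., 1]).astype(int), 0, h - 1)
    return prev[src_y, src_x]

def difference_state(cur, prev, flow):
    """Element-wise absolute difference between the current frame and the
    previous frame warped toward it."""
    return np.abs(cur - warp_nearest(prev, flow))

def key_probability(x, weight, bias):
    """Hypothetical stand-in for the policy network: a logistic unit on a
    scalar summary x of the state. Returns P(action = key)."""
    return 1.0 / (1.0 + np.exp(-(weight * x + bias)))

def reinforce_step(weight, bias, x, action, reward, lr):
    """One REINFORCE update for the Bernoulli policy above.
    For a logistic policy, d log pi(action) / d logit = action - p."""
    p = key_probability(x, weight, bias)
    grad_logit = action - p
    return weight + lr * reward * grad_logit * x, bias + lr * reward * grad_logit

def rectify(warped_feat, corrected_feat, distortion_map):
    """Keep warped features where distortion is low (map near 0) and use
    corrected features where it is high (map near 1). Linear blend is assumed."""
    m = distortion_map[..., None]
    return (1.0 - m) * warped_feat + m * corrected_feat

# Toy usage: a static scene gives a zero state, so the key probability stays low.
prev = np.arange(16, dtype=float).reshape(4, 4, 1) / 16.0
zero_flow = np.zeros((4, 4, 2))
state = difference_state(prev, prev, zero_flow)
p_key = key_probability(state.mean(), weight=10.0, bias=-2.0)
is_key = p_key > 0.5  # greedy decision at inference time (assumed)
```

In this toy setup a frame is treated as a keyframe only when the warped previous frame fails to explain the current one; a learned policy would replace the hand-set `weight` and `bias`.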

Files in This Item: There are no files associated with this item.

Appears in Collections: ETC > 1. Journal Articles




Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.
