LiveCap: Live Video Captioning with Sequential Encoding Network
- Authors
- Choi, Wangyu; Yoon, Jungwon
- Issue Date
- Oct-2022
- Publisher
- IEEE Computer Society
- Citation
- International Conference on ICT Convergence, v.2022-October, pp 1894 - 1896
- Pages
- 3
- Indexed
- SCOPUS
- Journal Title
- International Conference on ICT Convergence
- Volume
- 2022-October
- Start Page
- 1894
- End Page
- 1896
- URI
- https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/182239
- DOI
- 10.1109/ICTC55196.2022.9952747
- ISSN
- 2162-1233
2162-1241
- Abstract
- Today, video captioning frameworks are very useful in places such as video surveillance systems. Most of these systems require real-time captioning, however existing video captioning frameworks still have some limitations in live video. Specifically, they require the whole video to describe. In this paper, we propose LiveCap, a framework for generating sentences corresponding to the current scene in real time from live video. LiveCap consists of three modules: sequential encoding network, captioning network, and context gating network. Our framework accumulates context for sequentially given video segments (sequential encoding network) and generates sentences based on it (captioning network). Furthermore, the context gating network controls the flow between the two networks to determine when to generate sentences. We train and test LiveCap on the ActivityNet Captions dataset and verify that LiveCap generates fluent and coherent captions in live video.
- Files in This Item
-
Go to Link
- Appears in
Collections - 서울 산업융합학부 > 서울 산업융합학부 > 1. Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.