Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

LiveCap: Live Video Captioning with Sequential Encoding Network

Authors
Choi, WangyuYoon, Jungwon
Issue Date
Oct-2022
Publisher
IEEE Computer Society
Citation
International Conference on ICT Convergence, v.2022-October, pp.1894 - 1896
Indexed
SCOPUS
Journal Title
International Conference on ICT Convergence
Volume
2022-October
Start Page
1894
End Page
1896
URI
https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/182239
DOI
10.1109/ICTC55196.2022.9952747
ISSN
2162-1233
Abstract
Today, video captioning frameworks are very useful in places such as video surveillance systems. Most of these systems require real-time captioning, however existing video captioning frameworks still have some limitations in live video. Specifically, they require the whole video to describe. In this paper, we propose LiveCap, a framework for generating sentences corresponding to the current scene in real time from live video. LiveCap consists of three modules: sequential encoding network, captioning network, and context gating network. Our framework accumulates context for sequentially given video segments (sequential encoding network) and generates sentences based on it (captioning network). Furthermore, the context gating network controls the flow between the two networks to determine when to generate sentences. We train and test LiveCap on the ActivityNet Captions dataset and verify that LiveCap generates fluent and coherent captions in live video.
Files in This Item
Go to Link
Appears in
Collections
서울 산업융합학부 > 서울 산업융합학부 > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Yoon, Jungwon photo

Yoon, Jungwon
SCHOOL OF INDUSTRIAL INFORMATION STUDIES (DIVISION OF INDUSTRIAL INFORMATION STUDIES)
Read more

Altmetrics

Total Views & Downloads

BROWSE